# robots.txt file created at http://www.tintrongnuoc.net/
# Sat, 15 Apr 2006 04:56:53 -0400
# Exclude Files From All Robots:

User-agent: *
Disallow: /tin tuc viet nam
Disallow: /bao chi viet nam
Disallow: /tin moi trong ngay
Disallow: /thoi su
Disallow: /tin the gioi
Disallow: /tin the thao
Disallow: /tin quoc te
Disallow: /tin giao duc
Disallow: /tin van hoa
Disallow: /tin tuc thoi so
Disallow: /tin phap luathinh su
Disallow: /tin tuc tin tuc

# End robots.txt file

#
# robots.txt for http://www.tintrongnuoc.net/tintuc-vietnam.aspx
#
# $Id: robots.txt,v 1.41 2006/02/22 16:31:58 ted Exp $
#

# For use by search.tintrongnuoc.net
User-agent: W3C-gsa
Disallow: /Out-Of-Date

# W3C Link checker
User-agent: W3C-checklink
Disallow:

# exclude some access-controlled areas
User-agent: *
Disallow: /2004/ontaria/basic
Disallow: /Team
Disallow: /chatroom
Disallow: /Systems
Disallow: /Web
Disallow: /History
Disallow: /Out-Of-Date
Disallow: /2002/02/mid
Disallow: /mid/
Disallow: /People/all/
Disallow: /RDF/Validator/ARPServlet
Disallow: /2003/03/Translations/byLanguage
Disallow: /2003/03/Translations/byTechnology
Disallow: /2005/11/Translations/Query
Disallow: /2003/glossary/subglossary/
#Disallow: /2005/06/blog/
#Disallow: /2001/07/pubrules-checker
#shouldnt get transparent proxies but will ml links of things like pubrules
Disallow: /2000/06/webdata/xslt
Disallow: /2000/09/webdata/xslt
Disallow: /Bugs/