Robots.txt Examples

by dan on November 4, 2008

Fend off those pesky robots!

I just released a new project that resulted from having to specifically block useless bots that requests thousands of pages per day and send no traffic. Several example bots files are included that could be renamed and copied to robots.txt in the root of your web application. Your robots file should be accessible at http://www.yourdomain.com/robots.txt

Files included:

  • robots.txt Standard bot file should be usable for most sites. Only disallows know bad bots.
  • robots.major.txt Has a white-list for major search engine and blocks everything else
  • robots.noarchive.txt same as above but disallows archive.org bot which uses lots of traffic and doesn’t send much traffic
  • robots.wordpress.txt Entries for Wordpress blogs
  • robots.none.txt Block all bots

Checkout the source hosted on GitHub.

{ 0 comments… add one now }

Leave a Comment

You can use these HTML tags and attributes: <a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <strike> <strong>