robots.txt is a text file that contains commands for search engine indexing robots that specify the pages that can and cannot be indexed. When a search engine explores a website, it starts by looking for the robots.txt file at the root of the site.
The robots.txt file is an ASCII file found at the root of the site. It can contain the following commands:
![]() |
The robots.txt file should not contain any empty lines! |
Here are some examples of robots.txt files:
User Agent: * Disallow: /
User Agent: * Disallow:
User Agent: RobotName Disallow: User Agent: * Disallow: /
User Agent: RobotName Disallow: / User Agent: * Disallow:
User Agent: * Disallow: /directory/path/page.html
User Agent: * Disallow: /directory/
Here are a few examples of User Agents for the most popular search engines:
| Search Engine Name | User Agent: |
|---|---|
| Alta Vista | Scooter |
| Excite | ArchitextSpider |
| Googlebot | |
| HotBot | Slurp |
| InfoSeek | InfoSeek Sidewinder |
| Lycos | T-Rex |
| Voilà | Echo |