Flux rss

Robots.txt

Presentation of the robots.txt File

robots.txt is a text file that contains commands for search engine indexing robots that specify the pages that can and cannot be indexed. When a search engine explores a website, it starts by looking for the robots.txt file at the root of the site.

robots.txt File Format

The robots.txt file is an ASCII file found at the root of the site. It can contain the following commands:

  • User Agent: used to specify the robot that is subject to the following orders. The value * means "all search engines"
  • Disallow: used to identify the pages to be excluded during indexing. Each page or path that is to be excluded must be on a separate line and must start with / The value / alone means "all of the website's pages".

Warning: The robots.txt file should not contain any empty lines!

Here are some examples of robots.txt files:

  • All pages are excluded:
    User Agent: *
    Disallow: /
  • No pages are excluded (equivalent to having no robots.txt file, meaning that all the pages are visited):
    User Agent: *
    Disallow: 
  • Only one robot is authorized:
    User Agent: RobotName
    Disallow:
    User Agent: *
    Disallow: /
  • One robot is excluded:
    User Agent: RobotName
    Disallow: /
    User Agent: *
    Disallow:
  • One page is excluded:
    User Agent: *
    Disallow: /directory/path/page.html
  • All pages from a directory and its subfolders are excluded:
    User Agent: *
    Disallow: /directory/

Examples of User Agents

Here are a few examples of User Agents for the most popular search engines:

Search Engine Name User Agent:
Alta Vista Scooter
Excite ArchitextSpider
Google Googlebot
HotBot Slurp
InfoSeek InfoSeek Sidewinder
Lycos T-Rex
Voilà Echo

For More Information

The web robots page

Last update on Thursday October 16, 2008 02:43:14 PM.

This document entitled « Robots.txt » from Kioskea (en.kioskea.net) is made available under the Creative Commons license. You can copy, modify copies of this page, under the conditions stipulated by the licence, as this note appears clearly.

Results for Robots.txt

Download Robot Benri When you are not at the home, you can control what takes place even there with hindsight. Robot Benri is a tool allowing to use your webcam or other sensor of pictures in so much of surveillance camera. You only have to connect it up your apparatus... en.kioskea.net/telecharger/telecharger-601-robot-benri
Pomi the robot penguin has hidden depths A Robot penguin named Pomi, the latest line in robotic pets by South Korean researchers being unveiled in Seoul South Korean researchers, showcasing their latest line in robotic pets, have unveiled a penguin which can interact with humans. Pomi... en.kioskea.net/actualites/pomi-the-robot-penguin-has-hidden-depths-10446-actualite.php3
Bat script 2delete files from .txt Hello, I need to delete files from a ftp box via a batch script. I will have a .txt file (generated by our source system) which will have a list of files to be deleted. Suppose in a.txt file i have the following files a.123234.txt b.354543.csv... en.kioskea.net/forum/affich-13796-bat-script-2delete-files-from-txt

Results for Robots.txt

SubstitutionSubstitution Basic Substitution Simple Global Targetted Conditioned Substitution Simplified Advanced Bloc Advanced Basic Substitution Simple 1st match (each line) encountered only sed 's/la/LA/' fichier.txt Global All... en.kioskea.net/faq/sujet-931-substitution
Writing in batch in text fileWriting in batch in text file To write in a file text, you just have to use a redirect “>”: echo text > output_file.txt To write in an existing file: echo " Writing at the end of the file ">> output_file.txt en.kioskea.net/faq/sujet-1050-writing-in-batch-in-text-file
Sed - inserting spacesSed - inserting spaces Insert a blank line after each sentence (punctuated by a carriage return) sed G file.txt Insert a blank line after each sentence (punctuated by a carriage return), without taking into account the existing white... en.kioskea.net/faq/sujet-917-sed-inserting-spaces

Results for Robots.txt

Batch file incorrect writing to txt file (Solved)Hello dear friends, I have some trouble writing some application data from windows to a .txt file located on a usb stick (E:/ cant even CD it), however when i relocate the log.bat on the C drive it works fine! Can anyone spot my error i cant find out... en.kioskea.net/forum/affich-37487-batch-file-incorrect-writing-to-txt-file
Script that searches lines in a txtWell im searching for a script that searches lines in a txt file that end with a $ and echos them to another txt file, any help will be greatly appreciated T Thanks. en.kioskea.net/forum/affich-16013-script-that-searches-lines-in-a-txt
Script that searches lines in a txtHello, Well im searching for a script that searches lines in a txt file that end with a $ and echos them to another txt file, any help will be greatly appreciated thanks. en.kioskea.net/forum/affich-14887-script-that-searches-lines-in-a-txt

Results for Robots.txt

Download MonoDescribed as being a part "from Asteroids, from Robotron and of Paint Shop Pro" Monoskiing is a game not as others. He combines the famous games of shooting and with a screen of completely extraordinary wakefulness. The purpose of the program is to... en.kioskea.net/telecharger/telecharger-486-mono
Download Some Txt to PDF ConverterDocuments PDF are formats most on and the most stable for the electronic transfers, since they cannot be changed. Because of this or that you do not risk changing by keeping the format of the text or the layout of the document. Some Text to PDF... en.kioskea.net/telecharger/telecharger-573-some-txt-to-pdf-converter
Download Free Word/Doc Txt to Image Jpg/Jpeg Bmp Tiff PngIt is usually the images that we insert into Word, Excel or PowerPoint documents. This time, it is the opposite, because we are going to convert these documents to image formats. All to Jpg / JPEG Image Bmp Tiff Png Converter is a powerful tool which... en.kioskea.net/telecharger/telecharger-1630-free-word-doc-txt-to-image-jpg-jpeg-bmp-tiff-png

Results for Robots.txt

Toshiba robot can do the job of the remote controlResearcher for Japanese electronics giant Toshiba, Daisuke Yamamoto, displays the prototype model for the new desktop-sized robot called the "ApriPoko", which can recognize human voices and operate electronic devices such as televisions and air... en.kioskea.net/actualites/toshiba-robot-can-do-the-job-of-the-remote-control-10241-actualite.php3
Japan companies unite to bring robots to homesThe presidents of Japanese robot ventures pose with their robots for photographers during a joint press conference in Tokyo. Tmsuk, ZMP, VStone, Business Design Laboratory Co (BDL) said they were forming a loose federation to exchange technology with... en.kioskea.net/actualites/japan-companies-unite-to-bring-robots-to-homes-10458-actualite.php3
Hardwired for love: Are robots the sex partners of the future?This Honda Motor 2007 handout shows the company's humanoid robot Asimo guiding a woman. David Levy, a PhD in gender studies and artificial intelligence and author of "Sex with Robots: The Evolution of Human-Robot Relations," predicts that by mid... en.kioskea.net/actualites/hardwired-for-love-are-robots-the-sex-partners-of-the-future-10096-actualite.php3

Results for Robots.txt

Internet technologies - The V90 standard The Rockwell company has introduced a new standard: the K56flex standard. This standard is offered as an alternative to the X2 technology of US ROBOTICS. It enables speeds in the region of 56Kb/s to be reached over an asynchronous connection. It is... en.kioskea.net/technologies/k56flex.php3
UNIX system - Commands Unix Commands Description Options ls lists the content of a directory -a Displays all files, including hidden files -I Displays a detailed listing -R Displays the files recursively (i.e. in the sub-directories) -d Displays only the directories and... en.kioskea.net/unix/unixcomm.php3