The Robots.txt File

Techie Talks November 6, 2017

0 3,483 4 minutes read

The Robots.txt File One way to begin understanding what’s running on a web server is to view the server’s robots.txt file. The robots.txt file is a listing of the directories and files on a web server that the owner wants web crawlers to omit from the indexing process. A web crawler is a piece of software that is used to catalog web information to be used in search engines and archives that are mostly commonly deployed by search engines such as Google and Yahoo. These web crawlers scour the internet and index (archive) all possible findings to improve the accuracy and speed of their internet search functionality.

To a hacker, the robots.txt file is a road map to identify sensitive information because any web server’s robots.txt file can be retrieved in a browser by simply requesting it in the URL. Here is an example robots.txt file that you can easily retrieve directly in your browser by simply requesting /robots.txt after a host URL.

# Notice: Crawling Facebook is prohibited unless you have express written

# permission. See: http://www.facebook.com/apps/site_scraping_tos_terms.php

User-agent: Applebot

Disallow: /ajax/

Disallow: /album.php

Disallow: /checkpoint/

Disallow: /contact_importer/

Disallow: /feeds/

Disallow: /file_download.php

Disallow: /hashtag/

Disallow: /l.php

Disallow: /live/

Disallow: /moments_app/

Disallow: /p.php

Disallow: /photo.php

Disallow: /photos.php

Disallow: /sharer/

User-agent: baiduspider

Disallow: /ajax/

Disallow: /album.php

Disallow: /checkpoint/

Disallow: /contact_importer/

Disallow: /feeds/

Disallow: /file_download.php

Disallow: /hashtag/

Disallow: /l.php

Disallow: /live/

Disallow: /moments_app/

Disallow: /p.php

Disallow: /photo.php

Disallow: /photos.php

Disallow: /sharer/

User-agent: Bingbot

Disallow: /ajax/

Disallow: /album.php

Disallow: /checkpoint/

Disallow: /contact_importer/

Disallow: /feeds/

Disallow: /file_download.php

Disallow: /hashtag/

Disallow: /l.php

Disallow: /live/

Disallow: /moments_app/

Disallow: /p.php

Disallow: /photo.php

Disallow: /photos.php

Disallow: /sharer/

User-agent: Googlebot

Disallow: /ajax/

Disallow: /album.php

Disallow: /checkpoint/

Disallow: /contact_importer/

Disallow: /feeds/

Disallow: /file_download.php

Disallow: /hashtag/

Disallow: /l.php

Disallow: /live/

Disallow: /moments_app/

Disallow: /p.php

Disallow: /photo.php

Disallow: /photos.php

Disallow: /sharer/

User-agent: ia_archiver

Disallow: /

Disallow: /ajax/

Disallow: /album.php

Disallow: /checkpoint/

Disallow: /contact_importer/

Disallow: /feeds/

Disallow: /file_download.php

Disallow: /hashtag/

Disallow: /l.php

Disallow: /live/

Disallow: /moments_app/

Disallow: /p.php

Disallow: /photo.php

Disallow: /photos.php

Disallow: /sharer/

User-agent: msnbot

Disallow: /ajax/

Disallow: /album.php

Disallow: /checkpoint/

Disallow: /contact_importer/

Disallow: /feeds/

Disallow: /file_download.php

Disallow: /hashtag/

Disallow: /l.php

Disallow: /live/

Disallow: /moments_app/

Disallow: /p.php

Disallow: /photo.php

Disallow: /photos.php

Disallow: /sharer/

User-agent: Naverbot

Disallow: /ajax/

Disallow: /album.php

Disallow: /checkpoint/

Disallow: /contact_importer/

Disallow: /feeds/

Disallow: /file_download.php

Disallow: /hashtag/

Disallow: /l.php

Disallow: /live/

Disallow: /moments_app/

Disallow: /p.php

Disallow: /photo.php

Disallow: /photos.php

Disallow: /sharer/

User-agent: seznambot

Disallow: /ajax/

Disallow: /album.php

Disallow: /checkpoint/

Disallow: /contact_importer/

Disallow: /feeds/

Disallow: /file_download.php

Disallow: /hashtag/

Disallow: /l.php

Disallow: /live/

Disallow: /moments_app/

Disallow: /p.php

Disallow: /photo.php

Disallow: /photos.php

Disallow: /sharer/

User-agent: Slurp

Disallow: /ajax/

Disallow: /album.php

Disallow: /checkpoint/

Disallow: /contact_importer/

Disallow: /feeds/

Disallow: /file_download.php

Disallow: /hashtag/

Disallow: /l.php

Disallow: /live/

Disallow: /moments_app/

Disallow: /p.php

Disallow: /photo.php

Disallow: /photos.php

Disallow: /sharer/

User-agent: teoma

Disallow: /ajax/

Disallow: /album.php

Disallow: /checkpoint/

Disallow: /contact_importer/

Disallow: /feeds/

Disallow: /file_download.php

Disallow: /hashtag/

Disallow: /l.php

Disallow: /live/

Disallow: /moments_app/

Disallow: /p.php

Disallow: /photo.php

Disallow: /photos.php

Disallow: /sharer/

User-agent: Twitterbot

Disallow: /ajax/

Disallow: /album.php

Disallow: /checkpoint/

Disallow: /contact_importer/

Disallow: /feeds/

Disallow: /file_download.php

Disallow: /hashtag/

Disallow: /l.php

Disallow: /live/

Disallow: /moments_app/

Disallow: /p.php

Disallow: /photo.php

Disallow: /photos.php

Disallow: /sharer/

User-agent: Yandex

Disallow: /ajax/

Disallow: /album.php

Disallow: /checkpoint/

Disallow: /contact_importer/

Disallow: /feeds/

Disallow: /file_download.php

Disallow: /hashtag/

Disallow: /l.php

Disallow: /live/

Disallow: /moments_app/

Disallow: /p.php

Disallow: /photo.php

Disallow: /photos.php

Disallow: /sharer/

User-agent: Yeti

Disallow: /ajax/

Disallow: /album.php

Disallow: /checkpoint/

Disallow: /contact_importer/

Disallow: /feeds/

Disallow: /file_download.php

Disallow: /hashtag/

Disallow: /l.php

Disallow: /live/

Disallow: /moments_app/

Disallow: /p.php

Disallow: /photo.php

Disallow: /photos.php

Disallow: /sharer/

User-agent: Applebot