What is a robots.txt file?
The robots exclusion protocol (REP), implemented through a robots.txt file, is a plain-text
file that webmasters create to instruct robots (typically search engine crawlers) how to
crawl and index pages on their website. In short, site owners place a /robots.txt file at
the root of their site to give instructions to web robots; this convention is called the
Robots Exclusion Protocol.
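For example, a minimal robots.txt that asks every robot to stay out of a hypothetical /private/ directory looks like this:

User-agent: *
Disallow: /private/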
How to create a robots.txt file for your website?
Step 1: Go to the following website:
http://tools.seobook.com/robots-txt/generator/
You will see the generator's input screen.
Step 2: Suppose you don't want robots to have access to the about-us page of your
website. Copy the /about-us.html part of that page's URL.
Paste that path into the "Files or directories" field and click Add; your robots.txt will
be generated as shown below. Copy the generated code into Notepad and save it as
robots.txt in your site's main (root) folder. This last point is very important: the file
must be named exactly robots.txt and sit at the root of the site.
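For this example, the generated file would contain something like the following (assuming /about-us.html is the only path you added):

User-agent: *
Disallow: /about-us.html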
Step 3: Upload the robots.txt file to the root of your website, for example with the FileZilla FTP client.
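If you prefer a script to a GUI client, here is a minimal upload sketch using Python's standard ftplib module; the host name and credentials below are placeholders you would replace with your own FTP details:

from ftplib import FTP

# Placeholder FTP details; replace with your own host and credentials.
HOST = "ftp.example.com"
USER = "username"
PASSWORD = "password"

ftp = FTP(HOST)            # Connect to the FTP server.
ftp.login(USER, PASSWORD)

# Upload the local robots.txt into the server's web root.
with open("robots.txt", "rb") as f:
    ftp.storbinary("STOR robots.txt", f)

ftp.quit()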
Step 4: Check whether the file was uploaded by typing /robots.txt at the end of your
website's URL in the browser. If the file's contents appear, your robots.txt was uploaded
to your website successfully.
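You can also verify the rules programmatically. The sketch below uses Python's standard urllib.robotparser module; the example.com URLs are placeholders for your own domain:

from urllib.robotparser import RobotFileParser

# Placeholder URLs; replace example.com with your own domain.
robots_url = "http://www.example.com/robots.txt"
page_url = "http://www.example.com/about-us.html"

parser = RobotFileParser()
parser.set_url(robots_url)
parser.read()  # Fetch and parse the live robots.txt file.

# Prints False if the file disallows all robots ("*") from that page.
print(parser.can_fetch("*", page_url))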
If you don't want to use the online tool, you can also write the directives directly in
Notepad and save the file as robots.txt.
Important note:
The "/robots.txt" file is a text file with one or more records. It usually contains a
single record that looks like this:
User-agent: *
Disallow: /cgi-bin/
Disallow: /tmp/
Disallow: /~joe/
In this example, three directories are excluded.
Note that you need a separate "Disallow" line for every URL prefix you want to exclude;
you cannot write "Disallow: /cgi-bin/ /tmp/" on a single line. Also, you may not have
blank lines within a record, because blank lines are used to delimit multiple records, as
in the example below.
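For example, the following file holds two records separated by the required blank line (ExampleBot is a hypothetical robot name used only for illustration):

User-agent: ExampleBot
Disallow: /tmp/

User-agent: *
Disallow: /cgi-bin/
Disallow: /tmp/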
Note also that globbing and regular expressions are not supported in either the User-agent
or the Disallow lines. The '*' in the User-agent field is a special value meaning "any
robot". Specifically, you cannot have lines like "User-agent: *bot*", "Disallow: /tmp/*",
or "Disallow: *.gif".
What you want to exclude depends on your server. Everything not explicitly disallowed is
considered fair game to retrieve. Some examples:
1) To exclude all robots from the entire server:

User-agent: *
Disallow: /

2) To allow all robots complete access:

User-agent: *
Disallow:

(Or just create an empty "/robots.txt" file, or don't use one at all.)

3) To exclude all robots from part of the server:

User-agent: *
Disallow: /cgi-bin/
Disallow: /tmp/
Disallow: /junk/
4) To exclude a single robot:

User-agent: BadBot
Disallow: /

5) To allow a single robot:

User-agent: Google
Disallow:

User-agent: *
Disallow: /

6) To exclude all files except one:
This is currently a bit awkward, as there is no "Allow" field. The easy way is to put all files to be disallowed
into a separate directory, say "stuff", and leave the one file in the level above this directory:

User-agent: *
Disallow: /~joe/stuff/

7) Alternatively, you can explicitly disallow all disallowed pages:

User-agent: *
Disallow: /~joe/junk.html
Disallow: /~joe/foo.html
Disallow: /~joe/bar.html