Jump to content

Help with Robot.txt


otuatail

Recommended Posts

Hi > have been to the miscellaneous group without success.

 

I have tried asking this before but got nowhere. I have a robots text file like.

 

User-agent: Googlebot

Disallow: /news/

Disallow: /cms/

 

User-agent: Slurp

Disallow: /news/

Disallow: /cms/

 

This allows Google and Yahoo to crawl pages other than the ones listed above. My problem is if there is a new robot

 

how do you get the User-agent string. For example I have been told to stop Yahoo use Slurp. how do you arive at

 

that.

 

The complete string is

Mozilla/5.0 (compatible; Yahoo! Slurp; http://help.yahoo.com/help/us/ysearch/slurp)

 

Am I to understand that I only need to use a part of this i.e. Slurp and that the Robots do a string test to see if

 

Slurp is in there string.  Or is there a seperate list of strings just for the robot.txt file.

 

 

Desmond.

 

P.S. I dont want to kill off all robots that would be dumb. I want to takle a few major ones and direct where they can go.

 

Link to comment
https://forums.phpfreaks.com/topic/123926-help-with-robottxt/
Share on other sites

Archived

This topic is now archived and is closed to further replies.

×
×
  • Create New...

Important Information

We have placed cookies on your device to help make this website better. You can adjust your cookie settings, otherwise we'll assume you're okay to continue.