


robots.txt


3 replies to this topic

#1 johnnyk

  • Members
  • Advanced Member
  • 126 posts

Posted 12 July 2006 - 12:45 AM

This isn't really an HTML question, but close enough:

If I specify nothing in the robots.txt, what does the bot index? Say I have a directory /scripts/ which contains the action scripts of my forms (but my website does not link to any file in this directory via anchor), would I have to put that under disallow? How about a directory that contains php functions?

#2 Daniel0

  • Staff Alumni
  • Advanced Member
  • 11,956 posts

Posted 12 July 2006 - 07:35 AM

If robots.txt is empty, it's the same as having none: search engines may index everything they can reach. Anything the search engines should not see should be listed under Disallow in robots.txt.
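
To make the difference concrete, here is a minimal sketch using Python's standard `urllib.robotparser`. It compares an empty robots.txt against one that disallows /scripts/. The host `example.com` and the file name `form-handler.php` are made up for illustration, and the `allowed` helper is hypothetical, not part of any library.

```python
from urllib.robotparser import RobotFileParser


def allowed(robots_txt: str, url: str, agent: str = "*") -> bool:
    """Return True if `agent` may fetch `url` under the given robots.txt text."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(agent, url)


# An empty robots.txt: no rules at all.
EMPTY = ""

# A robots.txt that keeps all well-behaved bots out of /scripts/.
BLOCK_SCRIPTS = """\
User-agent: *
Disallow: /scripts/
"""

# Hypothetical URLs, for illustration only.
script_url = "http://example.com/scripts/form-handler.php"
page_url = "http://example.com/index.html"

print(allowed(EMPTY, script_url))          # empty file: nothing is blocked
print(allowed(BLOCK_SCRIPTS, script_url))  # blocked by the Disallow rule
print(allowed(BLOCK_SCRIPTS, page_url))    # the rest of the site is unaffected
```

Keep in mind that robots.txt is only a request that well-behaved crawlers honor. It does not stop anyone from requesting the files directly.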

#3 johnnyk

  • Members
  • Advanced Member
  • 126 posts

Posted 12 July 2006 - 04:28 PM

If they may not see it, why disallow it? Just in case?

#4 nogray

  • Members
  • Advanced Member
  • 930 posts
  • Location: San Francisco, CA

Posted 12 July 2006 - 05:24 PM

If you have no links to your directory /scripts/ anywhere and no one else is linking to that folder, search engines shouldn't find it.

robots.txt was created to keep search engines away from pages that are linked but require sign-up or some other access step, or that the site owner simply doesn't want showing up in search engines.

Second, search engines only read the output of a file, so PHP, ASP, HTML, or whatever else all look the same to them. If your script files don't output anything, nothing will be spidered.

NoGray.com




