Jump to content

Crawler


hackalive

Recommended Posts

Hello I am looking to build a web crawler such as MLBot. It must recognise robots.txt and ROBOTS meta tag, but in saying that when a site such as Wordpress shows visitor stats it lists the crawler (eg MLBot http://www.metadatalabs.com/mlbot). So how can I build a crawler that will list as HackAliveBot or HackAliveCrawler and will recognise robots.txt and ROBOTS meta tag.

 

Thanks so much in advance

Link to comment
Share on other sites

This board is for help with specific programming questions, your post is fare too vague and too widely scoped to possibly be answered within a simple forum post.

 

I suggest you start posting these questions elsewhere.

Link to comment
Share on other sites

Guest
This topic is now closed to further replies.
×
×
  • Create New...

Important Information

We have placed cookies on your device to help make this website better. You can adjust your cookie settings, otherwise we'll assume you're okay to continue.