
Block scraper


oracle765


Total Visits: 1
Location: Ashburn, Virginia, United States
IP Address: Amazon.com (54.174.59.52)
Referring URL: (No referring link)
Visit Page: UK Airport Parking Prices & Charges | Compare & Choose

Total Visits: 1
Location: Ashburn, Virginia, United States
IP Address: Amazon.com (54.174.54.225)
Referring URL: (No referring link)
Entry Page: Shopping Prices & Online Shopping Deals in Australia
Exit Page: Compare Travel Insurance Prices & Quotes | Compare & Choose

Total Visits: 1
Location: Ashburn, Virginia, United States
IP Address: Amazon.com (54.174.62.56)
Referring URL: (No referring link)
Entry Page: Switch Energy Suppliers and save | Compare & Choose
Exit Page: Trauma Insurance Cover & Quotes | Compare & Choose

Total Visits: 1
Location: Ashburn, Virginia, United States
IP Address: Amazon.com (54.174.59.127)
Referring URL: (No referring link)
Entry Page: Compare and choose home loans, products and services
Exit Page: Our Terms and Conditions | Compare & Choose

Total Visits: 1
Location: Ashburn, Virginia, United States
IP Address: Amazon.com (54.174.55.230)
Referring URL: (No referring link)
Entry Page: Contact Compare and Choose by email or telephone
Exit Page: Compare Sports Products & Goods | Compare & Choose

Total Visits: 1
Location: Ashburn, Virginia, United States
IP Address: Amazon.com (54.174.55.240)
Referring URL: (No referring link)
Entry Page: Compare Home & Garden Store Prices | Compare & Choose
Exit Page: Compare Shoes, Trainers & Footwear Prices | Compare & Choose

Hi Professionals,

Our website is written in PHP and runs on Linux.

We are currently being trawled by the addresses listed above. As you can see, the requests come from the same location but from a different IP address every time.

Is there a way to stop this?

Thanks in advance.


It's a safe bet that nothing hosted on or through an Amazon Web Services server should be making requests to your web site.

However, a whois lookup for two of those IP addresses gives the following information: http://www.whois.com/whois/54.174.55.230 and http://www.whois.com/whois/54.174.62.56

These are apparently for a company named HubSpot. Those two whois lookups list different abuse contacts. Get the whois information for each of the IP addresses you are getting requests from, then do two things:

1) Provide the abuse contact(s) with the IP address and date/time of the requests. For that set of IP addresses and times, they should be able to determine what is sending the requests (perhaps they have a bot or proxy running on their systems).

2) Find the range of IP addresses that each address belongs to and, assuming you are using an Apache web server, add an entry to the .htaccess file in the domain root that blocks (denies) requests from that entire range. Repeat for any other ranges of IP addresses.
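Step 2 could be roughed out with a small script like the one below. It is only a sketch under some assumptions: the function names are made up for illustration, the script derives the smallest CIDR block covering the six addresses from the log rather than the range whois actually reports (which is the authoritative one and may well be wider), and the generated rules use the older Apache 2.2 `Order`/`Deny` syntax.

```python
import ipaddress

# IP addresses copied from the visitor log in the first post.
OBSERVED_IPS = [
    "54.174.59.52",
    "54.174.54.225",
    "54.174.62.56",
    "54.174.59.127",
    "54.174.55.230",
    "54.174.55.240",
]


def smallest_covering_network(ips):
    """Return the smallest single IPv4 CIDR block that contains every address."""
    addrs = [ipaddress.IPv4Address(ip) for ip in ips]
    net = ipaddress.ip_network(f"{addrs[0]}/32")
    # Widen the block one prefix bit at a time until all addresses fit.
    while not all(addr in net for addr in addrs):
        net = net.supernet()
    return net


def htaccess_deny_lines(networks):
    """Build Apache 2.2-style access rules that deny each CIDR range."""
    lines = ["Order Allow,Deny", "Allow from all"]
    lines += [f"Deny from {net}" for net in networks]
    return lines


if __name__ == "__main__":
    # Hypothetical usage: replace `block` with the ranges from your whois lookups.
    block = smallest_covering_network(OBSERVED_IPS)
    print("\n".join(htaccess_deny_lines([block])))
```

The printed lines would go into the .htaccess file in the domain root. On Apache 2.4 the equivalent is a `Require not ip <range>` line next to `Require all granted` inside a `<RequireAll>` block. Either way, check the range against the whois output before blocking it, since a block derived only from the log can be narrower than the real allocation.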
