Showing results for tags 'scraping'.
-
Hello all, I have a research task where I am supposed to write a script that scrapes data from a table and outputs it as JSON. The challenge is that the table I am supposed to scrape is unstructured (its rows follow an uneven pattern). How do we extract data from such a table? I have tried various loops using simple_html_dom but failed because of the table format. Can someone guide me on what the approach should be? I have put the HTML table here: https://www.protectedtext.com/get-json-from-table-using-php-script (password: 123). Any suggestion will be a help. Thanks in advance!
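One possible approach, sketched here in Python for illustration only (the same idea should port to PHP): rather than looping over rows directly, first expand the table into a regular grid and only then build the JSON. This assumes the "uneven pattern" comes from `rowspan`/`colspan` cells and that the first row holds the column names; the linked table was not inspected, so both points are assumptions. The names `TableGrid`, `expand`, and `table_to_json` are made up for this example, and nested tables are not handled.

```python
import json
from html.parser import HTMLParser


class TableGrid(HTMLParser):
    """Collect <td>/<th> cells row by row, remembering each cell's spans."""

    def __init__(self):
        super().__init__()
        self.rows = []       # each row: list of (text, rowspan, colspan)
        self._cell = None    # text pieces of the cell being read, or None
        self._span = (1, 1)

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self.rows.append([])
        elif tag in ("td", "th") and self.rows:
            a = dict(attrs)
            self._span = (int(a.get("rowspan") or 1), int(a.get("colspan") or 1))
            self._cell = []

    def handle_data(self, data):
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._cell is not None:
            text = " ".join("".join(self._cell).split())  # collapse whitespace
            self.rows[-1].append((text, *self._span))
            self._cell = None


def expand(rows):
    """Copy every spanned cell into each grid position it covers."""
    grid = {}  # (row, col) -> text
    for r, cells in enumerate(rows):
        c = 0
        for text, rowspan, colspan in cells:
            while (r, c) in grid:  # slot already taken by a rowspan from above
                c += 1
            for dr in range(rowspan):
                for dc in range(colspan):
                    grid[(r + dr, c + dc)] = text
            c += colspan
    if not grid:
        return []
    height = max(r for r, _ in grid) + 1
    width = max(c for _, c in grid) + 1
    return [[grid.get((r, c), "") for c in range(width)] for r in range(height)]


def table_to_json(html):
    """Assumption: the first grid row is the header row."""
    parser = TableGrid()
    parser.feed(html)
    grid = expand(parser.rows)
    if not grid:
        return "[]"
    header, body = grid[0], grid[1:]
    return json.dumps([dict(zip(header, row)) for row in body], indent=2)
```

Once every row has the same number of columns, the loop that builds the JSON becomes trivial. If the real table is irregular in some other way (for example, section-heading rows mixed in with data rows), the `expand` step would need a different rule.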
-
Hello everyone, I would like some direction, as I want to start a project and I'm not even sure I'm headed the right way. I have a local news site from which I would like to "scrape" various news items. I already talked with their webmaster, and he said it's good to go. OK, so I believe (please correct me) that a good tool for the job would be PHP and cURL. What about using PHP Simple HTML DOM Parser? I ask because I'm just not sure where to head. I'm a n00b at this, so diving into this project will take many hours before I even know whether what I'm doing will work. So, that's the general direction. Should I use PHP and cURL? (a reference doc I found here) Also, I don't know how this works, but I would like to "scrape" the page 4-5 times per day (at pre-set times) and then save the info on my server. That way, when a user visits my website, I serve the scraped information from my site (as opposed to re-scraping the original site). Any thoughts on this project? Thank you very much, everyone!
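The "scrape a few times a day, serve from my own server" part can be sketched as two separate steps: a refresh job run by a scheduler, and a read function called on each visit. The sketch below is in Python purely for illustration; in PHP the fetch would typically be done with cURL. The file name `news_cache.json`, the URL, the User-Agent string, and all function names are placeholders, not anything from the original site.

```python
import json
import time
import urllib.request
from pathlib import Path

CACHE_FILE = Path("news_cache.json")     # hypothetical location on your server
SOURCE_URL = "https://example.com/news"  # placeholder for the news site


def fetch_page(url):
    """Download one page, identifying the scraper with a User-Agent header."""
    req = urllib.request.Request(url, headers={"User-Agent": "my-news-mirror/0.1"})
    with urllib.request.urlopen(req, timeout=30) as resp:
        return resp.read().decode("utf-8", errors="replace")


def refresh_cache(fetch=fetch_page, url=SOURCE_URL, cache_file=CACHE_FILE):
    """Run from the scheduler: fetch once and store the result on disk."""
    payload = {"fetched_at": time.time(), "html": fetch(url)}
    tmp = cache_file.with_suffix(".tmp")
    tmp.write_text(json.dumps(payload), encoding="utf-8")
    tmp.replace(cache_file)  # swap the finished file in with a single rename
    return payload


def load_cache(cache_file=CACHE_FILE):
    """Run on each visit: read the stored copy, never the remote site."""
    if not cache_file.exists():
        return None
    return json.loads(cache_file.read_text(encoding="utf-8"))


if __name__ == "__main__":
    refresh_cache()
```

On a Linux server, the pre-set times could be handled by cron. For example, a crontab entry such as `0 6,10,14,18,22 * * * python3 /path/to/refresh.py` would run the refresh five times a day; the path and the hours are made up. Your page then calls only `load_cache()`, so visitors never trigger a request to the original site.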