Jump to content

How to extarct news URL links from any website


digitalalpha

Recommended Posts

Can someone help me to explain how can I used PHP to extract news links for sites and stored them in database.

This is for sites which does not offer RSS feed etc. e.g.

 

http://www.cosco.com.sg/ir_multimedia.html

http://www.capitamall.com/newsroom.html etc.

 

 

basically, wants some code example which can explain how to read a website news link, and then collect LATEST news (URL, Subject line) and stored it to a database.

 

Many Thanks in advance.

this is where View Page Source comes in, or save as html.

 

than ya load it into yer favorite text editor (notepad,notepad++,textpad,ultraedit).

and look how they construct the news.

 

you will see a pattern, and ya can manually extract the data

but the more common way is to use regex

 

Archived

This topic is now archived and is closed to further replies.

×
×
  • Create New...

Important Information

We have placed cookies on your device to help make this website better. You can adjust your cookie settings, otherwise we'll assume you're okay to continue.