matfish Posted October 20, 2006 Share Posted October 20, 2006 Hi, anyone point me in the right direction of extracting a list of URLs/Links of an external page (such as extracting a page of links from google which are related to a keyword)? I can then manipulate this data into my database?Just want to extract links from a page which I could specify and maybe put the urls into an array so I could then play with?Many thanks for any help. Link to comment https://forums.phpfreaks.com/topic/24604-urllink-extracting/ Share on other sites More sharing options...
matfish Posted October 24, 2006 Author Share Posted October 24, 2006 Ok, lets start again.Anyone know how to extract URLs from a specific site in php?Thanks Link to comment https://forums.phpfreaks.com/topic/24604-urllink-extracting/#findComment-113509 Share on other sites More sharing options...
heckenschutze Posted October 24, 2006 Share Posted October 24, 2006 With regular expressions, Heres my crappy attempt at regex :D[code]<?phpfunction GetLinks($url){ $aOut = array(); preg_match_all("/http:\/\/?[^ ][^\"][^'][^<][^>]+/i", file_get_contents($url), $aOut, PREG_PATTERN_ORDER); print_r($aOut);}echo "<pre>";GetLinks("http://google.com.au");echo "</pre>";?>[/code]Hey, its a start ;) Link to comment https://forums.phpfreaks.com/topic/24604-urllink-extracting/#findComment-113514 Share on other sites More sharing options...
matfish Posted October 24, 2006 Author Share Posted October 24, 2006 Thats brilliant thank you, It contains all of the ahref tag but from that I can pick out the URLs which is what I need.Many thanks!!!! Link to comment https://forums.phpfreaks.com/topic/24604-urllink-extracting/#findComment-113522 Share on other sites More sharing options...
matfish Posted October 24, 2006 Author Share Posted October 24, 2006 Hey dude, Im having a bit of trouble reading the array, for example: picking out a random array - say number 4?Just returns "Array" Link to comment https://forums.phpfreaks.com/topic/24604-urllink-extracting/#findComment-113654 Share on other sites More sharing options...
True`Logic Posted October 24, 2006 Share Posted October 24, 2006 random:$num = rand(0, count([b]ARRAY[/b]));echo [b]ARRAY[/b][$num];entire:$num2 = count([b]ARRAY[/b]);$num3 = 0;while($num3 <= $num2) {echo [b]ARRAY[/b][$num3] . "< br >";$num3++;}hope this helped =) Link to comment https://forums.phpfreaks.com/topic/24604-urllink-extracting/#findComment-113665 Share on other sites More sharing options...
Recommended Posts
Archived
This topic is now archived and is closed to further replies.