Guest MrLeN
Posted January 5, 2016

<?php
//==Get URLs
$srcUrl = 'http://www.url.com';
$html = file_get_contents($srcUrl);

$dom = new DOMDocument();
@$dom->loadHTML($html);

// grab all the links on the page
$xpath = new DOMXPath($dom);

// finding the a tag
$hrefs = $xpath->evaluate("/html/body//a");

// Loop to display all the links
for ($i = 0; $i < $hrefs->length; $i++) {
    $href = $hrefs->item($i);
    $url = $href->getAttribute('href');

    // Filter the null links
    if ($url != '#') {
        if (preg_match('/cars-vans-utes/', $url)) {
            if (preg_match('/s-ad/', $url)) {
                // concatenate with ".", not "/" (which would attempt division)
                echo $url . "<br />";
            }
        }
    }
}
?>

The above code works, so I can get all the URLs off a page. However, I don't want the script to get absolutely every single link. I want it to delete all the file content before a certain phrase in the HTML (say "abracadabra", for example) and then display only the links that are positioned after the word "abracadabra" in the HTML.

Can anyone point me in the right direction to be able to do this?
Solution
Guest MrLeN
Posted January 5, 2016

I worked it out!

$html = substr(strstr($html, 'abracadabra'), strlen('abracadabra'));
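
One caveat with the one-liner: it relies on the phrase being present. If "abracadabra" is missing, strstr() returns false, and the result is an empty string or false depending on the PHP version, so loadHTML() gets nothing useful. Below is a minimal sketch of a guarded variant. The helper names htmlAfterMarker() and extractLinks() and the sample HTML are made up for illustration and are not from the original post, and falling back to the whole page when the phrase is absent is only one possible choice.

```php
<?php
// Hypothetical helper (not from the original post): return the part of
// $html that follows the first occurrence of $marker.
function htmlAfterMarker(string $html, string $marker): string
{
    $pos = strpos($html, $marker);
    if ($pos === false) {
        // Assumption: keep the whole page when the marker is absent.
        return $html;
    }
    return substr($html, $pos + strlen($marker));
}

// Hypothetical helper: collect the href values of all <a> elements.
function extractLinks(string $html): array
{
    if (trim($html) === '') {
        return []; // loadHTML() rejects empty input
    }

    $dom = new DOMDocument();
    // Collect parser warnings quietly instead of using the @ operator.
    $previous = libxml_use_internal_errors(true);
    $dom->loadHTML($html);
    libxml_clear_errors();
    libxml_use_internal_errors($previous);

    $links = [];
    foreach ($dom->getElementsByTagName('a') as $a) {
        $links[] = $a->getAttribute('href');
    }
    return $links;
}

// Made-up sample page: one link before the phrase, two after it.
$sample = '<a href="/before">x</a> abracadabra '
        . '<a href="/after-1">y</a><a href="/after-2">z</a>';

// Only the links after the phrase are collected; /before is dropped.
$links = extractLinks(htmlAfterMarker($sample, 'abracadabra'));
print_r($links);
```

The preg_match() filters from the original script can then be applied to $links unchanged.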