smith.james0 Posted April 28, 2012 Share Posted April 28, 2012 Hi I have tried to write a script to collect web links from a html page put them in an array and compare it with an array of urls from my db. I am using this expression to find the urls in the page /href="([^\s"]+)/ it works but it returns http://www.google.com when I want google.com I have search the net but most of the expressions I came across don't work or return image urls as well. Can anyone help? James Link to comment https://forums.phpfreaks.com/topic/261753-help-with-regex/ Share on other sites More sharing options...
xyph Posted April 28, 2012 Share Posted April 28, 2012 Use an HTML parser like DOMDocument to grab all of the anchor tags in a page. Link to comment https://forums.phpfreaks.com/topic/261753-help-with-regex/#findComment-1341379 Share on other sites More sharing options...
Recommended Posts
Archived
This topic is now archived and is closed to further replies.