shage Posted June 1, 2008 Share Posted June 1, 2008 Im trying to figure out the regex to extract rss links of a page, so xml extensions of any href ive tried a ton but cant figure it out thank you in advance Link to comment https://forums.phpfreaks.com/topic/108194-regex/ Share on other sites More sharing options...
BillyBoB Posted June 1, 2008 Share Posted June 1, 2008 Here you go this one was really easy. There is one substring which is the path and the file (not the ext. though). /href=\"(.+)\.xml\"/Ui So you would do something like: <?php $text = "<html><head><title>test</title></head><body><a href=\"blah.xml\">Blah.xml</a></body></html>"; preg_match_all("/href=\"(.+)\.xml\"/Ui", $text, $matches); print_r($matches); echo "<br/><br/>"; $i = 0; while(isset($matches[0][$i])) { echo $matches[0][$i] . ".xml"; $i++; } ?> That should display something like: Array ( [0] => Array ( [0] => "Blah" ) ) Blah.xml Link to comment https://forums.phpfreaks.com/topic/108194-regex/#findComment-554658 Share on other sites More sharing options...
Recommended Posts
Archived
This topic is now archived and is closed to further replies.