shage Posted June 1, 2008
I'm trying to figure out a regex to extract the RSS links from a page, i.e. any href that points to a file with an .xml extension. I've tried a ton of patterns but can't figure it out. Thank you in advance.
BillyBoB Posted June 1, 2008
Here you go, this one is fairly easy. The pattern has one capture group, which holds the path and the file name (without the extension):

/href=\"([^\"]+)\.xml\"/Ui

Using [^\"]+ rather than .+ keeps a match from running past the closing quote of an earlier, unrelated href. So you would do something like:

<?php
$text = "<html><head><title>test</title></head><body><a href=\"blah.xml\">Blah.xml</a></body></html>";
// $matches[0] holds the full matches, $matches[1] holds the captured path and file name
preg_match_all("/href=\"([^\"]+)\.xml\"/Ui", $text, $matches);
print_r($matches);
echo "<br/><br/>";
// Loop over the capture group, not the full match, and put the extension back on
foreach ($matches[1] as $name) {
    echo $name . ".xml";
}
?>

That should display something like:

Array ( [0] => Array ( [0] => href="blah.xml" ) [1] => Array ( [0] => blah ) )

blah.xml
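A regex can trip over real-world markup (single-quoted attributes, extra whitespace, query strings after the extension), so here is a rough alternative sketch that uses PHP's DOM extension instead. The helper name extract_xml_links is made up for illustration and is not from this thread, and the sketch assumes the feed links are ordinary <a> tags whose href path ends in .xml.

```php
<?php
// Hypothetical helper (not from the thread): return every <a href> whose
// path ends in ".xml", using the DOM extension instead of a regex.
function extract_xml_links(string $html): array
{
    // DOMDocument::loadHTML() rejects an empty string, so bail out early.
    if ($html === '') {
        return [];
    }

    $doc = new DOMDocument();
    // Collect libxml warnings internally so sloppy markup does not spam output.
    $previous = libxml_use_internal_errors(true);
    $doc->loadHTML($html);
    libxml_clear_errors();
    libxml_use_internal_errors($previous);

    $links = [];
    foreach ($doc->getElementsByTagName('a') as $anchor) {
        $href = $anchor->getAttribute('href');
        // Look at the path only, so "feed.xml?x=1" still counts.
        $path = (string) parse_url($href, PHP_URL_PATH);
        if (strtolower(pathinfo($path, PATHINFO_EXTENSION)) === 'xml') {
            $links[] = $href;
        }
    }
    return $links;
}

$text = '<html><head><title>test</title></head><body>'
      . '<a href="page.html">Page</a><a href="blah.xml">Blah.xml</a>'
      . '</body></html>';
print_r(extract_xml_links($text));
```

This returns the full href values rather than the name without the extension, so nothing has to be glued back together afterwards. If the page declares its feeds some other way, the tag name and attribute checked here would need to change.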