Regex to match URLs ending with file extensions, e.g. .html, .pdf


jamesmiller

Hey guys, I am searching through a page and pulling only certain URLs from it. I have this regex:

 

preg_match_all('@\bhttp[s]*?(:|:)//[-A-Z0-9+&#/%?=~_|$!:,.;]+[A-Z0-9+&#/%=~_|$][.*?].@i',$page,$links);

 

This works fine for pulling out the links I need. The only problem is that it only pulls out the first letter after the ".", and I want it to pull out the whole URL, including the file extension. For example, I would like it to pull:

 

http://website.com/files/402005378/balablah.rar.html

 

but it only returns:

 

 

http://website.com/files/402005378/balablah.ra

 

Thanks for any help
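
A possible direction, offered as a sketch rather than a definitive fix: in the pattern above, `[.*?]` is a character class that matches one literal `.`, `*` or `?`, and the trailing `.` matches exactly one more character, which would explain why the match stops one character into the extension. The version below instead ends the pattern with an escaped dot and an explicit list of extensions. The extension list (`html`, `htm`, `pdf`, `rar`) and the sample `$page` string are assumptions made for illustration, not something taken from the original page.

```php
<?php
// Hypothetical sample input; in the post, $page holds the fetched page.
$page = 'See http://website.com/files/402005378/balablah.rar.html and '
      . 'https://website.com/docs/manual.pdf, but not http://website.com/about';

// https?  : "http" with an optional "s" (replaces [s]*?)
// \.      : a literal dot before the extension (replaces [.*?])
// (?:...) : assumed list of accepted extensions, adjust as needed
// \b      : the extension must end at a word boundary
$pattern = '@\bhttps?://[-A-Z0-9+&#/%?=~_|$!:,.;]*[-A-Z0-9+&#/%=~_|$]'
         . '\.(?:html?|pdf|rar)\b@i';

preg_match_all($pattern, $page, $links);

// $links[0] holds the full matches.
print_r($links[0]);
```

With the sample string, `$links[0]` should contain the `.rar.html` and `.pdf` URLs and skip the `/about` URL, because `.com` is not in the extension list. A URL with a query string after the extension would only be captured up to the extension, so that case would need extra handling.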
