Novice Script Help

e1seix · February 6, 2008

I think I did post this some time ago, but can't find original thread to rehash. I have some questions regarding lifting data from a particular webpage. What makes this unusual and why I need to ask some questions is that within the tags, there is a lot of white space and the data doesn't actually sit within closed tags eg. <span>data</span>, it falls like this

                                        <td width="55%"><div class="value">
                                            £6.99 <font size="3"> </font></div>
									    
                                        </td>

With the "£6.99" being what I want to extract and use. for example, this code works perfectly for a different website.

$url = 'http://www.cheapsmells.com/viewProduct.php?id=3462';
$html = file_get_contents($url);

preg_match('/<div class=\'productOurPrice\'?>(.+?)(\d+\.\d+)(.+?)?<\/div>/', $html, $match);
$out = $match[2];

Where the url is http://www.directcosmetics.com/results/products.cfm?ctype=ME&range=Hummer&code=34744 which is where the first example where the whitespace is, how can I adjust the above regex to obtain the information required, in this case literally "6.99" and nothing more. Is it possible because it's not within closed brackets?

Any help you can shed my way wouldbe greatly apprecviated.

Cheers

effigy · February 6, 2008

The /s modifier will allow . to match new lines.

Sign In

Novice Script Help

Recommended Posts

e1seix

Link to comment

Share on other sites

effigy

Link to comment

Share on other sites

Archived

Browse

Activity

Important Information