Jump to content

Recommended Posts

ok...in that case...you don't want to use an XML parser. if the page doesn't validate as strict XHTML (which MOST pages don't) you won't be able to retrieve it. Instead, get it with regular expressions:

<?php
$contents = file_get_contents('http://www.cnn.com');
if(preg_match('/<title>(.+?)<\/title>/',$contents,$matches)){
  $title = $matches[1];
  print $title;
}
?>

ok...in that case...you don't want to use an XML parser. if the page doesn't validate as strict XHTML (which MOST pages don't) you won't be able to retrieve it. Instead, get it with regular expressions:

 

Very good point. I went for DOM because it would't matter if the title tags were on different lines. I might try to stip out any newline characters.

 

Thanks, I will give your approach a go this evening.

This thread is more than a year old. Please don't revive it unless you have something important to add.

Join the conversation

You can post now and register later. If you have an account, sign in now to post with your account.

Guest
Reply to this topic...

×   Pasted as rich text.   Restore formatting

  Only 75 emoji are allowed.

×   Your link has been automatically embedded.   Display as a link instead

×   Your previous content has been restored.   Clear editor

×   You cannot paste images directly. Upload or insert images from URL.

×
×
  • Create New...

Important Information

We have placed cookies on your device to help make this website better. You can adjust your cookie settings, otherwise we'll assume you're okay to continue.