drisate Posted October 5, 2012 Share Posted October 5, 2012 Hey guys i am trying to retreive the HTML of a specefic div containing the class name "main" The code works but for some reason, it only returns the txt ... no HTML. All the text formating is striped out ... normaly there should be images as well. What did i miss? $page = file_get_contents($item_url); $doc = new DOMDocument(); $doc->loadHTML($page); $divs = $doc->getElementsByTagName('div'); foreach($divs as $div) { if ($div->getAttribute('class') === "main") { $divContent = html_decode($div->nodeValue); } } $divContent = preg_replace('#^\d+#', '', $divContent); $divContent = nl2br(trim(htmlentities($divContent, ENT_QUOTES, "UTF-8"))); Quote Link to comment Share on other sites More sharing options...
Recommended Posts
Join the conversation
You can post now and register later. If you have an account, sign in now to post with your account.