kukipei Posted December 6, 2007 Share Posted December 6, 2007 Hi to all, I am trying to solve one problem couple of days. I am scraping one html page. Coding of that page is utf-8. There are some east europian characters on that page čćžš etc.. This is a code I am using function handle_final_scrape($html, $ISIN, $mb) { global $database; $ind1 = strpos($html, $ISIN); $ind1table = strpos($html, "<table", $ind1); $ind2table = strpos($html, "</table", $ind1table); $table = substr($html, $ind1table, $ind2table + 8 - $ind1table); echo $table; $dom = new DOMDocument("1.0", "UTF-8"); @$dom->loadHTML($table); After echo $table I have table nice printed on screen. I can see letters (ščćđ) nice. But after dom functions anything I echo on screen instead of čćžđ I get some unknow characters. It is look like that dom functions somehow change encoding. Is it possible to solve this problem. Best Regards, Predrag Quote Link to comment Share on other sites More sharing options...
Recommended Posts
Join the conversation
You can post now and register later. If you have an account, sign in now to post with your account.