deane034 Posted December 29, 2008

Does anyone have a class, a script, or ideas on how to convert PDF to HTML through PHP? I am working on a data-extraction project where I want to extract data tables and feed them into a database. I have already figured out a way to do this for HTML files, so I'm wondering whether there is a way to convert PDFs to HTML. Any help in this regard would be much appreciated.

Topic: https://forums.phpfreaks.com/topic/138721-converting-pdf-to-html-through-php/
redarrow Posted December 29, 2008

The closest thing is pdftohtml, which is a command-line tool: http://sourceforge.net/projects/pdftohtml/ That's all I can help with, sorry.

If you need to actually read the .PDF files and convert them to browser-viewable HTML documents, there is an open-source PDF2HTML class available at phpclasses.org that may help you, but it has problems converting and displaying elaborate graphics. The best web presentation solution, though, is probably to convert the .PDF files to .XML documents to make them cross-platform compatible. From there they can be converted to HTML, Word, Excel, etc., depending on your needs. If you only need to make them downloadable (that is, viewable by anyone with Acrobat Reader installed on their machine), why not just leave them in .PDF format and reference them to the other server?
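As a rough illustration of driving the command-line tool from a script, here is a minimal sketch. It is written in Python purely for brevity (the same command line could be launched from PHP), and it assumes the `pdftohtml` binary is installed and on the PATH. The function names `build_pdftohtml_command` and `convert_pdf` are made up for this example, and the output file naming (`<out_base>.xml` or `<out_base>.html`) is an assumption about the tool's behaviour that should be checked against the installed version.

```python
import shutil
import subprocess
from pathlib import Path


def build_pdftohtml_command(pdf_path, out_base, as_xml=True):
    """Build the argument list for a pdftohtml call (no shell involved)."""
    cmd = ["pdftohtml", "-i"]      # -i: ignore images
    if as_xml:
        cmd.append("-xml")         # -xml: emit XML with text positions
    else:
        cmd.append("-noframes")    # -noframes: one HTML file, no frameset
    cmd += [str(pdf_path), str(out_base)]
    return cmd


def convert_pdf(pdf_path, out_base, as_xml=True):
    """Run pdftohtml and return the path where the output is expected."""
    if shutil.which("pdftohtml") is None:
        raise RuntimeError("pdftohtml not found on PATH")
    subprocess.run(build_pdftohtml_command(pdf_path, out_base, as_xml), check=True)
    # Assumption: the tool appends .xml or .html to the output base name.
    return Path(str(out_base) + (".xml" if as_xml else ".html"))
```

Passing the arguments as a list rather than a single shell string avoids quoting problems with file names that contain spaces.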
deane034 (Author) Posted December 29, 2008

redarrow, thanks for the link, mate; I ran across the same one. I will check it out and let you know if it's workable. I cannot seem to find the PDF2HTML class at phpclasses.org, so do send me a link if you have one. I can't just display the PDF files, because I want to extract data tables out of them and manipulate them. So at a bare minimum I should be able to read the PDFs; I don't really care about images. If anyone knows how to read PDFs (forget the HTML part), that would be much appreciated.
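One possible way to get at table data without going through HTML at all is to use the XML output of pdftohtml (its `-xml` option) and rebuild rows from the text positions. The sketch below is in Python and assumes that output consists of `<page>` elements containing `<text top="..." left="...">` elements. The function name `rows_from_pdftohtml_xml`, the `tolerance` parameter, and the sample data are invented for illustration, and real PDFs with wrapped cells or merged columns would need more care.

```python
import xml.etree.ElementTree as ET


def rows_from_pdftohtml_xml(xml_text, tolerance=2):
    """Group positioned <text> elements into rows, page by page."""
    root = ET.fromstring(xml_text)
    rows = []
    for page in root.iter("page"):
        cells = []
        for text in page.iter("text"):
            # itertext() also picks up text nested in <b> or <i> tags.
            value = "".join(text.itertext()).strip()
            if value:
                top = int(float(text.get("top")))
                left = int(float(text.get("left")))
                cells.append((top, left, value))
        cells.sort()  # top to bottom, then left to right

        current = []   # (left, value) pairs of the row in progress
        row_top = None
        for top, left, value in cells:
            # A vertical jump larger than the tolerance starts a new row.
            if current and top - row_top > tolerance:
                rows.append([v for _, v in sorted(current)])
                current = []
            if not current:
                row_top = top
            current.append((left, value))
        if current:
            rows.append([v for _, v in sorted(current)])
    return rows


# Hand-written sample in the assumed format, not real tool output.
SAMPLE = """<pdf2xml>
<page number="1" top="0" left="0" height="1188" width="918">
<text top="100" left="50" width="40" height="12" font="0"><b>Name</b></text>
<text top="100" left="200" width="30" height="12" font="0"><b>Qty</b></text>
<text top="121" left="200" width="10" height="12" font="0">3</text>
<text top="120" left="50" width="40" height="12" font="0">Widget</text>
</page>
</pdf2xml>"""

print(rows_from_pdftohtml_xml(SAMPLE))
# [['Name', 'Qty'], ['Widget', '3']]
```

Each resulting row is a plain list of strings, so it can be handed to whatever database insert code already handles the HTML tables.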
This topic is now archived and is closed to further replies.