Jump to content

Archived

This topic is now archived and is closed to further replies.

Nimwei

PDF's - Is there a way to read a PDF and pull text from it?

Recommended Posts

I've got a PDF document that is generated every week by a third-party that I need to go out, download, and parse out the text.  It is a simple word document that has a generic table and then converted to a PDF so the parsing won't be bad.

I've searched around and I can't seem to find any libraries or examples of how to do this.

Anyone help me?

Thanks.

Share this post


Link to post
Share on other sites
[url=http://www.google.com/search?q=php%20pdf%20classes]Search results for PHP.[/url] There are lots of other tools out there to do this, such as pdf2txt.

Share this post


Link to post
Share on other sites
I don't need a tool to do it.  I need to write a script to do it because I don't want to have to manually go out and convert it every week.  Thanks for the search link though. I've been through them but I'll look further.

Share this post


Link to post
Share on other sites
php has PDF functions which you sould be able to use if installed:
http://www.php.net/manual/en/ref.pdf.php

Share this post


Link to post
Share on other sites
Yes, I'm aware of the functions for PDFs.  Unfortunately, they are not documented so I was hoping someone could help me out and point me int eh right way.

Share this post


Link to post
Share on other sites

×

Important Information

We have placed cookies on your device to help make this website better. You can adjust your cookie settings, otherwise we'll assume you're okay to continue.