Reading Adobe PDF File

Chris Rebert clp2 at rebertia.com
Sun Jan 29 00:59:29 EST 2012


On Sat, Jan 28, 2012 at 9:52 PM, Shrewd Investor <cltung at gmail.com> wrote:
> Hi,
>
> I have a very large Adobe PDF file.  I was hoping to use a script to
> extract the information for it.  Is there a way to loop through a PDF
> file using Python?

Haven't used it myself, but:
http://www.unixuser.org/~euske/python/pdfminer/

> Or do I need to find a way to convert a PDF file into a text file?  If
> so how?

The pdf2txt.py script from the same package happens to do exactly this.

Cheers,
Chris



More information about the Python-list mailing list