Fw: PDF library for reading PDF files

Jeff Sandys sandysj at juno.com
Tue Jan 20 12:41:36 EST 2004


Peter Galfi wrote:
> 
...
> The "information" I am trying to extract from the PDF file is the text, 
> specifically in a way to keep the original paragraphs of the text.
...
> 
> Any suggestions?

Ghostscript has an Extract Text capability that I have used 
successfully on some PDF files, but not on others:
     http://www.cs.wisc.edu/~ghost/
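
For anyone who wants to drive this from Python, here is a minimal sketch,
not a definitive recipe. It assumes a present-day Ghostscript build that
ships the `txtwrite` output device and a `gs` executable on the PATH (on
Windows the name is usually different, e.g. `gswin64c`). The function names
are made up for illustration, and the blank-line paragraph splitting is only
a heuristic that will not hold for every PDF.

```python
import re
import shutil
import subprocess


def build_gs_command(pdf_path, gs_executable="gs"):
    """Return the argument list for a Ghostscript text-extraction run."""
    return [
        gs_executable,
        "-q",                 # suppress the startup banner
        "-dNOPAUSE",          # do not pause between pages
        "-dBATCH",            # exit after the last input file
        "-sDEVICE=txtwrite",  # text output device (assumed to be available)
        "-sOutputFile=-",     # send the extracted text to stdout
        pdf_path,
    ]


def extract_text(pdf_path, gs_executable="gs"):
    """Run Ghostscript on pdf_path and return the extracted text."""
    if shutil.which(gs_executable) is None:
        raise RuntimeError("Ghostscript executable not found: " + gs_executable)
    result = subprocess.run(
        build_gs_command(pdf_path, gs_executable),
        capture_output=True,
        check=True,  # raise CalledProcessError if Ghostscript fails
    )
    return result.stdout.decode("utf-8", errors="replace")


def split_paragraphs(text):
    """Heuristic: treat one or more blank lines as a paragraph break."""
    blocks = re.split(r"\n\s*\n", text.strip())
    # Collapse the line breaks inside each block into single spaces.
    return [" ".join(block.split()) for block in blocks if block.strip()]
```

Something like `split_paragraphs(extract_text("input.pdf"))` would then give
a list of paragraph strings, where `input.pdf` is a placeholder path. As
noted above, results vary from file to file, so expect to tune the splitting
rule for PDFs that do not leave blank lines between paragraphs.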

Thanks,
Jeff Sandys



More information about the Python-list mailing list