Fw: PDF library for reading PDF files

Robert Kern rkern at ucsd.edu
Sun Jan 18 18:20:52 EST 2004


Cameron Laird wrote:
> In article <Xns9474CBDE9B2D7cpl19ghumspamgourmet at 62.153.159.134>,
> Harald Massa  <cpl.19.ghum at spamgourmet.com> wrote:
> 
>>>I am looking for a library in Python that would read PDF files and I
>>>could extract information from the PDF with it. I have searched with
>>>google, but only found libraries that can be used to write PDF files. 
>>
>>reportlab has a lib called pagecatcher; it is fully supported with python, 
>>it is not free.
>>
>>Harald
> 
> 
> ReportLab's libraries are great things--but they do not "extract
> information from the PDF" in the sense I believe the original
> questioner intended.  

No, but ReportLab (the company) has a product separate from reportlab 
(the package) called PageCatcher that does exactly what the OP asked 
for. It is not open source, however, and costs a chunk of change.



More information about the Python-list mailing list