Working with PDFs?

jyoung79 at kc.rr.com
Sat Aug 14 19:44:50 EDT 2010


Does anyone know whether it's possible to work with PDF documents in
Python?  I'd like to do the following:

- Pull out text from each PDF page (to search for specific words)
- Combine separate PDF documents into one document
- Add bookmarks (with destination settings)
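
To make the list above a bit more concrete, here is a rough sketch of the
logic I have in mind, in plain Python with no PDF library involved. It
assumes the text of each page has already been extracted somehow (by
pdfminer, pyPDF, or whatever turns out to work) and that I know how many
pages each document has. The function names find_pages_with_words and
bookmark_offsets are made up for illustration, and the search is only a
simple case-insensitive substring match.

```python
def find_pages_with_words(page_texts, words):
    """Map each search word to the 1-based page numbers containing it.

    page_texts is assumed to be a list of strings, one per page, already
    extracted from the PDF by some other tool.
    """
    hits = {}
    for word in words:
        needle = word.lower()
        hits[word] = [
            number
            for number, text in enumerate(page_texts, start=1)
            if needle in text.lower()
        ]
    return hits


def bookmark_offsets(documents):
    """Work out where each document starts once everything is combined.

    documents is a list of (title, page_count) pairs in merge order.
    Returns (title, first_page) pairs, where first_page is the 0-based
    page index in the combined document that a bookmark should point to.
    """
    offsets = []
    next_page = 0
    for title, page_count in documents:
        offsets.append((title, next_page))
        next_page += page_count
    return offsets


if __name__ == "__main__":
    # Hypothetical sample data standing in for real extracted text.
    pages = ["Invoice for March", "Totals and tax", "Invoice appendix"]
    print(find_pages_with_words(pages, ["invoice", "tax"]))
    print(bookmark_offsets([("Report", 3), ("Appendix", 2), ("Index", 1)]))
```

The part I don't know how to do is the PDF side itself: getting the page
text out, writing the combined file, and attaching the bookmarks at those
page offsets.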

A few packages I've been looking at are pdfminer and pyPDF, among others
from this PyPI search:
http://pypi.python.org/pypi?%3Aaction=search&term=pdf&submit=search

Originally, I was using AppleScript and JavaScript to do this in Acrobat,
but Acrobat 9 has broken that process and I can't seem to make it work
anymore.  I'd like to find a workaround that doesn't rely on Adobe.

Thanks for your help.

Jay



More information about the Python-list mailing list