Extracting images from a PDF file

Max Erickson maxerickson at gmail.com
Thu Dec 27 14:17:51 EST 2007


Doug Farrell <writeson at charter.net> wrote:

> Hi all,
> 
> Does anyone know how to extract images from a PDF file? What I'm
> looking to do is use pdflib_py to open large PDF files on our
> Linux servers, then use PIL to verify image data. I want to do
> this in order to find corrupt images in the PDF files. If anyone
> could help me out, or point me in the right direction, it would
> be most appreciated!
> 
> Also, does anyone know of a way to validate a PDF file? 
> 
> Thanks in advance,
> Doug

There is some discussion here: 

http://nedbatchelder.com/blog/200712.html#e20071210T064608



max




More information about the Python-list mailing list