Extracting images from a PDF file

writeson doug.farrell at gmail.com
Fri Dec 28 08:26:27 EST 2007


On Dec 27, 2:17 pm, Max Erickson <maxerick... at gmail.com> wrote:
> Doug Farrell <write... at charter.net> wrote:
> > Hi all,
>
> > Does anyone know how to extract images from aPDFfile? What I'm
> > looking to do is use pdflib_py to open largePDFfiles on our
> > Linux servers, then use PIL to verify image data. I want to do
> > this in order to find corrupt images in thePDFfiles. If anyone
> > could help me out, or point me in the right direction, it would
> > be most appreciated!
>
> > Also, does anyone know of a way to validate aPDFfile?
>
> > Thanks in advance,
> > Doug
>
> There is some discussion here:
>
> http://nedbatchelder.com/blog/200712.html#e20071210T064608
>
> max

Max,

That's a very interesting snippet of code, thanks for posting the
link! Much appreciated!

Doug




More information about the Python-list mailing list