[Chicago] Extract information from PDF

Jon Sudlow jsudlow at gmail.com
Thu Jul 16 01:58:22 CEST 2009


I really like pyPdf, search on google for it. Its real easy to extract data
and do other cool stuff. The attractive thing about pyPDF is its an all
python library so all you have to do is download the module for it, put it
in your python path and import it. Their are several examples on the
internet. Its not too bad.

On Wed, Jul 15, 2009 at 5:20 PM, Carl Karsten <carl at personnelware.com>wrote:

> my guess:
>
> https://code.edge.launchpad.net/~poppler-python<https://code.edge.launchpad.net/%7Epoppler-python>
>
> On Wed, Jul 15, 2009 at 5:17 PM, Ian Bicking<ianb at colorstudy.com> wrote:
> > PDFMiner perhaps.
> >
> > On Wed, Jul 15, 2009 at 3:36 PM, Lukasz Szybalski <szybalski at gmail.com>
> > wrote:
> >>
> >> Hello,
> >> Would anybody know if there a standard python pdf library that can
> >> extract information from a pdf?
> >>
> >> I'm looking to extract:
> >>
> >> 1. Subject
> >> 2. Created Date
> >> 3. Number of Pages
> >>
> >> Anybody know how to do that?
> >>
> >> Thanks,
> >> Lucas
> >>
> >>
> >> --
> >> Using rsync. How to setup rsyncd.
> >> http://lucasmanual.com/mywiki/rsync
> >> OpenLdap - From start to finish.
> >> http://lucasmanual.com/mywiki/OpenLdap
> >> _______________________________________________
> >> Chicago mailing list
> >> Chicago at python.org
> >> http://mail.python.org/mailman/listinfo/chicago
> >
> >
> >
> > --
> > Ian Bicking  |  http://blog.ianbicking.org  |
> >  http://topplabs.org/civichacker
> >
> > _______________________________________________
> > Chicago mailing list
> > Chicago at python.org
> > http://mail.python.org/mailman/listinfo/chicago
> >
> >
>
>
>
> --
> Carl K
> _______________________________________________
> Chicago mailing list
> Chicago at python.org
> http://mail.python.org/mailman/listinfo/chicago
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mail.python.org/pipermail/chicago/attachments/20090715/48d05db1/attachment.htm>


More information about the Chicago mailing list