Extract info from PDF doc, PDF to XML, HTML

Bryan Webb bww00 at amdahl.com
Fri Aug 10 12:47:41 EDT 2001

Hi,
    I need to build an index of several PDF docs. I think the best way
would be to extract some info from the PDF docs and build an HTML index
page. What is the best way to scan through the PDF docs and find the
info? Is there a way to convert the PDF docs to XML or HTML and then scan
them?

Thanks for all the help

Bryan Webb
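
One possible approach to the question above, offered as a rough sketch rather than a definitive answer: extract plain text from each PDF with an external converter, then generate the HTML index from that text. The sketch assumes the `pdftotext` command-line tool (shipped with Xpdf and Poppler) is installed and on the PATH. The function names, the "first non-blank line is the title" heuristic, and the `index.html` output name are all hypothetical choices made for illustration, not anything stated in the original message.

```python
import html
import subprocess
from pathlib import Path


def pdf_to_text(pdf_path):
    """Return the text of a PDF by calling the external `pdftotext` tool.

    Assumption: `pdftotext` is installed and on PATH. Passing "-" as the
    output file name makes it write the text to standard output.
    """
    result = subprocess.run(
        ["pdftotext", str(pdf_path), "-"],
        capture_output=True, text=True, check=True,
    )
    return result.stdout


def summarize(text, max_chars=80):
    """Use the first non-blank line as a crude title (hypothetical heuristic)."""
    for line in text.splitlines():
        line = " ".join(line.split())  # collapse runs of whitespace
        if line:
            return line[:max_chars]
    return "(no text found)"


def build_index(entries):
    """Build an HTML index page from (filename, extracted_text) pairs."""
    items = []
    for name, text in sorted(entries):
        href = html.escape(name, quote=True)
        label = html.escape(name)
        title = html.escape(summarize(text))
        items.append(f'<li><a href="{href}">{label}</a>: {title}</li>')
    return (
        "<html><body><h1>PDF index</h1>\n<ul>\n"
        + "\n".join(items)
        + "\n</ul>\n</body></html>\n"
    )


def index_directory(directory, out_file="index.html"):
    """Extract text from every *.pdf in `directory` and write the index page."""
    pdfs = sorted(Path(directory).glob("*.pdf"))
    entries = [(p.name, pdf_to_text(p)) for p in pdfs]
    Path(directory, out_file).write_text(build_index(entries), encoding="utf-8")
```

Calling `index_directory("docs")` would write `docs/index.html` with one link per PDF. Keeping `build_index` separate from the extraction step means a different converter (for example, one that emits XML or HTML) could be swapped in without touching the index generation.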

More information about the Python-list mailing list