extract info from pdf doc, PDF to XML, HTML

Bryan Webb bww00 at amdahl.com
Fri Aug 10 12:47:41 EDT 2001


Hi,
    I need to build an index of several PDF docs. I think the best way
would be to extract some info from each PDF and build an HTML index
page. What is the best way to scan through the PDF docs and find that
info? Is there a way to convert the PDF docs to XML or HTML and then scan
them?

Thanks for all the help

Bryan Webb
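
To make the goal above concrete, here is a rough sketch of the kind of
indexing script being asked about. It is only one possible approach, not a
recommended answer. Assumptions: text extraction is delegated to a callable
passed in as extract_text; the helper extract_text_pypdf relies on the
third-party pypdf package being installed (an external converter could be
swapped in instead); the names build_index and summarize are made up for
illustration.

```python
import html
from pathlib import Path


def extract_text_pypdf(pdf_path):
    """Return the text of a PDF via the third-party pypdf package (assumed installed)."""
    from pypdf import PdfReader  # imported lazily so the rest runs without pypdf

    reader = PdfReader(str(pdf_path))
    # extract_text() can return an empty result for image-only pages
    return "\n".join(page.extract_text() or "" for page in reader.pages)


def summarize(text, max_chars=200):
    """Collapse runs of whitespace and keep the first max_chars characters."""
    flat = " ".join(text.split())
    return flat[:max_chars]


def build_index(pdf_paths, extract_text):
    """Build an HTML index page with one linked, escaped entry per PDF."""
    items = []
    for path in sorted(Path(p) for p in pdf_paths):
        snippet = summarize(extract_text(path))
        items.append(
            '<li><a href="{href}">{name}</a>: {snippet}</li>'.format(
                href=html.escape(path.as_posix(), quote=True),
                name=html.escape(path.name),
                snippet=html.escape(snippet),
            )
        )
    body = "\n".join(items)
    return "<html><body><h1>PDF index</h1>\n<ul>\n" + body + "\n</ul>\n</body></html>\n"
```

With pypdf available, something like
build_index(Path("docs").glob("*.pdf"), extract_text_pypdf) would return the
page as a string, which could then be written to index.html. The "docs"
directory is a placeholder. Passing the extractor as a parameter keeps the
HTML step independent of whichever PDF tool ends up being used.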




