Suggestion for converting PDF files to HTML/txt files

brad byte8bits at gmail.com
Mon Aug 11 10:09:38 EDT 2008


srinivasan srinivas wrote:
> Could someone suggest me ways to convert PDF files to HTML files??
> Does Python have any modules to do that job??
> 
> Thanks,
> Srini

Unless there is some recent development, the answer is no, it's not 
possible. Getting text out of PDF is difficult (to say the least) and at 
times impossible... i.e. a PDF can be an image that contains some text, etc.



More information about the Python-list mailing list