[BangPypers] extracting unicode text from pdfs

Eknath Venkataramani eknath.iyer at gmail.com
Mon May 24 17:23:06 CEST 2010


On Mon, May 24, 2010 at 7:51 PM, Dhananjay Nene <dhananjay.nene at gmail.com>wrote:

> You may want to try out pdfminer. Its very similar to xpdf in structure and
> should give you the parsed data into unicode directly.
>
Tried but I got the same output as xpdf. I guess it's because of the point
mentioned by Gora- 'you might not have those fonts installed in your system'


-- 
Eknath Venkataramani


More information about the BangPypers mailing list