Fw: PDF library for reading PDF files

Cameron Laird claird at lairds.com
Tue Jan 20 10:32:48 EST 2004


In article <400CF2E3.29506EAE at netsurf.de>,
Andreas Lobinger  <andreas.lobinger at netsurf.de> wrote:
>Aloha,
>
>Peter Galfi schrieb:
			.
			.
			.
>> having to implement all the decompressions, etc. The "information" I am
>> trying to extract from the PDF file is the text, specifically in a way to
>> keep the original paragraphs of the text. I have seen so far one shareware
			.
			.
			.
>As others wrote here, the simplest solution is to use a external
>pdf-2-text programm and postprocess the data. Read comp.text.pdf
>
>There is no simple and consistent way to extract text from a .pdf
>because there are many ways to set text. The optical impression
			.
			.
			.
I want to emphasize that final sentence.  If you insist on pursuing
this, though, refer to <URL:
http://phaseit.net/claird/comp.text.pdf/PDF_converters.html#pdf2txt >.
-- 

Cameron Laird <claird at phaseit.net>
Business:  http://www.Phaseit.net



More information about the Python-list mailing list