Read the table data from PDF files in Python

Peter Pearson pkpearson at nowhere.invalid
Wed Apr 24 11:16:40 EDT 2019


On Wed, 24 Apr 2019 02:36:27 -0700 (PDT), mrawat213 at gmail.com wrote:
> Hello,
> Anyone knows how to fetch the data from PDF file having tables with
> other text in Python. Need to fetch some cell values based on
> condition from that table.

You might find pdftotext useful.

The command . . .

  pdftotext -layout somefile.pdf

produces a file named somefile.txt.

This will be completely useless if the original PDF is just
a PDF wrapper around an image.  That's what document scanners
tend to produce.

-- 
To email me, substitute nowhere->runbox, invalid->com.



More information about the Python-list mailing list