Fulltext is a simple Python library for converting document and media files to text. It's main purpose is for use with full-text indexing systems. https://github.com/btimby/fulltext http://pypi.python.org/pypi/fulltext/0.1-1 For example, to easily extract text from a PDF file: > python > import fulltext > fulltext.get('resume.pdf') 'Experience: ...'