converting file formats to txt

Gaurav Agarwal gaurav.agarwal1904 at gmail.com
Wed Jul 5 03:31:10 EDT 2006


Hi All,

Thanks for the advice. I am trying out InfoCon, which is part of the
DSpace project. It does file conversions, but it is written in Java and
uses an OpenOffice plugin.

Regards,
Gaurav Agarwal

BartlebyScrivener wrote:
> I suspect you will have to process those formats separately. But the
> good news, at least for doc files, is that there is a script in the
> Python Cookbook 2Ed that does what you want for MS Word docs and
> another script that does it for Open Office docs.
>
> The scripts are 2.26 and 2.27 pages 101-102.
>
> I think you can probably find them at the ActiveState repository also.
>
> http://aspn.activestate.com/ASPN/Cookbook/Python/Recipe/279003
>
> In the book, the title of the script is "Extracting Text from Microsoft
> Word Documents"
>
> It uses PyWin32 extension and COM to perform the conversion.
> 
> rd
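
For the Open Office case mentioned in the quoted message, a rough sketch
using only the Python standard library could look like the one below. It
is not the Cookbook recipe itself. It assumes the document is an
OpenOffice.org/OpenDocument text file, i.e. a zip archive whose body is
stored in content.xml, and the function name odt_to_text is made up for
illustration.

```python
import zipfile
import xml.etree.ElementTree as ET


def odt_to_text(source):
    """Return the paragraph text of an OpenOffice/OpenDocument text file.

    `source` may be a file path or a binary file-like object.
    Assumption: the archive contains a content.xml member.
    """
    with zipfile.ZipFile(source) as archive:
        xml_bytes = archive.read("content.xml")

    root = ET.fromstring(xml_bytes)
    lines = []
    for elem in root.iter():
        # Tags look like "{namespace}p"; compare only the local name so
        # both older .sxw and newer .odt namespaces are accepted.
        local_name = elem.tag.rsplit("}", 1)[-1]
        if local_name in ("p", "h"):  # paragraphs and headings
            lines.append("".join(elem.itertext()))
    return "\n".join(lines)


if __name__ == "__main__":
    import sys

    # Usage: python odt_to_text.py some_document.odt
    print(odt_to_text(sys.argv[1]))
```

Matching on the local tag name keeps the sketch short, but it is crude:
paragraphs nested inside frames or text boxes may be emitted more than
once, so treat the output as a starting point and not as a faithful
conversion.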



