Python script for deMSWordification?

Werner Schiendl ws-news at gmx.at
Wed Mar 7 09:50:38 EST 2001


If you have Word on your machine, you can drive Word itself through
Automation to do the job.
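
A rough sketch of the Automation route, assuming a Windows box with Word
installed and the pywin32 package available (neither is a given). The function
name word_doc_to_text is made up for illustration; the value 2 is Word's
wdFormatText constant.

```python
import os

WD_FORMAT_TEXT = 2  # numeric value of Word's wdFormatText (plain text)


def word_doc_to_text(doc_path, txt_path):
    """Ask a locally installed Word to re-save doc_path as plain text."""
    # Imported here so this module still loads where pywin32 is missing.
    import win32com.client

    word = win32com.client.Dispatch("Word.Application")
    word.Visible = False
    try:
        # Use absolute paths; Word resolves relative ones against its own
        # working directory, not the script's.
        doc = word.Documents.Open(os.path.abspath(doc_path))
        try:
            # Second positional argument of SaveAs is the file format.
            doc.SaveAs(os.path.abspath(txt_path), WD_FORMAT_TEXT)
        finally:
            doc.Close(False)  # False: do not save changes to the original
    finally:
        word.Quit()
```

Calling word_doc_to_text("memo.doc", "memo.txt") should leave a plain-text
copy next to the original; the file names here are placeholders.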

Otherwise:

.doc files are basically OLE Compound Documents, each of which represents an
IStorage. IStorage is an interface that models COM structured storage,
alongside IStream.
Compared to a file system, an IStorage is a directory and an IStream is a file.
There are APIs for working with that kind of file.
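
Without those APIs you can still peek at the container by hand. Below is a
minimal sketch in plain Python that checks the compound-file signature and
reads a few header fields. The field offsets are as I understand them from
Microsoft's published description of the compound file format, and the
function name read_compound_header is made up for illustration. It does not
walk the directory (the IStorage/IStream tree), let alone extract text.

```python
import struct

# The 8-byte signature found at the start of every OLE compound file.
CFB_MAGIC = bytes.fromhex("d0cf11e0a1b11ae1")


def read_compound_header(data):
    """Parse a few fields from the first 512 bytes of a compound file."""
    if len(data) < 512 or data[:8] != CFB_MAGIC:
        raise ValueError("not an OLE compound document")
    # Offset 24: minor version, major version, byte-order mark,
    # sector shift, mini-sector shift (all little-endian uint16).
    minor, major, byte_order, sector_shift, mini_shift = struct.unpack_from(
        "<5H", data, 24
    )
    # Offset 40: directory sector count, FAT sector count,
    # first directory sector (all little-endian uint32).
    num_dir, num_fat, first_dir = struct.unpack_from("<3I", data, 40)
    return {
        "major_version": major,
        "minor_version": minor,
        "byte_order": byte_order,
        "sector_size": 1 << sector_shift,
        "mini_sector_size": 1 << mini_shift,
        "num_fat_sectors": num_fat,
        "first_dir_sector": first_dir,
    }
```

Feed it the first 512 bytes of a file opened in binary mode. If it raises
ValueError, the file is not a compound document at all (an RTF file renamed
to .doc, for example).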
With Visual Studio comes a tool called DocFile Viewer that shows the basic
content of the file.

But I guess you want to do that on a non-MS platform.
This sounds to me like big fun, so enjoy it!

hth
werner


<kelvin.chu at uvm.edu> wrote in message
news:985h02$1pbm$1 at swen.emba.uvm.edu...
> Dear fellow Pythoneers;
>
> Numerous colleagues and administration types send memos, colloquium
> announcements and other things around in Micro$oft word format.  Has anyone
> written a filter in Python that can convert these things into normal text?
>
> If not, is there a basic set of rules for decoding .doc format files
> anywhere that I can look at?
>
> Cheers,
>
> -k
>
> --
>
>   kelvin.chu at uvm.edu      (802) 656-0064
>   http://www.uvm.edu/~kchu/ FAX: (802) 656-0817
>




