Opening MS Word files via Python
Fazer
faizan at jaredweb.com
Sat Apr 24 00:43:04 EDT 2004
jmdeschamps at cvm.qc.ca (jmdeschamps) wrote in message news:<3d06fae9.0404210536.3f277a37 at posting.google.com>...
> Rob Nikander <rnikaREMOVEnder at adelphia.net> wrote in message news:<i7-dnZNwpJ8TfhjdRVn-jg at adelphia.com>...
> > Fazer wrote:
> > > I am curious as to how I should approach this issue. I would just
> > > want to parse simple text and maybe perhaps tables in the future.
> > > Would I have to save the word file and open it in a text editor? That
> > > would kind of....suck... Has anyone else tackled this issue?
> >
> > The win32 extensions for python allow you to get at the COM objects for
> > applications like Word, and that would let you get the text and tables.
> > google: win32 python.
> >
> > word = win32com.client.Dispatch('Word.Application')
> > word.Documents.Open('C:\\myfile.doc')
> >
> > But I don't know the best way to find out the methods and properties of
> > the "word" object.
> >
> > Rob
>
> You can use VBA documentation for Word, and using dot notation and
> normal Pythonesque way of calling functions, play with its diverses
> objects, methods and attributes...
> Here's some pretty straightforward code along these lines:
> #************************
> import win32com.client
> import tkFileDialog
>
> # Launch Word
> MSWord = win32com.client.Dispatch("Word.Application")
> MSWord.Visible = 0
> # Open a specific file
> myWordDoc = tkFileDialog.askopenfilename()
> MSWord.Documents.Open(myWordDoc)
> #Get the textual content
> docText = MSWord.Documents[0].Content
> # Get a list of tables
> listTables= MSWord.Documents[0].Tables
> #************************
>
> Happy parsing,
>
> Jean-Marc
That is Awesome! Thanks!
How would I save something in word format? I am guessing
MSWord.Docments.Save(myWordDoc) or around those lines? where can I
find more documentatin? Thanks.
More information about the Python-list
mailing list