Parse XML using Python

Anil anilby at gmail.com
Thu Dec 9 23:50:20 EST 2004


William Park wrote:
> anilby at gmail.com wrote:
> > Hi,
> >
> > I wanted to write a script that will read the below file:
> >
> > <abcd label="ABC">
> > .
> > <efg label="EFGA">
> > ....
> > <decg label="ABDG">
> > ..
> > </decg>
> > ..
> >
> > </efg>
> >
> > ..
> > <mon1 label="MON">
> > ..
> > </mon1>
> > ..
> > </abcd>
> > .
> > .
> > <xyz label="A1">
> > .
> > <eg1 label="FGA">
> > ....
> > <dg label="BG">
> > ..
> >
> > </dg>
> >
> > ..
> >
> > </eg1>
> >
> > </xyz>
> >
> > ..
> > and so on
> >
> > The output of the script shud be
> >
> > ABC
> > ..EFGA
> > ....ABDG
> > ..MON
> >
> > A1
> > ..FGA
> > ....BG
> >
> > Please help me in writing a Python script for the above task.
>
> Take a look at
>     http://home.eol.ca/~parkw/park-january.html
> on "Expat XML" section towards the end.  Translating it to Python is
> left for homework.
>
> In essence,
>     indent=..
>     start () {
> 	local "${@:2}"
> 	echo "${indent|*XML_ELEMENT_DEPTH-1}$label"
>     }
>     xml -s start "`< file.xml`"
> which prints
>     ..ABC
>     ....EFGA
>     ......ABDG
>     ....MON
>     ..A1
>     ....FGA
>     ......BG
> with modified input, ie. wrapping XML pieces into single root tree.
>
> --
> William Park <opengeometry at yahoo.ca>
> Open Geometry Consulting, Toronto, Canada
> Linux solution for data processing.
Thanks everyone for the responses. I will try the above solutions.




More information about the Python-list mailing list