10GB XML Blows out Memory, Suggestions?

cavallo71 a.cavallo at mailsnare.com
Thu Jun 22 05:10:29 EDT 2006


> > I wrote a program that takes an XML file into memory using Minidom. I
> > found out that the XML document is 10gb.
> >
> > I clearly need SAX or something else?

If the data is composed by a large number of records,
like a database dump of some sort,
then probably you could have a look to a stax processor
for python like pulldom.

In this way you could process each single record one at the time,
without loading the entiere document.

Regards,
Antonio




More information about the Python-list mailing list