[XML-SIG] XML processing

Stefan Behnel stefan_ml at behnel.de
Sun Feb 15 12:22:44 CET 2009


Stefan Behnel wrote:
> For minidom, I get about 60MB, where Fredrik got 80MB. That's still about a
> factor of 17-23 compared to the serialised XML file, whereas lxml and cET
> end up with a factor of 3-5. Your assumption that you can use a system with
> 3GB of RAM to parse a 500MB XML file into an in-memory tree can easily turn
> out wrong for XML files with more tags and shorter text content (say,
> numbers), or for documents in non-European languages.

I should add that this was measured on a 32-bit system. 64-bit systems will
require even more memory to store the tree, almost twice as much for each
element, since every pointer in the tree is twice as wide.
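
For anyone who wants to check such factors on their own machine, here is a
rough sketch, not the script behind the numbers above. It assumes a modern
Python with the tracemalloc module and uses a synthetic document; make_xml and
tree_memory are names made up for this example. It only covers minidom and the
standard library ElementTree, because tracemalloc only sees allocations made
through Python's own allocators and would likely miss trees held by a C
library such as the libxml2 behind lxml.

```python
import struct
import tracemalloc
import xml.dom.minidom
import xml.etree.ElementTree as ET


def make_xml(n_elements=20000):
    # Hypothetical test input: many small elements with short numeric text,
    # the kind of document where tree overhead dominates.
    items = "".join("<item id='%d'>%d</item>" % (i, i) for i in range(n_elements))
    return ("<root>%s</root>" % items).encode("utf-8")


def tree_memory(parse, data):
    """Return the bytes tracemalloc reports as newly allocated by parse(data)."""
    tracemalloc.start()
    before, _ = tracemalloc.get_traced_memory()
    tree = parse(data)  # hold a reference so the tree is alive when we measure
    after, _ = tracemalloc.get_traced_memory()
    tracemalloc.stop()
    del tree
    return after - before


data = make_xml()
pointer_bits = struct.calcsize("P") * 8  # 32 or 64, depending on the build
results = {
    "minidom": tree_memory(xml.dom.minidom.parseString, data),
    "ElementTree": tree_memory(ET.fromstring, data),
}

print("pointer size: %d bit, serialised size: %d bytes" % (pointer_bits, len(data)))
for name, used in results.items():
    print("%-12s %10d bytes, factor %.1f" % (name, used, used / len(data)))
```

The absolute numbers will depend on the Python version, the platform and the
shape of the document, so treat the output as an indication of the ratio
between tree size and file size rather than as a benchmark.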

Stefan


