lisp is winner in DOM parsing contest! 8-]

Cameron Laird claird at lairds.us
Mon Jul 12 12:06:23 EDT 2004


In article <7xllhpc3yx.fsf at ruckus.brouhaha.com>,
Paul Rubin  <http://phr.cx@NOSPAM.invalid> wrote:
>Peter Hansen <peter at engcorp.com> writes:
>> For example, there are approaches which can read in the
>> document incrementally (and I'm not just talking SAX here),
>> rather than read the whole thing at once.
>
>Rather than either reading incrementally or else slurping in the
>entire document in many-noded glory, I wonder if anyone's implemented
>a parser that scans over the XML doc and makes a compact sequential
>representation of the tree structure, and then provides access methods
>that let you traverse the tree as if it were a real DOM, by fetching
>the appropriate strings from the (probably mmap'ed) disk file as you
>walk around in the tree.

While I don't yet follow all the places this thread has gone,
tDOM <URL: http://wiki.tcl.tk/tdom > is where I turn when I
want *fast* DOMish handling.  Although its author favors Tcl,
there's no particular reason not to use it with Python.



More information about the Python-list mailing list