lisp is winner in DOM parsing contest! 8-]
Cameron Laird
claird at lairds.us
Mon Jul 12 12:06:23 EDT 2004
In article <7xllhpc3yx.fsf at ruckus.brouhaha.com>,
Paul Rubin <http://phr.cx@NOSPAM.invalid> wrote:
>Peter Hansen <peter at engcorp.com> writes:
>> For example, there are approaches which can read in the
>> document incrementally (and I'm not just talking SAX here),
>> rather than read the whole thing at once.
>
>Rather than either reading incrementally or else slurping in the
>entire document in many-noded glory, I wonder if anyone's implemented
>a parser that scans over the XML doc and makes a compact sequential
>representation of the tree structure, and then provides access methods
>that let you traverse the tree as if it were a real DOM, by fetching
>the appropriate strings from the (probably mmap'ed) disk file as you
>walk around in the tree.
While I don't yet follow all the places this thread has gone,
tDOM <URL: http://wiki.tcl.tk/tdom > is where I turn when I
want *fast* DOMish handling. Although its author favors Tcl,
there's no particular reason not to use it with Python.
More information about the Python-list
mailing list