HTML/DOM parser

Michael Ekstrand michael at elehack.net
Tue Feb 28 08:33:47 EST 2006


On 28 Feb 2006 00:33:11 -0800
"Xah Lee" <xah at xahlee.org> wrote:
> is there a module that lets me parse validated html files and store it
> as a tree?

BeautifulSoup will parse valid HTML (not just XHTML), and also crummy
HTML while it's at it. And generates a tree structure. Warning: I
haven't actually used this myself, I've just looked at it.

http://www.crummy.com/software/BeautifulSoup/

- Michael

-- 
mouse, n: a device for pointing at the xterm in which you want to type.
                -- Fortune



More information about the Python-list mailing list