HTML/DOM parser
Michael Ekstrand
michael at elehack.net
Tue Feb 28 08:33:47 EST 2006
On 28 Feb 2006 00:33:11 -0800
"Xah Lee" <xah at xahlee.org> wrote:
> is there a module that lets me parse validated html files and store it
> as a tree?
BeautifulSoup will parse valid HTML (not just XHTML), and crummy HTML
while it's at it, and it generates a tree structure. Warning: I haven't
actually used it myself; I've only looked at it.
http://www.crummy.com/software/BeautifulSoup/
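
To make "store it as a tree" concrete, here is a rough sketch of the idea using only the standard library's html.parser module. This is not BeautifulSoup's API, just a stand-in that builds a similar kind of tree; the names Node, TreeBuilder, parse_html and find_all are made up for illustration, and the handling of sloppy markup is far cruder than what a real parser library does.

```python
from html.parser import HTMLParser

# Elements that never take a closing tag, so they are not pushed on the stack.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img", "input",
             "link", "meta", "source", "track", "wbr"}


class Node:
    """Hypothetical tree node: a tag name, its attributes, and its children."""

    def __init__(self, tag, attrs=None):
        self.tag = tag
        self.attrs = dict(attrs or [])
        self.children = []  # Node objects and text strings, in document order

    def find_all(self, tag):
        """Return every descendant element with the given tag, depth first."""
        found = []
        for child in self.children:
            if isinstance(child, Node):
                if child.tag == tag:
                    found.append(child)
                found.extend(child.find_all(tag))
        return found

    def text(self):
        """Concatenate all text beneath this node."""
        return "".join(c if isinstance(c, str) else c.text()
                       for c in self.children)


class TreeBuilder(HTMLParser):
    """Hypothetical builder: keeps a stack of the currently open elements."""

    def __init__(self):
        super().__init__()
        self.root = Node("[document]")
        self.stack = [self.root]

    def handle_starttag(self, tag, attrs):
        node = Node(tag, attrs)
        self.stack[-1].children.append(node)
        if tag not in VOID_TAGS:
            self.stack.append(node)

    def handle_endtag(self, tag):
        # Close back to the nearest matching open element; a stray end tag
        # with no matching open element is ignored.
        for i in range(len(self.stack) - 1, 0, -1):
            if self.stack[i].tag == tag:
                del self.stack[i:]
                break

    def handle_data(self, data):
        # Skip whitespace-only runs between tags.
        if data.strip():
            self.stack[-1].children.append(data)


def parse_html(markup):
    """Parse an HTML string and return the root Node of the resulting tree."""
    builder = TreeBuilder()
    builder.feed(markup)
    builder.close()
    return builder.root
```

With that in place, something like parse_html(page).find_all("a") walks the tree and collects the link elements, and each result's attrs dictionary holds its attributes.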
- Michael
--
mouse, n: a device for pointing at the xterm in which you want to type.
-- Fortune