minidom and pulldom

Martin v. Löwis martin at v.loewis.de
Thu Dec 11 16:08:40 EST 2003


pinto at map.com (David Pinto) writes:

> I'm trying to use either the minidom or pulldom to find table tags in
> html web pages.  I've tried parsing two web pages that show up fine in
> my browser, but I get errors when I call minidom.parse, or try to get
> events with pulldom.  Is there a parser that is as forgiving as web
> browsers?

minidom is an XML parser. Most Web pages are not XML, but some form of
HTML.

You should have better chances with parsing HTML using htmllib.

Regards,
Martin





More information about the Python-list mailing list