successor to htmllib

Martin v. Löwis loewis at informatik.hu-berlin.de
Mon Sep 9 05:20:06 EDT 2002


Erik Price <erikprice at mac.com> writes:

> I noticed that the htmllib module is really best suited to HTML 2.0
> documents.  I was wondering if there was a newer (4+) HTML or even
> XHTML parsing module in development right now.  (By XHTML parsing lib
> I mean perhaps an XML parser that is specifically written for the
> XHTML 1.0 DTD.  I realize that any XML parser -should- work.)

PyXML offers xml.dom.ext.reader.HtmlLib, and xml.dom.ext.reader. Both
create HTML DOM trees, one from a HTML document, the other from a
XHTML document.

HTH,
Martin




More information about the Python-list mailing list