trying to parse non valid html documents with HTMLParser

florent florent.newsgroups at kynesthesy.org
Wed Aug 3 11:43:09 EDT 2005


> AFAIK not with HTMLParser or htmllib. You might try (if you haven't done
> yet) htmllib and see, which parser is more forgiving.

You were right, the HTMLParser in htmllib is more permissive. It just
ignores the bad tags!
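
For anyone trying this on Python 3, where htmllib is no longer available, here is a minimal sketch of the same idea. It assumes the standard html.parser.HTMLParser as a stand-in, not the htmllib parser discussed above. TagCollector, collect() and the sample markup are names made up for this illustration.

```python
from html.parser import HTMLParser


class TagCollector(HTMLParser):
    """Hypothetical helper: records the tags and text the parser reports."""

    def __init__(self):
        super().__init__()
        self.start_tags = []
        self.end_tags = []
        self.text = []

    def handle_starttag(self, tag, attrs):
        self.start_tags.append(tag)

    def handle_endtag(self, tag):
        self.end_tags.append(tag)

    def handle_data(self, data):
        # Skip whitespace-only chunks between tags.
        if data.strip():
            self.text.append(data.strip())


def collect(markup):
    """Feed markup to a fresh TagCollector and return it."""
    parser = TagCollector()
    parser.feed(markup)
    parser.close()
    return parser


# Deliberately non-valid sample: <b> and <i> are never closed,
# and </p> arrives while <b> is still open.
broken = "<html><body><p>first<b>bold</p><i>never closed</body>"
result = collect(broken)
print(result.start_tags)
print(result.end_tags)
print(result.text)
```

The parser does not raise on the unbalanced tags. It reports each tag as it meets it, so deciding what to do about the missing closing tags is left to the handler methods.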


