urllib2.urlopen(url) pulling something other than HTML

John J. Lee jjl at pobox.com
Tue Aug 21 17:36:53 EDT 2007


Gabriel Genellina <gagsl-py2 at yahoo.com.ar> writes:
[...]
> Don't even try to understand it - it's a mess. Use the HTMLParser
> module instead.
[...]

Module sgmllib (and therefore module htmllib also) is more tolerant of
bad HTML than module HTMLParser.


John



More information about the Python-list mailing list