[New-bugs-announce] [issue14506] HTMLParser can't handle erroneous end tags with additional tags in it

Olaf Tomalka report at bugs.python.org
Thu Apr 5 13:51:41 CEST 2012


New submission from Olaf Tomalka <olaf.tomalka at gmail.com>:

While this is malformed HTML, I found such an example on a real website, and all browsers handle the bad tag gracefully, whereas Python's HTML parser raises an exception with the message "bad end tag". I think the additional content in the end tag should be ignored, no exception should be raised, and the rest of the page should be parsed.
I'm including a minimal example.
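
For illustration only, here is a rough sketch of the kind of input described above. It is not the attached minimal.py, whose contents are not reproduced in this message. The markup string, the EventRecorder class and the try_parse helper are all hypothetical names chosen for this example. The sketch feeds an end tag that contains another tag to html.parser.HTMLParser and reports whether parsing finished or raised.

```python
from html.parser import HTMLParser


class EventRecorder(HTMLParser):
    """Hypothetical helper: records parser callbacks in the order they fire."""

    def __init__(self):
        super().__init__()
        self.events = []

    def handle_starttag(self, tag, attrs):
        self.events.append(("start", tag))

    def handle_endtag(self, tag):
        self.events.append(("end", tag))

    def handle_data(self, data):
        # Skip whitespace-only text to keep the event list short.
        if data.strip():
            self.events.append(("data", data.strip()))


def try_parse(markup):
    """Return (events, error); error is None if parsing completed."""
    parser = EventRecorder()
    try:
        parser.feed(markup)
        parser.close()
    except Exception as exc:  # broad on purpose: the exception type varies by version
        return parser.events, exc
    return parser.events, None


# Assumed input: the end tag of the first paragraph has a stray tag inside it.
BAD_MARKUP = "<p>first</p <br>\n<p>second</p>"

events, error = try_parse(BAD_MARKUP)
print("error:", error)
for event in events:
    print(event)
```

If the parser behaves as requested, error stays None and the second paragraph shows up in the recorded events. On the Python 3.2 parser described in this report, the expectation is that the exception is caught and reported instead.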

----------
components: Library (Lib)
files: minimal.py
messages: 157570
nosy: ritave
priority: normal
severity: normal
status: open
title: HTMLParser can't handle erroneous end tags with additional tags in it
type: behavior
versions: Python 3.2
Added file: http://bugs.python.org/file25130/minimal.py

_______________________________________
Python tracker <report at bugs.python.org>
<http://bugs.python.org/issue14506>
_______________________________________

