[issue7311] Bug on regexp of HTMLParser

Senthil Kumaran report at bugs.python.org
Thu Apr 7 03:27:55 CEST 2011


Senthil Kumaran <senthil at uthcode.com> added the comment:

> So is the issue7311-3.diff patch fine? 

Just that it allows unquoted attrs for unicode too.

My previous suggestion was not to allow unquoted attribute values, but as the change is already made in 2.7 and discussion pointed out a portion in 4.1 spec which allows unquoted attrs for ASCII, it seems fine. html/parse.py will be bit more permissive than what the spec says.

> It changes the strict regex to match the 2.7 one, and leave the tolerant one unchanged.

That is fine.

----------

_______________________________________
Python tracker <report at bugs.python.org>
<http://bugs.python.org/issue7311>
_______________________________________


More information about the Python-bugs-list mailing list