[issue7311] Bug on regexp of HTMLParser
Senthil Kumaran
report at bugs.python.org
Thu Apr 7 03:27:55 CEST 2011
Senthil Kumaran <senthil at uthcode.com> added the comment:
> So is the issue7311-3.diff patch fine?
Just that it allows unquoted attrs for unicode too.
My previous suggestion was not to allow unquoted attribute values, but as the change is already made in 2.7 and discussion pointed out a portion in 4.1 spec which allows unquoted attrs for ASCII, it seems fine. html/parse.py will be bit more permissive than what the spec says.
> It changes the strict regex to match the 2.7 one, and leave the tolerant one unchanged.
That is fine.
----------
_______________________________________
Python tracker <report at bugs.python.org>
<http://bugs.python.org/issue7311>
_______________________________________
More information about the Python-bugs-list
mailing list