[issue803422] sgmllib doesn't support hex or Unicode character references
Fred L. Drake, Jr.
report at bugs.python.org
Fri Mar 14 17:30:03 CET 2008
Fred L. Drake, Jr. <fdrake at acm.org> added the comment:
SGML TC 2 can be found here:
http://www1.y12.doe.gov/capabilities/sgml/wg8/document/1955.htm
See the section K.4.1 for hexidecimal character references.
Since this is really an update to the SGML standard, and not part of the
original, any support for this should be an optional feature. It's
really only interesting on the web, where standards compliance is... a
little on the lax side. It would be reasonable to enable this by
default from htmllib (if not already supported in htmllib; I don't
remember).
I'm fairly sure hex character references are already supported in
HTMLParser.
----------
nosy: +fdrake
____________________________________
Tracker <report at bugs.python.org>
<http://bugs.python.org/issue803422>
____________________________________
More information about the Python-bugs-list
mailing list