[issue803422] sgmllib doesn't support hex or Unicode character references

Fred L. Drake, Jr. report at bugs.python.org
Fri Mar 14 17:30:03 CET 2008


Fred L. Drake, Jr. <fdrake at acm.org> added the comment:

SGML TC 2 can be found here:
http://www1.y12.doe.gov/capabilities/sgml/wg8/document/1955.htm

See the section K.4.1 for hexidecimal character references.

Since this is really an update to the SGML standard, and not part of the
original, any support for this should be an optional feature.  It's
really only interesting on the web, where standards compliance is... a
little on the lax side.  It would be reasonable to enable this by
default from htmllib (if not already supported in htmllib; I don't
remember).

I'm fairly sure hex character references are already supported in
HTMLParser.

----------
nosy: +fdrake

____________________________________
Tracker <report at bugs.python.org>
<http://bugs.python.org/issue803422>
____________________________________


More information about the Python-bugs-list mailing list