[Python-Dev] Bytes path support

Greg Ewing greg.ewing at canterbury.ac.nz
Sun Aug 24 03:11:10 CEST 2014


Isaac Morland wrote:
> In HTML 5 it allows non-ASCII-compatible encodings as long as U+FEFF 
> (byte order mark) is used:
> 
> http://www.w3.org/TR/html-markup/syntax.html#encoding-declaration
> 
> Not sure about XML.

According to Appendix F here:

http://www.w3.org/TR/xml/#sec-guessing

an XML parser needs to be prepared to try all the encodings it
supports until it finds one that works well enough to decode
the XML declaration, then it can find out the exact encoding
used.

-- 
Greg


More information about the Python-Dev mailing list