[issue10733] plistlib rejects strings containing control characters

Behdad Esfahbod report at bugs.python.org
Thu Apr 23 20:33:14 CEST 2015


Behdad Esfahbod added the comment:

> Replacing all control characters by entities before trying to parse the Plist XML would likely be the best way forward. 

That wouldn't work.  Control characters are disallowed in XML's character set, so they are invalid even if input as entities.

Unfortunately this causes a lot of trouble for clients [0], because it means that XML cannot represent the full Unicode repertoire.  I'm curious about alternates.  Perhaps the expat module can be extended to allow recovering from this if the client chooses to...

[0] eg. https://github.com/behdad/fonttools/issues/249

----------
nosy: +Behdad.Esfahbod

_______________________________________
Python tracker <report at bugs.python.org>
<http://bugs.python.org/issue10733>
_______________________________________


More information about the Python-bugs-list mailing list