Unicode surrogate pairs (Python 3.4)

Jon Ribbens jon+usenet at unequivocal.co.uk
Sun May 3 10:40:05 EDT 2015


If I have a string containing surrogate pairs like this in Python 3.4:

  "\udb40\udd9d"

How do I convert it into the proper form:

  "\U000E019D"

? The answer appears not to be "unicodedata.normalize".



More information about the Python-list mailing list