python 2.7.12 on Linux behaving differently than on Windows

Marko Rauhamaa marko at pacujo.net
Fri Dec 9 07:34:16 EST 2016


eryk sun <eryksun at gmail.com>:

> Windows NTFS doesn't normalize names to a canonical form. It also
> allows lone surrogate codes, which is invalid UTF-16.

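Somewhat related, surrogate code points (U+D800..U+DFFF) are not Unicode
scalar values, so a lone surrogate has no well-formed UTF-8, UTF-16 or
UTF-32 encoding and arguably shouldn't be allowed in Unicode strings.
However, Python does allow them.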
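
To illustrate, here is a minimal sketch assuming Python 3 (the thread is
about 2.7.12, where the codec behavior may differ). The helper name
strict_utf8_fails is made up for this example; "surrogatepass" is one of
the standard codec error handlers.

```python
# Assumes Python 3. A str can hold a lone surrogate code point.
lone = "\ud800"  # high surrogate with no low surrogate following it
assert len(lone) == 1


def strict_utf8_fails(s):
    """Return True if the default (strict) UTF-8 encoder rejects s."""
    try:
        s.encode("utf-8")
    except UnicodeEncodeError:
        return True
    return False


# The strict encoder refuses the string, since the result would not be
# well-formed UTF-8.
rejected = strict_utf8_fails(lone)

# The "surrogatepass" error handler encodes the surrogate anyway, and
# decodes it back when asked to.
raw = lone.encode("utf-8", "surrogatepass")
roundtrip = raw.decode("utf-8", "surrogatepass")
```

So the string object itself is accepted; the problem only surfaces when
the string has to be encoded for output.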


Marko


