Grapheme clusters, a.k.a.real characters

Marko Rauhamaa marko at pacujo.net
Fri Jul 14 08:05:48 EDT 2017


Marko Rauhamaa <marko at pacujo.net>:

> Chris Angelico <rosuav at gmail.com>:
>> If you're trying to use strings as identifiers in any way (say, file
>> names, or document lookup references), using the NFC/NFD normalized
>> form of the string should be sufficient.
>
> Show me ten Python3 database applications, and I'll show you ten Python3
> database applications that don't normalize their primary keys.

Besides the normal forms don't help you do text processing (no regular
expression matching, no simple way to get a real character).


Marko



More information about the Python-list mailing list