[Python-checkins] r57022 - peps/trunk/pep-3131.txt

martin.v.loewis python-checkins at python.org
Tue Aug 14 18:24:05 CEST 2007


Author: martin.v.loewis
Date: Tue Aug 14 18:24:05 2007
New Revision: 57022

Modified:
   peps/trunk/pep-3131.txt
Log:
Specify NFKC instead of NFC.


Modified: peps/trunk/pep-3131.txt
==============================================================================
--- peps/trunk/pep-3131.txt	(original)
+++ peps/trunk/pep-3131.txt	Tue Aug 14 18:24:05 2007
@@ -82,8 +82,8 @@
 (Nd), connector punctuations (Pc), and characters carryig the
 Other_ID_Continue property.
 
-All identifiers are converted into the normal form NFC while parsing;
-comparison of identifiers is based on NFC.
+All identifiers are converted into the normal form NFKC while parsing;
+comparison of identifiers is based on NFKC.
 
 A non-normative HTML file listing all valid identifier characters for
 Unicode 4.1 can be found at
@@ -117,7 +117,7 @@
    non-identifier character (e.g. a space or punctuation character)
 
 2. The entire UTF-8 string is passed to a function to normalize the
-   string to NFC, and then verify that it follows the identifier
+   string to NFKC, and then verify that it follows the identifier
    syntax. No such callout is made for pure-ASCII identifiers, which
    continue to be parsed the way they are today. The Unicode database
    must start including the Other_ID_{Start|Continue} property.
@@ -154,10 +154,6 @@
 for RTL languages); if there is a need, these can be added
 later.
 
-Another open issue is the choice of normalization form: some
-people suggest to use NFKC instead of NFC, others suggest to
-ban compatibility characters.
-
 Some people would like to see an option on selecting support
 for this PEP at run-time; opinions vary on what precisely
 that option should be, and what precisely its default value


More information about the Python-checkins mailing list