Why asci-only symbols?
Bengt Richter
bokr at oz.net
Sat Oct 15 23:59:28 EDT 2005
On Wed, 12 Oct 2005 10:56:44 +0200, =?ISO-8859-1?Q?=22Martin_v=2E_L=F6wis=22?= <martin at v.loewis.de> wrote:
>Mike Meyer wrote:
>> Out of random curiosity, is there a PEP/thread/? that explains why
>> Python symbols are restricted to 7-bit ascii?
>
>No PEP yet; I meant to write one for several years now.
>
>The principles would be
>- sources must use encoding declarations
>- valid identifiers would follow the Unicode consortium guidelines,
> in particular: identifiers would be normalized in NFKC (I think),
> adjusted in the ASCII range for backward compatibility (i.e.
> not introducing any additional ASCII characters as legal identifier
> characters)
>- __dict__ will contain Unicode keys
>- all objects should support Unicode getattr/setattr (potentially
> raising AttributeError, of course)
>- open issue: what to do on the C API (perhaps nothing, perhaps
> allowing UTF-8)
Perhaps string equivalence in keys will be treated like numeric equivalence?
I.e., a key/name representation is established by the initial key/name binding, but
values can be retrieved by "equivalent" key/names with different representations
like unicode vs ascii or latin-1 etc.?
Regards,
Bengt Richter
More information about the Python-list
mailing list