[ python-Bugs-1058351 ] Blatantly false statement in the Unicode section

SourceForge.net noreply at sourceforge.net
Mon Nov 1 21:22:44 CET 2004


Bugs item #1058351, was opened at 2004-11-02 04:47
Message generated for change (Comment added) made by perky
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=105470&aid=1058351&group_id=5470

Category: Documentation
Group: None
>Status: Closed
>Resolution: Invalid
Priority: 5
Submitted By: Olivier Verdier (chtito)
Assigned to: Nobody/Anonymous (nobody)
Summary: Blatantly false statement in the Unicode section

Initial Comment:
The statement is "you will find it convenient that the lower 256 
characters of Unicode are the same as the 256 characters of Latin
-1". 

That is obviously false. Only the first 128 characters (those 
encoded on the first 7 bits) are the same in usual one byte 
encodings (latin-x, for instance) and unicode. The eighth bit is used 
by unicode to trigger the multi-byte mechanism, whereas it is used 
to encode further 128 characters in one byte encodings.

----------------------------------------------------------------------

>Comment By: Hye-Shik Chang (perky)
Date: 2004-11-02 05:22

Message:
Logged In: YES 
user_id=55188

No. Latin-1 exactly overlaps over Unicode for 0~255.
You seem to confuse Unicode encodings and Unicode itself. 
In the context, we're saying about not encodings such as
UTF-8 but Python's "unicode" built-in type which is based on
Unicode character set.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=105470&aid=1058351&group_id=5470


More information about the Python-bugs-list mailing list