unicode mail list archeology

Luuk luuk at invalid.lan
Sat Apr 20 06:50:46 EDT 2019


On 20-4-2019 12:47, Luuk wrote:
> On 20-4-2019 11:26, wxjmfauth at gmail.com wrote:
>> http://unicode.org/mail-arch/unicode-ml/Archives-Old/UML018/0594.html
>>
> 
> [quoot]
>  > It is simple to make a compacter version of UTF-8 using the base
>  > 256 character codes were possible (comacter for many languages).
> 
> No. If you think otherwise, you have completely misunderstood what UTF-8
> is all about. Please read the section "What is UTF-8?" in
>    http://www.cl.cam.ac.uk/~mgk25/unicode.html
> carefully then you will see, why a base256 transfer encoding lacks
> essential properties that make UTF-8 so damn useful.
> [/quoot]
> 
> I must be one of the persons who do not understand what base256 transfer 
> encoding means.
> 
> UTF-8 is, in bytes, just a sequence of 8 bit things, why can it not be 
> transferred using a bas256 transfer  encoding?
> 
> $ echo "just my € 0.02 cents" | hexdump -C
> 6a 75 73 74 20 6d 79 20  e2 82 ac 20 30 2e 30 32 20 63 65 6e 74 73 0a
> 

This is about python...

luuk at computer:$ python
Python 2.7.15rc1 (default, Nov 12 2018, 14:31:15)
[GCC 7.3.0] on linux2
Type "help", "copyright", "credits" or "license" for more information.
 >>> a="just my € 0.02 cents"
 >>> a
'just my \xe2\x82\xac 0.02 cents'
 >>>

-- 
Luuk



More information about the Python-list mailing list