Language design
Mark Janssen
dreamingforward at gmail.com
Wed Sep 11 20:25:57 EDT 2013
>> On Tue, 10 Sep 2013, Ben Finney wrote:
>> > The sooner we replace the erroneous
>> > “text is ASCII” in the common wisdom with “text is Unicode”, the
>> > better.
>>
>> I'd actually argue that it's better to replace the common wisdom with
>> "text is binary data, and we should normally look at that text through
>> Unicode eyes". A little less catchy, but more accurate ;)
>
> No, that's inaccurate. A sequence of bytes is binary data. Unicode is
> not binary data.
Well now, this is an area that is not actually well-defined. I would
say 16-bit Unicode is binary data if you're encoding in base 65,536,
just as 8-bit ascii is binary data if you're encoding in base-256.
Which is to say: there is no intervening data to suggest a TYPE.
--
MarkJ
Tacoma, Washington
More information about the Python-list
mailing list