Unicode and Python - how often do you index strings?

alister alister.nospam.ware at ntlworld.com
Thu Jun 5 11:39:30 EDT 2014


On Thu, 05 Jun 2014 00:06:54 -0700, wxjmfauth wrote:

> Le mercredi 4 juin 2014 16:50:59 UTC+2, Michael Torrie a écrit :
>> On 06/04/2014 12:50 AM, wxjmfauth at gmail.com wrote:
>> 
>> > Like many, you are not understanding unicode because
>> 
>> > you do not understand the coding of characters.
>> 
>> 
>> 
>> If that is true, then I'm sure a well-written paragraph or two can set
>> 
>> him straight.  You continually berate people for not understanding
>> 
>> unicode, but you've posted nothing to explain anything, nor demonstrate
>> 
>> your own understanding.  That's one reason your posts are so
>> frustrating
>> 
>> and considered trolling.  You never ever explain yourself, instead just
>> 
>> flailing around and muttering about folks not understanding unicode,
>> 
>> just as you've done here, true to form.
>> 
>> 
>> 
>> 
>> > 
>> > You do not understand the coding of the characters
>> 
>> > because you do not understand the mathematics behind it.
>> 
>> 
>> 
>> flamebaiting here... FSR *is* UTF-32 internally, compresses off leading
>> 
>> zero bits during string creation.
>> 
>> 
>> 
>> > You focussed on the wrong problem.
>> 
>> 
>> 
>> Frankly it is you who is focused on the wrong problem, at least with
>> 
>> this particular thread.  I think you got distracted by the subject
>> line.
>> 
>>  Chris's original post really has nothing to do with unicode at all.
>> 
>> He's simply asking for use cases for string indexing where O(1) is
>> 
>> desired or necessary.  Could be old Python 2 byte strings, or Python 3
>> 
>> unicode strings.  It does not matter.  Unicode is orthogonal to his
>> 
>> question.
>> 
>> 
>> 
>> Maybe his purpose in asking the question is to justify a fixed-length
>> 
>> encoding scheme (which is what FSR actually is), or maybe it is to
>> 
>> explore the costs of using a much slower, but more compact,
>> 
>> variable-length encoding scheme like UTF-8.  Particularly in the
>> context
>> 
>> of low-memory applications where unicode support would be nice, but
>> 
>> memory is at a premium.  But either way, you got hung up on the wrong
>> thing.
>> 
>> 
>> 
>> 
>> > 
>> > (All this stuff has been discussed, tested and worked on
>> 
>> > 20 (twenty) years ago.)
>> 
>> 
>> > 
>> > Sorry.
>> 
>> 
>> 
>> As am I.
> 
> =========
> 
> Unicode ?
> I have the feeling is similar as explaining,
> i (the imaginary number) is not equal to sqrt(-1).
> 
> jmf
> 
> PS Once I gave you a link pointing to unicode.org doc, you obviously did
> not read it.



And you have may time been given a link explaining the problems with 
posting g=from google groups but deliberately choose to not make your 
replys readable.

-- 
If you're not part of the solution, you're part of the precipitate.



More information about the Python-list mailing list