[Python-Dev] PEP 393 review

Victor Stinner victor.stinner at haypocalc.com
Mon Aug 29 11:19:48 CEST 2011


Le 29/08/2011 11:03, Dirkjan Ochtman a écrit :
> On Sun, Aug 28, 2011 at 21:47, "Martin v. Löwis"<martin at v.loewis.de>  wrote:
>>   result strings. In PEP 393, a buffer must be scanned for the
>>   highest code point, which means that each byte must be inspected
>>   twice (a second time when the copying occurs).
>
> This may be a silly question: are there things in place to optimize
> this for the case where two strings are combined? E.g. highest
> character in combined string is max(highest character in either of the
> strings).

The "double-scan" issue is only for codec decoders.

If you combine two Unicode objects (a+b), you already know the highest 
code point and the kind of each string.

Victor


More information about the Python-Dev mailing list