[issue14654] More fast utf-8 decoding

Martin v. Löwis report at bugs.python.org
Tue Apr 24 13:22:06 CEST 2012


Martin v. Löwis <martin at v.loewis.de> added the comment:

I'm -1 on using signed char in the implementation. If this gives any advantage, it's because the compiler is not able to generate as efficient code for unsigned char as it does for signed char. So the performance results may again change if you switch compilers, or use the next compiler version.

The code should do what is *logically* correct; IMO, UTF-8 is really a sequence of unsigned bytes, conceptually.

So if you want to demonstrate any performance improvements, you need to do so with unsigned chars.

----------
nosy: +loewis

_______________________________________
Python tracker <report at bugs.python.org>
<http://bugs.python.org/issue14654>
_______________________________________


More information about the Python-bugs-list mailing list