[Python-Dev] Unicode charmap decoders slow

"Martin v. Löwis" martin at v.loewis.de
Wed Oct 5 20:40:04 CEST 2005


Walter Dörwald wrote:
> OK, here's a patch that implements this enhancement to 
> PyUnicode_DecodeCharmap(): http://www.python.org/sf/1313939

Looks nice!

> Creating the decoding_map as a string should probably be done by 
> gencodec.py directly. This way the first import of the codec would be 
> faster too.

Hmm. How would you represent the string in source code? As a Unicode
literal? With \u escapes, or in a UTF-8 source file? Or as a UTF-8
string, with an explicit decode call?
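
To make the three options concrete, here is a rough sketch. The three-entry
table is made up for illustration and is not taken from gencodec.py or the
patch, and the syntax is present-day Python 3 rather than what a 2005 codec
module would contain:

```python
# Hypothetical 3-entry table: byte 0 -> 'A', byte 1 -> EURO SIGN, byte 2 -> 'ö'.
# Illustration only; a real charmap table would cover all 256 byte values.

# Option 1: Unicode literal with \u escapes (the source file stays pure ASCII).
table_escapes = "\u0041\u20ac\u00f6"

# Option 2: literal characters, which requires a UTF-8 encoded source file.
table_literal = "A€ö"

# Option 3: a UTF-8 byte string with an explicit decode call at import time.
table_decoded = b"A\xe2\x82\xac\xc3\xb6".decode("utf-8")

# All three spellings produce the same table.
assert table_escapes == table_literal == table_decoded
```

Option 1 survives any editor or transfer encoding, option 2 is the easiest to
read but depends on the file encoding, and option 3 pays for a decode on import.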

I like the current dictionary style because it is readable, and it also
puts the Unicode character names in comments.
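
For reference, the dictionary style looks roughly like the sketch below. The
three entries are hypothetical (not copied from any real codec), and the code
uses present-day Python 3 with codecs.charmap_decode, which accepts such a
mapping:

```python
import codecs
import unicodedata

# Hypothetical decoding map in the dictionary style: byte value -> code point,
# with the Unicode character name carried in a trailing comment.
decoding_map = {
    0x00: 0x0041,  # LATIN CAPITAL LETTER A
    0x01: 0x20AC,  # EURO SIGN
    0x02: 0x00F6,  # LATIN SMALL LETTER O WITH DIAERESIS
}

# Decode three bytes through the map; the result is (text, bytes consumed).
decoded, consumed = codecs.charmap_decode(b"\x00\x01\x02", "strict", decoding_map)

# The names in the comments can be cross-checked against the Unicode database.
names = [unicodedata.name(ch) for ch in decoded]
```

Each line documents itself, which is the readability being argued for here.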

Regards,
Martin

