[issue18059] Add multibyte encoding support to pyexpat

Serhiy Storchaka report at bugs.python.org
Sat Sep 14 20:53:54 CEST 2013


Serhiy Storchaka added the comment:

Here is a totally rewritten patch, which builds a decoding table on the first request for an encoding and saves it in a cache. Decoding should be very fast.
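
The build-once-and-cache idea can be sketched in Python. This is only an illustration, not the patch's C code: the function name build_decoding_table, the 256-entry table layout, and the use of -1 for bytes that do not decode on their own are all assumptions made for the example.

```python
import codecs
from functools import lru_cache


@lru_cache(maxsize=None)  # the table is built once per encoding name, then reused
def build_decoding_table(encoding):
    """Return a 256-entry tuple for `encoding` (hypothetical helper).

    Entry i holds the code point that the single byte i decodes to,
    or -1 if byte i is not a complete character by itself
    (for example a lead byte of a multibyte sequence, or an undefined byte).
    """
    decoder_factory = codecs.getincrementaldecoder(encoding)
    table = []
    for byte in range(256):
        decoder = decoder_factory(errors="strict")
        try:
            text = decoder.decode(bytes([byte]), final=True)
        except UnicodeDecodeError:
            table.append(-1)
            continue
        table.append(ord(text) if len(text) == 1 else -1)
    return tuple(table)


if __name__ == "__main__":
    table = build_decoding_table("shift_jis")
    print(table[0x41], table[0x81])
```

The first call for an encoding pays the cost of probing all 256 byte values; later calls with the same name return the cached tuple.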

Do you have large test XML files with multibyte encodings? Could you please measure the time to parse these files and, for comparison, the time to parse the same files encoded in UTF-8 and UTF-16?
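
One possible way to take such a measurement is sketched below. The helpers make_document and time_parse and the synthetic document are assumptions made for the example; real test files would be more representative. On an interpreter without the patch, parsing a document declared with a multibyte encoding may raise an error instead of producing a timing.

```python
import time
import xml.parsers.expat


def make_document(encoding, repeat=1000):
    """Build a synthetic XML document encoded in `encoding` (hypothetical helper)."""
    body = "<item>text \u00e9\u00fc</item>" * repeat
    text = '<?xml version="1.0" encoding="%s"?><root>%s</root>' % (encoding, body)
    return text.encode(encoding)


def time_parse(data, rounds=5):
    """Return the best wall-clock time, in seconds, over `rounds` full parses."""
    best = float("inf")
    for _ in range(rounds):
        # A parser object can only parse one document, so create a new one each round.
        parser = xml.parsers.expat.ParserCreate()
        start = time.perf_counter()
        parser.Parse(data, True)
        best = min(best, time.perf_counter() - start)
    return best


if __name__ == "__main__":
    for name in ("utf-8", "utf-16"):
        print(name, time_parse(make_document(name)))
```

Taking the minimum over several rounds reduces noise from other processes; the same loop can be pointed at a multibyte encoding name once the patch is applied.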

----------
Added file: http://bugs.python.org/file31758/pyexpat_multibyte_encodings.patch

_______________________________________
Python tracker <report at bugs.python.org>
<http://bugs.python.org/issue18059>
_______________________________________

