[issue2066] Adding new CNS11643, a *huge* charset, support in cjkcodecs

Marc-Andre Lemburg report at bugs.python.org
Thu Feb 14 10:59:47 CET 2008


Marc-Andre Lemburg added the comment:

I think Martin was looking for other optimizations that still leave the
data in a static C const (in order to be shared between processes and
only loaded on demand), but do compress the data representation, e.g.
using some form of Huffman coding.

While I don't see adding a few hundred kB of static C data to a DLL as a
major problem (even less so if it's possible to disable support via a
configure switch, e.g. for embedded systems), it would be interesting to
check whether the lookup tables can be compressed by exploiting their
structure.
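
To illustrate what compressing by structure could mean, here is a minimal
sketch of one possible approach: a two-level table in which identical
blocks are stored only once. This is not Huffman coding and not what
cjkcodecs does; the names compress, lookup and BLOCK, the block size and
the toy table are all assumptions made for the example. The point is only
that the result is still plain arrays, which could be emitted as static C
const data and indexed directly.

```python
# Hypothetical sketch: two-level compression of a sparse lookup table.
# All names here are illustrative and are not part of cjkcodecs.

BLOCK = 16  # assumed block size; a real generator would try several


def compress(table, block=BLOCK):
    """Split `table` into fixed-size blocks and store each distinct block once.

    Returns (index, blocks): index[i] is the position in `blocks` of the
    block covering table[i * block:(i + 1) * block].
    """
    # Pad with zeros so the length is a multiple of the block size.
    padded = list(table) + [0] * (-len(table) % block)
    index, blocks, seen = [], [], {}
    for start in range(0, len(padded), block):
        chunk = tuple(padded[start:start + block])
        if chunk not in seen:
            seen[chunk] = len(blocks)
            blocks.append(chunk)
        index.append(seen[chunk])
    return index, blocks


def lookup(index, blocks, code, block=BLOCK):
    """Return the entry for `code` using the two-level structure."""
    return blocks[index[code // block]][code % block]


# Toy table: 4096 slots, mostly unmapped (0), with two mapped runs.
flat = [0] * 4096
for i in range(0x100, 0x140):
    flat[i] = 0x4E00 + (i - 0x100)
for i in range(0x800, 0x810):
    flat[i] = 0x9000 + (i - 0x800)

index, blocks = compress(flat)
stored = len(index) + len(blocks) * BLOCK
print(len(flat), "entries flat,", stored, "entries in two levels")
```

How much this saves on the real CNS11643 tables depends on how many
blocks are empty or repeated, which would have to be measured.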

__________________________________
Tracker <report at bugs.python.org>
<http://bugs.python.org/issue2066>
__________________________________

