[issue21081] missing vietnamese codec TCVN 5712:1993 in Python

Marc-Andre Lemburg report at bugs.python.org
Fri Mar 28 11:09:56 CET 2014


Marc-Andre Lemburg added the comment:

Some comments:

* Please provide some background information how widely the encoding is used. I get less than 1000 hits in Google when looking for "TCVN 5712:1993". Now, the encoding was a standard in Vietnam, but it has been updated in 1999 to TCVN 5712:1999. There's also an encoding called VSCII.

* In the file you write "kind of TCVN 5712:1993 VN3 with CP1252 additions". This won't work, since we can only accept codecs which are based on set standards. It would be better to provide a link to an official Unicode character set mapping table and then use the gencodec.py script on this table.

* For Vietnamese, Python already provides cp1258 - how much is this encoding used in comparison to e.g. TCVN 5712:1993 ?

Resources:

 * Vietnamese encodings: http://www.panl10n.net/english/outputs/Survey/Vietnamese.pdf

 * East Asian encodings: http://www.unicode.org/iuc/iuc15/tb1/slides.pdf

----------
nosy: +lemburg

_______________________________________
Python tracker <report at bugs.python.org>
<http://bugs.python.org/issue21081>
_______________________________________


More information about the Python-bugs-list mailing list