[Mailman-i18n] Subject lines in Archives
Ben Gertzfield
che@debian.org
Tue, 02 Apr 2002 18:24:05 +0900
>>>>> "Martin" =3D=3D Martin v L=F6wis <loewis@informatik.hu-berlin.de> wri=
tes:
Ben> This is again the fact that many charsets have different
Ben> names as a Python Unicode codec. It looks like all
Ben> "windows-foo" charsets need to be mapped to "cpfoo" for the
Ben> Python Unicode codec.
Martin> In Python 2.3, this has happened (atleast for those known
Martin> to IANA). For mailman, it may be desirable to provide some
Martin> of those mappings even in earlier Python versions; see
Martin> http://sourceforge.net/tracker/?func=3Ddetail&aid=3D538185&grou=
p_id=3D103&atid=3D300103
Thanks for the patch, Martin. I think we will need something
similar to this for the Korean Windows charsets, as in all the
Korean spam I get:
Content-Type: text/html; charset=3D"ks_c_5601-1987"
We will probably need some general fallback to replace completely
unknown charsets with some safe US-ASCII text. Do you think you could
add this? Say, something like "(text with unknown encoding)".
Ben
--=20
Brought to you by the letters N and E and the number 16.
"Bill Gates is a talented evil man."
Debian GNU/Linux maintainer of Gimp and Nethack -- http://www.debian.org/