[Mailman-i18n] Subject lines in Archives

Ben Gertzfield che@debian.org
Tue, 02 Apr 2002 18:24:05 +0900


>>>>> "Martin" =3D=3D Martin v L=F6wis <loewis@informatik.hu-berlin.de> wri=
tes:

    Ben> This is again the fact that many charsets have different
    Ben> names as a Python Unicode codec.  It looks like all
    Ben> "windows-foo" charsets need to be mapped to "cpfoo" for the
    Ben> Python Unicode codec.

    Martin> In Python 2.3, this has happened (atleast for those known
    Martin> to IANA). For mailman, it may be desirable to provide some
    Martin> of those mappings even in earlier Python versions; see

    Martin> http://sourceforge.net/tracker/?func=3Ddetail&aid=3D538185&grou=
p_id=3D103&atid=3D300103

Thanks for the patch, Martin.  I think we will need something
similar to this for the Korean Windows charsets, as in all the
Korean spam I get:

Content-Type: text/html; charset=3D"ks_c_5601-1987"

We will probably need some general fallback to replace completely
unknown charsets with some safe US-ASCII text.  Do you think you could
add this?  Say, something like "(text with unknown encoding)".

Ben

--=20
Brought to you by the letters N and E and the number 16.
"Bill Gates is a talented evil man."
Debian GNU/Linux maintainer of Gimp and Nethack -- http://www.debian.org/