[ mailman-Patches-866238 ] Scrubber for multi-charset language

SourceForge.net noreply at sourceforge.net
Sat Dec 27 02:59:32 EST 2003


Patches item #866238, was opened at 2003-12-27 07:59
Message generated for change (Tracker Item Submitted) made by Item Submitter
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=300103&aid=866238&group_id=103

Category: internationalization
Group: Mailman 2.1
Status: Open
Resolution: None
Priority: 7
Submitted By: Tokio Kikuchi (tkikuchi)
Assigned to: Nobody/Anonymous (nobody)
Summary: Scrubber for multi-charset language

Initial Comment:
Mailman i18n allows more than one charset per language.
For Japanese, for example, 'iso-2022-jp' is used for mail
messages and 'euc-jp' for HTML and internal text.
Scrubber.py fails to recognize the latter charset because
it calls get_content_charset(), which returns the mail
message charset.

An example of this bug is at
http://snow.is.kochi-u.ac.jp/pipermail2/mailman2/2003-December/000013.html
whereas the output we want is
http://snow.is.kochi-u.ac.jp/pipermail2/mailman2/2003-December/000017.html
(Compare the page sources, because your browser may
not be capable of displaying Japanese fonts.)

This patch tries get_charset() first and, if it returns
nothing, falls back to get_content_charset(). Note that the
former returns a Charset instance, not a charset string.
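
The fallback described above could be sketched roughly as follows.
This is not the patch itself: effective_charset() and its default
argument are hypothetical names used only for illustration, and the
sketch targets the current Python 3 standard-library email package
rather than the email package shipped with Mailman 2.1.

```python
from email.charset import Charset
from email.message import Message


def effective_charset(part, default="us-ascii"):
    """Return a charset name for a message part (hypothetical helper).

    Prefer the Charset instance attached to the part; fall back to the
    charset parameter of the Content-Type header when none is attached.
    """
    cset = part.get_charset()
    if cset is not None:
        # get_charset() returns a Charset instance, not a string, so
        # take the name of its input charset explicitly.
        return cset.input_charset
    # No Charset instance attached: use the Content-Type parameter,
    # or the supplied default if the header carries no charset.
    return part.get_content_charset(default)


# Illustration: a part carrying an 'euc-jp' Charset instance.
part = Message()
part.set_charset(Charset("euc-jp"))
print(effective_charset(part))      # name from the Charset instance
print(part.get_content_charset())   # name from the Content-Type header
```

In the standard library's charset table, 'euc-jp' input is mapped to
'iso-2022-jp' output, so the two calls in the illustration can
disagree, which matches the mismatch reported here.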



----------------------------------------------------------------------



More information about the Mailman-coders mailing list