[Mailman-Users] UTF-8 and digests...

Mark Sapiro mark at msapiro.net
Fri Nov 30 12:33:21 EST 2018


On 11/30/18 9:18 AM, Grant Taylor via Mailman-Users wrote:
> Is it expected that Mailman will preserve UTF-8 (punctuation symbols) in
> non-MIME digests?


It depends.


> Is this a known limitation of non-MIME digests?  Or is it possibly a
> symptom of a problem?


Plain text digests are encoded in Mailman's character set for the list's
preferred_language. For English, this is us-ascii unless you've changed
it. Thus, non-ASCII characters will be rendered as '?' in the plain digest.
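
As a rough illustration only (this is not Mailman's actual digest code, and
the sample string is made up), the same effect can be reproduced in plain
Python by encoding text to us-ascii with the 'replace' error handler:

```python
# Sketch of the '?' substitution, assuming the digest text is coerced to
# us-ascii with errors='replace'. The sample text is hypothetical.
text = "\u201cquoted\u201d caf\u00e9"   # curly quotes and an accented e

# Every character that us-ascii cannot represent becomes a single '?'.
rendered = text.encode("us-ascii", "replace").decode("us-ascii")

print(rendered)
```

Each curly quote and the accented letter come out as one '?', which is the
kind of result seen with UTF-8 punctuation in a plain digest.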

You can change Mailman's character set for English to UTF-8 by putting

add_language('en', 'English (USA)', 'utf-8', 'ltr')

in mm_cfg.py, but this has other side effects. Most importantly, the
Python email library encodes utf-8 message bodies as base64. This makes
it difficult to find messages in mailboxes with tools like grep.
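
A minimal sketch of the base64 side effect, using only the standard
library's email package and a made-up message body. It is shown with
Python 3's email package and is independent of Mailman itself; it only
demonstrates the library's default for the utf-8 charset:

```python
from email.mime.text import MIMEText

# Hypothetical body containing UTF-8 punctuation.
body = "\u201cquoted\u201d text\n"

# With charset utf-8 the email package picks base64 as the transfer encoding.
utf8_msg = MIMEText(body, "plain", "utf-8")
utf8_cte = utf8_msg["Content-Transfer-Encoding"]

# The stored payload is the base64 text, so the word is not visible to grep.
raw_payload = utf8_msg.get_payload()
word_visible = "quoted" in raw_payload

# For comparison, a pure us-ascii body is left as readable 7bit text.
ascii_msg = MIMEText("plain text\n", "plain", "us-ascii")
ascii_cte = ascii_msg["Content-Transfer-Encoding"]

print(utf8_cte, word_visible, ascii_cte)
```

Decoding with utf8_msg.get_payload(decode=True) recovers the original
bytes, but a plain grep over the mailbox file sees only the base64 text.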

-- 
Mark Sapiro <mark at msapiro.net>        The highway is for gamblers,
San Francisco Bay Area, California    better use your sense - B. Dylan

