[Bug 1415406] [NEW] Message excerpt corruption on admindb Web UI

Yasuhito FUTATSUKI at POEM futatuki at poem.co.jp
Wed Jan 28 11:44:13 CET 2015


Public bug reported:

Some messages held on admindb cannot display correctly becase of partial Unicode conversion error or
incomplete multi-byte character on mm_cfg.ADMINDB_PAGE_TEXT_LIMIT boundary.

Message character corruption has been occured in conditions below.

(1) Message charset/encoding is charset/encoding of multi-byte charaters.
(2) Message charset/encoding differs from web display charset/encoding.
(3) Message contains character that cannot convert to Unicode by using Python codec.
or
(3') Message body size exceeds mm_cfg.ADMINDB_PAGE_TEXT_LIMIT in bytes after decoding mime
     and cut down along multi-byte charater's byte sequence.

Under these conditions, Unicode error occur in converting message charset/encoding and message
has remained not to convert charset/encoding.

A patch attached below solves (3) by using decode/encode with 'replace' error handling scheme,
and (3') by rounding on character boundary not to exceeds the limit in bytes after 
character/encoding converted.

Note: Even If Message charset/encoding is same as web display charset/encoding, condition (3')
may produce invalid html, but patch below don't fix it.

** Affects: mailman
     Importance: Undecided
         Status: New

** Attachment added: "patch-admindb-py.txt"
   https://bugs.launchpad.net/bugs/1415406/+attachment/4307114/+files/patch-admindb-py.txt

-- 
You received this bug notification because you are a member of Mailman
Coders, which is subscribed to GNU Mailman.
https://bugs.launchpad.net/bugs/1415406

Title:
  Message excerpt corruption on admindb Web UI

To manage notifications about this bug go to:
https://bugs.launchpad.net/mailman/+bug/1415406/+subscriptions


More information about the Mailman-coders mailing list