[Mailman-Users] A scrubber issue

Todd Zullinger tmz at pobox.com
Sat Dec 9 18:38:43 CET 2006


Related to the second part of Werner's message being scrubbed with the
message:

    An embedded and charset-unspecified text was scrubbed...

Poking in the email package (on python 2.4.4) shows:

    def get_content_charset(self, failobj=None):
        """Return the charset parameter of the Content-Type header.

        The returned string is always coerced to lower case.  If there is no
        Content-Type header, or if that header has no charset parameter,
        failobj is returned.
        """

This seems to violate section 5.2 of RFC 2045 which says parts lacking
a Content-type header should be assumed to be text/plain with a
charset of us-ascii.  The get_content_type method in email.Message
does mention RFC 2045 and uses text/plain if the content-type is
invalid.  Would it be appropriate to set failobj="us-ascii" when
calling this method in Scrubber.py?

BTW, if it's easier for you guys to track this in bugzilla, let me
know and I'll file on one SF.

-- 
Todd        OpenPGP -> KeyID: 0xBEAF0CE3 | URL: www.pobox.com/~tmz/pgp
======================================================================
The marvel of all history is the patience with which men and women
submit to burdens unnecessarily laid upon them by their governments.
    -- William H. Borah

-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 542 bytes
Desc: not available
Url : http://mail.python.org/pipermail/mailman-users/attachments/20061209/f4624887/attachment.pgp 


More information about the Mailman-Users mailing list