[New-bugs-announce] [issue10686] email.Generator should use unknown-8bit encoded words for headers with 8 bit data

R. David Murray report at bugs.python.org
Sun Dec 12 19:01:49 CET 2010


New submission from R. David Murray <rdmurray at bitdance.com>:

This is a follow-on to Issue 4661.  The fix for that issue introduced a way to parse messages containing 8bit bytes.  When Generator is called on a model containing 8bit bytes, it converts the message to 7bit clean form.  There is, however, a bug in this conversion process: currently, when it encounters 8bit bytes in headers, it simply replaces them with '?'.  According to the RFCs[*], what it should do instead is replace them with encoded words using the 'charset' "unknown-8bit".

[*] I'm specifically referring to RFC 1428.  The email package is effectively acting as a translating gateway when asked to do the 8bit to 7bit conversion.  Although that RFC does not explicitly say that the unknown-8bit charset should be used in encoded words, its section 3 prescription strongly implies it.
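
To illustrate the proposed behavior, here is a minimal sketch of wrapping raw header bytes in an encoded word with the "unknown-8bit" charset.  The helper name encode_unknown_8bit is hypothetical and is not part of the email package; the sketch assumes base64 ('b') encoding and ignores the RFC 2047 limit of 75 characters per encoded word, so it is not the eventual patch.

```python
import base64


def encode_unknown_8bit(raw: bytes) -> str:
    """Hypothetical helper: make header bytes 7bit clean.

    Pure-ASCII input is returned unchanged. Anything containing 8bit
    bytes is wrapped in a single RFC 2047 encoded word whose charset is
    "unknown-8bit", so the original bytes stay recoverable instead of
    being replaced with '?'.
    """
    if all(byte < 0x80 for byte in raw):
        return raw.decode("ascii")
    payload = base64.b64encode(raw).decode("ascii")
    return "=?unknown-8bit?b?" + payload + "?="


# A header value whose charset cannot be determined (illustrative bytes).
raw_subject = b"caf\xe9"
encoded = encode_unknown_8bit(raw_subject)
print(encoded)
```

Unlike the '?' replacement, this keeps the original bytes intact: a receiver can base64-decode the payload to get them back, even though it still cannot know which charset they were written in.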

----------
assignee: r.david.murray
messages: 123842
nosy: r.david.murray
priority: high
severity: normal
stage: needs patch
status: open
title: email.Generator should use unknown-8bit encoded words for headers with 8 bit data
type: behavior
versions: Python 3.2

_______________________________________
Python tracker <report at bugs.python.org>
<http://bugs.python.org/issue10686>
_______________________________________

