[Web-SIG] Request for Comments on upcoming WSGI Changes
Mark Nottingham
mnot at mnot.net
Tue Sep 22 03:21:03 CEST 2009
HTTP headers *are* ASCII; RFC2616 defined them to be ISO-8859-1, but
HTTPbis currently takes the stance that they're ASCII, as in practice
Latin-1 isn't used and may introduce interop problems.
<http://tools.ietf.org/html/draft-ietf-httpbis-p1-messaging-07#section-4.2
>
> Historically, HTTP has allowed field-content with text in the ISO-
> 8859-1 [ISO-8859-1] character encoding (allowing other character
> sets
> through use of [RFC2047] encoding). In practice, most HTTP header
> field-values use only a subset of the US-ASCII charset [USASCII].
> Newly defined header fields SHOULD constrain their field-values to
> US-ASCII characters. Recipients SHOULD treat other (obs-text)
> octets
> in field-content as opaque data.
What does it mean to "support non-ASCII headers"? As per above, the
only sane thing to do is treat them as opaque data, because you can't
be certain of their encoding unless you have knowledge of the header.
On 21/09/2009, at 12:50 AM, Armin Ronacher wrote:
> Also (something I haven't yet filed as a bug because I guess there
> will
> be more changes involved) the HTTP server in Python 3.1 does not
> support
> non-ASCII headers.
--
Mark Nottingham http://www.mnot.net/
More information about the Web-SIG
mailing list