[Web-SIG] parsing of urlencoded data and Unicode

Bill Janssen janssen at parc.com
Tue Jul 29 22:20:17 CEST 2008


> Also I'd say that if you're dealing with text (text/*) and no
> charset is provided (or the caller hasn't given an override
> default charset); then you must assume US-ASCII.  And
> you should allow any UnicodeDecodeErrors to bubble
> up to the caller.  In other words if a user agent sent text
> in ISO-8859-x and didn't say it was doing so, then an
> error should be raised when non-ASCII data is seen.

Yep.

Bill


More information about the Web-SIG mailing list