[Python-Dev] Python-3.0, unicode, and os.environ

Adam Olsen rhamph at gmail.com
Fri Dec 12 10:19:14 CET 2008


On Fri, Dec 12, 2008 at 2:11 AM, André Malo <nd at perlig.de> wrote:
> * Adam Olsen wrote:
>
>> UTF-8 in percent encodings is becoming a defacto standard.  Otherwise
>> the browser has to display the percent escapes in the address bar,
>> rather than the intended text.
>
> Duh! The address bar should contain the URL, which *is* the intended text.
> The escapes are there for a reason. If I pass some octets using percent
> escapes via the query string or request body, it's not text, not even
> intended. It's still a collection of octets. Translating them back (and
> forth when I press enter in the address bar) is a pretty ambigious
> operation and therefore pretty wrong.
>
> The defacto standard does not exist. There's a real one instead: RFC 2396.

All the heaps of people using non-english wikipedia sites might
disagree with you.  There's only, what, a few *million* pages that
would be affected?

It'd be very interesting if someone at Google could provide some
statistics on URL encodings.


-- 
Adam Olsen, aka Rhamphoryncus


More information about the Python-Dev mailing list