[Python-Dev] Python-3.0, unicode, and os.environ
Adam Olsen
rhamph at gmail.com
Fri Dec 12 10:19:14 CET 2008
On Fri, Dec 12, 2008 at 2:11 AM, André Malo <nd at perlig.de> wrote:
> * Adam Olsen wrote:
>
>> UTF-8 in percent encodings is becoming a defacto standard. Otherwise
>> the browser has to display the percent escapes in the address bar,
>> rather than the intended text.
>
> Duh! The address bar should contain the URL, which *is* the intended text.
> The escapes are there for a reason. If I pass some octets using percent
> escapes via the query string or request body, it's not text, not even
> intended. It's still a collection of octets. Translating them back (and
> forth when I press enter in the address bar) is a pretty ambigious
> operation and therefore pretty wrong.
>
> The defacto standard does not exist. There's a real one instead: RFC 2396.
All the heaps of people using non-english wikipedia sites might
disagree with you. There's only, what, a few *million* pages that
would be affected?
It'd be very interesting if someone at Google could provide some
statistics on URL encodings.
--
Adam Olsen, aka Rhamphoryncus
More information about the Python-Dev
mailing list