[issue3300] urllib.quote and unquote - Unicode issues
Bill Janssen
report at bugs.python.org
Tue Aug 12 05:43:38 CEST 2008
Bill Janssen <bill.janssen at gmail.com> added the comment:
Some interesting notes here (from Erik van der Poel at Google; Guido,
you might want to stroll over to his location and talk with him):
http://lists.w3.org/Archives/Public/www-international/2007JanMar/0004.html
and more particularly
http://lists.w3.org/Archives/Public/www-international/2008AprJun/0092.html,
which says, in part,
``Within the context of HTML and HTTP, queries
[that is, the query part of a URL] don't have to say which
charset they are using, because there is already an agreement in
place: the major browsers and servers use the charset of the HTML.''
So, there's still a sizable number of Latin-1 pages out there, and
queries against these pages will use that encoding in the URLs they send.
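
To make the consequence concrete, here is a minimal sketch of how the same query term percent-encodes differently depending on the page's charset. It assumes the `encoding` parameters of `quote` and `unquote` in present-day Python 3's `urllib.parse`, which postdate this comment; the search term is made up for illustration.

```python
from urllib.parse import quote, unquote

# Hypothetical search term typed into a form.
term = "café"

# A browser submitting from a Latin-1 page sends one byte for "é" ...
latin1_query = quote(term, encoding="latin-1")
# ... while a UTF-8 page sends two bytes for the same character.
utf8_query = quote(term, encoding="utf-8")

print(latin1_query)  # caf%E9
print(utf8_query)    # caf%C3%A9

# Decoding works only when the server assumes the same charset as the page.
print(unquote(latin1_query, encoding="latin-1"))  # café

# The default (UTF-8, errors="replace") turns the lone 0xE9 byte into U+FFFD.
print(repr(unquote(latin1_query)))  # 'caf\ufffd'
```

The URL carries no charset information of its own, so the decoder has to be told, or has to guess, which encoding the sender used.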
And then there's this:
http://lists.w3.org/Archives/Public/www-international/2008AprJun/0014.html
_______________________________________
Python tracker <report at bugs.python.org>
<http://bugs.python.org/issue3300>
_______________________________________