[issue20559] urllib/http fail to sanitize a non-ascii url
Éric Araujo
report at bugs.python.org
Fri Feb 14 19:49:07 CET 2014
Éric Araujo added the comment:
Even if Python 3’s text model is based on Unicode, some data formats have their own rules. There’s a long debate about whether URIs should be bytes or text; it looks like unlike web browsers, urllib/httplib don’t try to be smart with the URIs they are given but just require them to be properly formatted, i.e. not containing any space or characters that are not %-encoded.
Is the documentation clear about this behaviour? If not, it would probably be simpler to improve the documentation rather than change the behaviour.
----------
nosy: +eric.araujo
_______________________________________
Python tracker <report at bugs.python.org>
<http://bugs.python.org/issue20559>
_______________________________________
More information about the Python-bugs-list
mailing list