[issue20559] urllib/http fail to sanitize a non-ascii url

Éric Araujo report at bugs.python.org
Fri Feb 14 19:49:07 CET 2014


Éric Araujo added the comment:

Even if Python 3’s text model is based on Unicode, some data formats have their own rules.  There’s a long debate about whether URIs should be bytes or text; it looks like unlike web browsers, urllib/httplib don’t try to be smart with the URIs they are given but just require them to be properly formatted, i.e. not containing any space or characters that are not %-encoded.

Is the documentation clear about this behaviour?  If not, it would probably be simpler to improve the documentation rather than change the behaviour.

----------
nosy: +eric.araujo

_______________________________________
Python tracker <report at bugs.python.org>
<http://bugs.python.org/issue20559>
_______________________________________


More information about the Python-bugs-list mailing list