[Python-checkins] r86678 - in python/branches/release27-maint: Lib/test/test_urllib2.py Lib/urllib.py Misc/NEWS

senthil.kumaran python-checkins at python.org
Mon Nov 22 06:04:33 CET 2010


Author: senthil.kumaran
Date: Mon Nov 22 06:04:33 2010
New Revision: 86678

Log:
Merged revisions 86676 via svnmerge from 
svn+ssh://pythondev@svn.python.org/python/branches/py3k

........
  r86676 | senthil.kumaran | 2010-11-22 12:48:26 +0800 (Mon, 22 Nov 2010) | 4 lines
  
  Fix Issue4493 - urllib2 adds '/' to the path component of url, when it does not
  starts with one. This behavior is exhibited by browser and other clients.
........


Modified:
   python/branches/release27-maint/   (props changed)
   python/branches/release27-maint/Lib/test/test_urllib2.py
   python/branches/release27-maint/Lib/urllib.py
   python/branches/release27-maint/Misc/NEWS

Modified: python/branches/release27-maint/Lib/test/test_urllib2.py
==============================================================================
--- python/branches/release27-maint/Lib/test/test_urllib2.py	(original)
+++ python/branches/release27-maint/Lib/test/test_urllib2.py	Mon Nov 22 06:04:33 2010
@@ -838,6 +838,25 @@
             p_ds_req = h.do_request_(ds_req)
             self.assertEqual(p_ds_req.unredirected_hdrs["Host"],"example.com")
 
+    def test_fixpath_in_weirdurls(self):
+        # Issue4493: urllib2 to supply '/' when to urls where path does not
+        # start with'/'
+
+        h = urllib2.AbstractHTTPHandler()
+        o = h.parent = MockOpener()
+
+        weird_url = 'http://www.python.org?getspam'
+        req = Request(weird_url)
+        newreq = h.do_request_(req)
+        self.assertEqual(newreq.get_host(),'www.python.org')
+        self.assertEqual(newreq.get_selector(),'/?getspam')
+
+        url_without_path = 'http://www.python.org'
+        req = Request(url_without_path)
+        newreq = h.do_request_(req)
+        self.assertEqual(newreq.get_host(),'www.python.org')
+        self.assertEqual(newreq.get_selector(),'')
+
     def test_errors(self):
         h = urllib2.HTTPErrorProcessor()
         o = h.parent = MockOpener()

Modified: python/branches/release27-maint/Lib/urllib.py
==============================================================================
--- python/branches/release27-maint/Lib/urllib.py	(original)
+++ python/branches/release27-maint/Lib/urllib.py	Mon Nov 22 06:04:33 2010
@@ -1052,7 +1052,12 @@
         _hostprog = re.compile('^//([^/?]*)(.*)$')
 
     match = _hostprog.match(url)
-    if match: return match.group(1, 2)
+    if match:
+        host_port = match.group(1)
+        path = match.group(2)
+        if path and not path.startswith('/'):
+            path = '/' + path
+        return host_port, path
     return None, url
 
 _userprog = None

Modified: python/branches/release27-maint/Misc/NEWS
==============================================================================
--- python/branches/release27-maint/Misc/NEWS	(original)
+++ python/branches/release27-maint/Misc/NEWS	Mon Nov 22 06:04:33 2010
@@ -13,6 +13,9 @@
 Library
 -------
 
+- Issue #4493: urllib2 adds '/' in front of path components which does not
+  start with '/. Common behavior exhibited by browsers and other clients.
+
 - Issue #6378: idle.bat now runs with the appropriate Python version rather than
   the system default. Patch by Sridhar Ratnakumar.
 


More information about the Python-checkins mailing list