[Python-bugs-list] [ python-Bugs-210633 ] urlparse (PR#286)

noreply@sourceforge.net noreply@sourceforge.net
Thu, 14 Feb 2002 10:23:19 -0800


Bugs item #210633, was opened at 2000-07-31 14:09
You can respond by visiting: 
http://sourceforge.net/tracker/?func=detail&atid=105470&aid=210633&group_id=5470

Category: Python Library
Group: Not a Bug
Status: Closed
Resolution: Invalid
Priority: 5
Submitted By: Nobody/Anonymous (nobody)
Assigned to: Fred L. Drake, Jr. (fdrake)
Summary: urlparse (PR#286)

Initial Comment:
Jitterbug-Id: 286
Submitted-By: alex@shop.com
Date: Mon, 10 Apr 2000 16:40:57 -0400 (EDT)
Version: >=1.5
OS: win32 linux


urlparse requires that the url contain a "/"
so that
urlparse("http://foo.com?q=a#blah")
results in
("http","foo.com?q=a#blah",....)

urlparse should not require slashes in urls that have fragments or query
strings.



====================================================================
Audit trail:
Tue Jul 11 08:29:15 2000	guido	moved from incoming to open

----------------------------------------------------------------------

Comment By: Walter Dörwald (doerwalter)
Date: 2002-02-14 10:23

Message:
Logged In: YES 
user_id=89016

RFC2396 Section 3.2 states that:
"""The authority component is preceded by a double 
slash "//" and is terminated by the next slash "/", 
question-mark "?", or by the end of the URI."""
So IMHO this would mean that "http://foo.com?q=a#blah"
should be parsed by urlsplit as 
('http', 'foo.com', '', 'q=a', 'blah')
(or maybe ('http', 'foo.com', '/', 'q=a', 'blah'))

----------------------------------------------------------------------

Comment By: Aaron Swartz (aaronsw)
Date: 2001-11-26 16:44

Message:
Logged In: YES 
user_id=122141

RFC2396, not RFC1738 is the latest RFC for URI/URL 
defintions.

----------------------------------------------------------------------

Comment By: Fred L. Drake, Jr. (fdrake)
Date: 2000-08-24 08:07

Message:
RFC 1738, section 3.3, discusses the syntax for HTTP URLs.  It implies that the "/" between the is required if either the path of searchpart of the URL is provided, but is not completely clear.

I don't see anything relevant in RFC 1945 (HTTP 1.0), but RFC 2616 (HTTP 1.1), section 3.2.2 clearly indicates that the search part should only exist as a part of the path component, which is required to include the leading "/".

There is some confusion as to how this should relate to parsing of relative URLs (RFC 1808).  This bug can be re-opened if there's evidence urlparse is actually wrong or inconsistent with other URL parsers.

----------------------------------------------------------------------

Comment By: Fred L. Drake, Jr. (fdrake)
Date: 2000-08-16 18:54

Message:
Assigned to me so I can deal with urlparse all at once.

----------------------------------------------------------------------

You can respond by visiting: 
http://sourceforge.net/tracker/?func=detail&atid=105470&aid=210633&group_id=5470