[Tutor] urlparse question

Mon, 26 Aug 2002 11:04:24 -0700 (PDT)

On Mon, 26 Aug 2002 Jaco.Smuts@za.didata.com wrote:

> Hello there
>
> I am busy writing a small app to extract keywords (amongst other things)
> from search engine referer urls stored in a mysql database (using apache
> mod_log_sql).
>
> 1 - Please tell me if I'm re-inventing the wheel.

You're reinventing the wheel.  *grin*

> 2 - If not , what I'm struggling with is to get the differnet url
> querystring parameters, what am I missing ?
>
> example:
> the url is
> http://www.google.com/search?hl=en&lr=&ie=UTF-8&oe=utf-8&q=south+africa+trav
> el+cape+town

The function cgi.parse_qs() will parse out a query string for you:

###
>>> query_string =
"hl=en&lr=&ie=UTF-8&oe=utf-8&q=south+africa+travel+cape+town">>> import
cgi
>>> dict = cgi.parse_qs(query_string)
>>> dict
{'hl': ['en'], 'ie': ['UTF-8'], 'oe': ['utf-8'],
 'q': ['south africa travel cape town']}
###

See:

    http://www.python.org/doc/lib/node298.html

for more information on cgi.parse_qs().

I am a bit concerned, though, that this function wasn't easy to find; the
url parsing stuff seems distributed among the urlparse and cgi modules ---
I think it would be better if they were consolidated into the urlparse
module!  I'll add a feature request on Sourceforge to see if it's possible
to move that function over to urlparse.

Good luck!