trouble getting google through urllib

Duncan Booth duncan.booth at invalid.invalid
Tue Dec 19 03:54:38 EST 2006


Will McGugan <will at willNOmcguganSPAM.com> wrote:

> Dr. Locke Z2A wrote:
> 
>> Does anyone know how I would get the bot to have permission to get the
>> url? When I put the url in on firefox it works fine. I noticed that in
>> the output html that google gave me it replaced some of the characters
>> in the url with different stuff like the "&amp" and "%7C", so I'm
>> thinking thats the problem, does anyone know how I would make it keep
>> the url as I intended it to be?
>> 
> 
> Google doesnt like Python scripts. You will need to pretend to be a 
> browser by setting the user-agent string in the HTTP header.
> 
and possibly also run the risk of having your system blocked by Google if 
they figure out you are lying to them?



More information about the Python-list mailing list