trouble getting google through urllib

BJörn Lindqvist bjourne at gmail.com
Wed Dec 20 08:49:43 EST 2006


> > > Google doesn't like Python scripts. You will need to pretend to be a
> > > browser by setting the user-agent string in the HTTP header.
> > >
> > and possibly also run the risk of having your system blocked by Google if
> > they figure out you are lying to them?
>
> It is possible. I wrote a 'googlewhack' (remember them?) script a while
> ago, which pretty much downloaded as many google pages as my adsl could
> handle. And they didn't punish me for it. Although apparently they do
> issue short-term bans on IPs that abuse their service.

For Google, that load must be piss in the ocean. I bet for Google to
even notice the abuse, it must be something really, really severe.
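
For reference, the user-agent trick mentioned at the top of the thread
could look roughly like the sketch below. It assumes the modern
urllib.request module rather than the urllib of this thread's era, and
EXAMPLE_URL, BROWSER_UA, build_request and fetch are made-up names for
illustration, not anything from the original posts. Treat it as a
minimal sketch, not a recommended way to query Google.

```python
import urllib.request

# Hypothetical values for illustration; neither appears in the thread.
EXAMPLE_URL = "http://www.google.com/search?q=python"
BROWSER_UA = "Mozilla/5.0 (X11; Linux x86_64)"


def build_request(url, user_agent=BROWSER_UA):
    """Return a Request whose User-Agent header replaces urllib's default."""
    return urllib.request.Request(url, headers={"User-Agent": user_agent})


def fetch(url, user_agent=BROWSER_UA, timeout=10):
    """Fetch url with the custom User-Agent and return the body as bytes."""
    request = build_request(url, user_agent)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return response.read()


if __name__ == "__main__":
    req = build_request(EXAMPLE_URL)
    # urllib stores header names in capitalized form, hence "User-agent".
    print(req.get_header("User-agent"))
```

Calling fetch(EXAMPLE_URL) would perform the actual download; the
__main__ block only builds the request and prints the header, so it
makes no network traffic.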

-- 
mvh Björn


