Is Python good for web crawlers?
Diez B. Roggisch
deets at nospam.web.de
Tue Feb 7 12:16:51 EST 2006
Tempo wrote:
> Why do you say that the bottleneck of the crawler will always be
> downloading the page? Is it becasue there isn't already a modual to do
> this and I will have to start from scratch? Or a bandwidth issue?
Because of bandwidth - not necessarily yours directly, but the maximum flow
between your uplink and the site in question. It will always take at least
a fractioin of a second up to several seconds until the data is there - in
that time, lots of python code can run.
Diez
More information about the Python-list
mailing list