Urllib(1/2) how to open multiple client sockets?
Gerhard Häring
gh at ghaering.de
Thu Jun 26 07:03:27 EDT 2008
ShashiGowda wrote:
> Hey there i made a script to download all images from a web site but
> it runs damn slow though I have a lot of bandwidth waiting to be used
> please tell me a way to use urllib to open many connections to the
> server to download many pics simultaneously.... Any off question
> suggestions are also ok...
"Simultaneously" means using multiple threads (or processes).
You could create worker threads that do the downloading (use the
threading module for this) and communicate with them via a queue (use
the Queue module for this).
For example the main thread would then do the HTML parsing and push the
image URLs to the worker threads via the queue.
Each worker thread then polls the queue and downloads the images it gets.
One issue is how to terminate the multithreaded application. A simple
approach is to use a sentinel value to push to the queue to signal the
worker threads to quit. Something like "QUIT" will do. Be sure to push
at least as many sentinel values as you have worker threads, then.
-- Gerhard
More information about the Python-list
mailing list