how to get 20000 html pages content quickly from one server?

Zachery Bir zbir at urbanape.com
Wed Mar 15 12:53:27 EST 2006


On Mar 15, 2006, at 11:58 AM, JuHui wrote:

> in fact, I want to do a script to get news on others site.
> I must use script get the content and analyze the html code, where is
> the title, where is the body....
> so, I can't ask permission, use wget  and "Physically remove the
> harddrive and reinstall it locally"

The only one it looks like you *can't* do is physically remove the  
hard drive and reinstall it locally. Seems more like you *won't* do  
the other two.

Zac




More information about the Python-list mailing list