Library for crawling forums

Thomas Wittek mail at gedankenkonstrukt.de
Thu Oct 11 17:22:37 EDT 2007


BlueCrux:
> I'm trying to write a utility to crawl forums and strip posts to be
> gone through offline. Just the content, I don't need to get who posted
> or sigs or any identifying info.
> 
> Can anyone suggest a library that is already geared toward this?

Maybe a combination of mechanize [1] (to fetch and navigate the forum
pages) and BeautifulSoup [2] (to parse the HTML and pull out the post
text)?

[1] http://wwwsearch.sourceforge.net/mechanize/
[2] http://www.crummy.com/software/BeautifulSoup/
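
A rough sketch of how the two could fit together, not a tested crawler.
It assumes the current bs4 package for BeautifulSoup, and that the forum
wraps each post body in <div class="post-content"> and each signature in
an element with class "signature". Both class names are made up for
illustration, so check your forum's markup. extract_posts and
fetch_thread are just hypothetical helper names.

```python
from bs4 import BeautifulSoup


def extract_posts(html, post_selector="div.post-content"):
    """Return the plain text of every element matching post_selector."""
    soup = BeautifulSoup(html, "html.parser")
    posts = []
    for node in soup.select(post_selector):
        # Remove signatures nested inside the post body.
        # ".signature" is an assumed class name, not a forum standard.
        for sig in node.select(".signature"):
            sig.decompose()
        # Join the remaining text fragments with single spaces.
        posts.append(node.get_text(" ", strip=True))
    return posts


def fetch_thread(url):
    """Download one thread page with mechanize and return the raw HTML."""
    # Imported here so extract_posts can be used without mechanize installed.
    import mechanize

    browser = mechanize.Browser()
    browser.set_handle_robots(True)  # respect the site's robots.txt
    response = browser.open(url)
    return response.read()
```

Usage would be something like extract_posts(fetch_thread(thread_url)),
where thread_url is whatever thread page you want to save. Following
"next page" links and logging in are left out here; mechanize can do
both, but the details depend on the forum software.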
-- 
Thomas Wittek
Web: http://gedankenkonstrukt.de/
Jabber: streawkceur at jabber.i-pobox.net
GPG: 0xF534E231


