HTML parsing/scraping & python

gene tani gene.tani at gmail.com
Sun Dec 4 20:47:44 EST 2005


John J. Lee wrote:
> Sanjay Arora <sanjay.k.arora at gmail.com> writes:
>
> > We are looking to select the language & toolset more suitable for a
> > project that requires getting data from several web sites in real
> > time (HTML parsing/scraping). It would require full emulation of the
> > browser, including handling cookies, automated logins & following
> > multiple web-link paths. Multithreading would be a plus but not a
> > requirement.
> [...]
>
> What's the application?
>
>
> John

I'll do your googling for you ;-p

(The python.org topic guide, first link below, needs to be updated for
mechanize, PAMIE, Beautiful Soup, ClientTable, pullparser, etc.)
http://www.python.org/topics/web/HTML.html
http://blog.ianbicking.org/best-of-the-web-app-test-frameworks.html
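
For a rough idea of the two basic pieces the original question asks about
(keeping cookies across requests and following links), here is a minimal
sketch using only the standard library of present-day Python 3. It is not
mechanize or Beautiful Soup, and it does not emulate a browser; the names
LinkCollector, make_session and extract_links, and the example.com URLs,
are made up for illustration.

```python
from html.parser import HTMLParser
from http.cookiejar import CookieJar
from urllib.parse import urljoin
from urllib.request import HTTPCookieProcessor, build_opener


class LinkCollector(HTMLParser):
    """Collect absolute URLs from the href attribute of every <a> tag."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url  # used to resolve relative links
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(urljoin(self.base_url, value))


def make_session():
    """Return (opener, jar); the opener stores and resends cookies."""
    jar = CookieJar()
    return build_opener(HTTPCookieProcessor(jar)), jar


def extract_links(html, base_url):
    """Parse an HTML string and return the absolute links it contains."""
    parser = LinkCollector(base_url)
    parser.feed(html)
    return parser.links


if __name__ == "__main__":
    # Hypothetical page content; no network access is needed for this demo.
    sample = '<p><a href="/login">Log in</a> <a href="http://example.org/x">x</a></p>'
    print(extract_links(sample, "http://example.com/start"))
```

In real use you would call opener.open(url) on the opener returned by
make_session(), read and decode the response, and pass the text to
extract_links() to decide which pages to fetch next. Login forms,
JavaScript and malformed HTML are where the libraries listed above earn
their keep.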



