Web Scraping/Site Scraping

Andrew Bennetts andrew-pythonlist at puzzling.org
Mon Jul 12 21:05:31 EDT 2004


On Sun, Jul 11, 2004 at 01:42:47PM +0000, David Jones wrote:
> Hi, I'm interested in learning about web scraping/site scraping using
> Python.  Does anybody know of any online resources, or have any modules
> available that would help?  O'Reilly published an interesting book,
> "Spidering Hacks", which covers some great scraping hacks, but it is all
> written in Perl.  I don't know Perl and don't want to.  I'm new to
> programming and have been advised to start with Python.  So far so good,
> but I need some help with web programming.  Thanks for any help you can
> provide.  Dave.

For the HTML parsing part of the task, I've heard that Beautiful Soup works
well:
    http://www.crummy.com/software/BeautifulSoup/ 
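
To give a feel for what "the HTML parsing part" involves, here is a rough
sketch that pulls the link targets out of a page. Note that it does not use
Beautiful Soup: it swaps in the standard library's html.parser module so that
it runs without installing anything. The names LinkCollector, extract_links
and SAMPLE_HTML are made up for illustration, and the sample markup is
invented. Fetching the page over the network is left out so the example runs
offline.

```python
from html.parser import HTMLParser


class LinkCollector(HTMLParser):
    """Hypothetical helper: collect the href of every <a> tag it sees."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag and attribute names before calling this,
        # so <A HREF="..."> and <a href="..."> are handled the same way.
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def extract_links(html):
    """Return the href values of all links in an HTML string, in order."""
    parser = LinkCollector()
    parser.feed(html)
    parser.close()
    return parser.links


# Invented sample page, used instead of a real download.
SAMPLE_HTML = """
<html><body>
<p>Some <a href="http://www.python.org/">Python</a> links,
including one in upper case: <A HREF="/doc/">docs</A></p>
<a name="anchor-only">no href here</a>
</body></html>
"""

if __name__ == "__main__":
    for link in extract_links(SAMPLE_HTML):
        print(link)
```

A dedicated library such as Beautiful Soup is generally aimed at messier,
real-world markup and gives you a navigable tree rather than a stream of
tag events, so treat the above only as a starting point.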

-Andrew.



