HTML Parsing

Larry Bates larry.bates at websafe.com`
Mon Jun 30 08:52:58 EDT 2008


disappearedng at gmail.com wrote:
> Hi everyone
> I am trying to build my own web crawler for an experiement and I don't
> know how to access HTTP protocol with python.
> 
> Also, Are there any Opensource Parsing engine for HTML documents
> available in Python too? That would be great.
> 
> 
Check on Mechanize.  It wraps Beautiful Soup inside of methods that aid in 
website crawling.

http://pypi.python.org/pypi/mechanize/0.1.7b

-Larry



More information about the Python-list mailing list