HTML Parsing
Larry Bates
larry.bates at websafe.com
Mon Jun 30 08:52:58 EDT 2008
disappearedng at gmail.com wrote:
> Hi everyone
> I am trying to build my own web crawler for an experiment and I don't
> know how to access HTTP protocol with python.
>
> Also, are there any open-source parsing engines for HTML documents
> available in Python too? That would be great.
>
>
Check out Mechanize. It wraps Beautiful Soup inside of methods that aid in
website crawling.
http://pypi.python.org/pypi/mechanize/0.1.7b
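
For the plain "how do I speak HTTP and pull links out of a page" half of the
question, here is a minimal sketch that uses only the standard library rather
than Mechanize. It assumes Python 3; the names LinkExtractor, extract_links
and fetch are made up for illustration and are not part of any library.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin
from urllib.request import urlopen


class LinkExtractor(HTMLParser):
    """Collect the href of every <a> tag, resolved against base_url."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []

    def handle_starttag(self, tag, attrs):
        # attrs is a list of (name, value) pairs; value may be None.
        if tag != "a":
            return
        for name, value in attrs:
            if name == "href" and value:
                # Turn relative links into absolute URLs.
                self.links.append(urljoin(self.base_url, value))


def extract_links(html, base_url):
    """Return the absolute URLs of all links found in an HTML string."""
    parser = LinkExtractor(base_url)
    parser.feed(html)
    parser.close()
    return parser.links


def fetch(url):
    """Download a page over HTTP and return its body as text."""
    with urlopen(url, timeout=10) as response:
        # Fall back to UTF-8 when the server does not declare a charset.
        charset = response.headers.get_content_charset() or "utf-8"
        return response.read().decode(charset, errors="replace")
```

A crawler step would then look roughly like
extract_links(fetch(url), url), feeding the returned URLs back into a queue
of pages still to visit. A real crawler would also want to honor robots.txt
and avoid revisiting pages, which this sketch leaves out.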
-Larry