HTML Parsing

Victor Noagbodji noagbodjivictor at gmail.com
Sat Jun 28 23:22:51 EDT 2008


> Hi everyone
Hello

> I am trying to build my own web crawler for an experiement and I don't
> know how to access HTTP protocol with python.
urllib2: http://docs.python.org/lib/module-urllib2.html

> Also, Are there any Opensource Parsing engine for HTML documents
> available in Python too? That would be great.
BeautifulSoup:
  http://www.crummy.com/software/BeautifulSoup/
  http://www.crummy.com/software/BeautifulSoup/documentation.html

All the best

-- 
NOAGBODJI Paul Victor



More information about the Python-list mailing list