Html parser

Nikita the Spider NikitaTheSpider at gmail.com
Fri Jun 15 21:21:42 EDT 2007


In article <mailman.9124.1181917406.32031.python-list at python.org>,
 Stephen R Laniel <steve at laniels.org> wrote:

> On Fri, Jun 15, 2007 at 07:11:56AM -0700, HMS Surprise wrote:
> > Could you recommend  an html parser that works with python (jython
> > 2.2)?
> 
> I'm new here, but I believe BeautifulSoup is the canonical
> answer:
> http://www.crummy.com/software/BeautifulSoup/

It is, but personally I'm a fan of Connelly Barnes' htmldata module:
http://oregonstate.edu/~barnesc/htmldata/

Much easier to use than BeautifulSoup IMO.

-- 
Philip
http://NikitaTheSpider.com/
Whole-site HTML validation, link checking and more



More information about the Python-list mailing list