ElementTree Tidy HTML Tree Builder and comments

Björn Lindström bkhl at elektrubadur.se
Fri Mar 18 22:41:23 EST 2005


I'm considering using the ElementTree Tidy HTML Tree Builder for a web
spidering program I'm developing.

However, my program must be able to extract certain information from
HTML comments.

I'm basically creating my trees like this:

TidyHTMLTreeBuilder.parse(urllib.urlopen(url))

What I want to know is, is it possible to make TidyHTMLTreeBuilder
preserve comments in this process, and if so, how would I go
about it?




More information about the Python-list mailing list