Manipulate HTML documents via data structure

David Fraser davidf at sjsoft.com
Tue Oct 5 02:35:09 EDT 2004


C. Barnes wrote:
> Python provides HTML parsing through the
> HTMLParser and htmllib modules.
> 
> For my application, I needed to search through
> an HTML document in a nonlinear fashion and
> dynamically change parts of the document.  The
> most logical way to do this is to translate HTML
> back and forth to a data structure.
> 
> I wrote a module called htmldata, available from:
> 
> http://oregonstate.edu/~barnesc/htmldata/

Just wondering if you've seen BeautifulSoup?
It seems quite similar to what you're doing although I haven't looked in 
detail...
http://www.crummy.com/software/BeautifulSoup/

David



More information about the Python-list mailing list