Manipulate HTML documents via data structure
David Fraser
davidf at sjsoft.com
Tue Oct 5 02:35:09 EDT 2004
C. Barnes wrote:
> Python provides HTML parsing through the
> HTMLParser and htmllib modules.
>
> For my application, I needed to search through
> an HTML document in a nonlinear fashion and
> dynamically change parts of the document. The
> most logical way to do this is to translate HTML
> back and forth to a data structure.
>
> I wrote a module called htmldata, available from:
>
> http://oregonstate.edu/~barnesc/htmldata/
Just wondering if you've seen BeautifulSoup?
It seems quite similar to what you're doing although I haven't looked in
detail...
http://www.crummy.com/software/BeautifulSoup/
David
More information about the Python-list
mailing list