Help needed in Handling HTML file

Stephen R Laniel steve at laniels.org
Tue Jun 26 10:45:47 EDT 2007


On Tue, Jun 26, 2007 at 09:42:37AM -0500, senthil arasu wrote:
> Right now Iam handling HTML files using Python.
> Is there any modules to support HTML parsing and rendering ?
> or suggest me any other way to support HTML.

Parsing: BeautifulSoup
http://www.crummy.com/software/BeautifulSoup/

Rendering: not sure. If worse came to worst, you could
always call out to 'w3m -dump' using system(). How
sophisticated do you want your rendering to be? Other people
on this last may well know of ways to fully render a page,
including graphics and JavaScript and whatever else, using a
scriptable Python object; I do not.

-- 
Stephen R. Laniel
steve at laniels.org
Cell: +(617) 308-5571
http://laniels.org/
PGP key: http://laniels.org/slaniel.key



More information about the Python-list mailing list