Q: how to extract only text from a html ?

Gerrit Holl gerrit at NOSPAM.nl.linux.org
Wed Nov 1 08:18:42 EST 2000


On Tue, 31 Oct 2000 13:50:54 -0600, Hwanjo Yu wrote:
> Could someone please tell me how to get rid of all the tags in a html ?
> It seems that the htmllib.HTMLParser is not helpful to do it.

Maybe you should have a look at regular expressions, in the re module.
A great deal is possible with them. Have you looked at them yet?
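
A rough sketch of what that could look like, in case it helps. It assumes
a current Python 3 (html.unescape needs 3.4 or later), and strip_tags is
just a name made up for this example, not something from the standard
library. Regular expressions do not really understand HTML, so this will
trip over malformed markup or a ">" inside an attribute value; treat it as
a starting point rather than a complete solution.

```python
import re
from html import unescape

# Hypothetical helper for illustration; not a standard-library function.
# (?is) = ignore case, and let "." match newlines.
SCRIPT_STYLE_RE = re.compile(r"(?is)<(script|style)\b.*?</\1\s*>")
COMMENT_RE = re.compile(r"(?s)<!--.*?-->")
TAG_RE = re.compile(r"<[^>]*>")


def strip_tags(html_text):
    """Return html_text with tags removed and whitespace collapsed."""
    # Remove script/style elements together with their contents,
    # since that content is code rather than readable text.
    text = SCRIPT_STYLE_RE.sub(" ", html_text)
    # Remove comments before the generic tag pattern sees them.
    text = COMMENT_RE.sub(" ", text)
    # Replace every remaining tag with a space so adjacent words
    # from neighbouring elements do not run together.
    text = TAG_RE.sub(" ", text)
    # Turn entities such as &amp; back into plain characters.
    text = unescape(text)
    # Collapse runs of whitespace into single spaces.
    return " ".join(text.split())


if __name__ == "__main__":
    print(strip_tags("<h1>Title</h1><p>Hello &amp; <b>world</b></p>"))
```

Each tag is replaced with a space rather than nothing, so "<h1>Title</h1><p>Hello"
does not become "TitleHello". The price is a stray space when punctuation
directly follows a closing tag.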

regards,
Gerrit.

-- 
****************************************************************************
*  Save Mother Earth! -- Earth First! -- visit http://www.earthfirst.org/!
****************************************************************************


