[Tutor] regex problem

Michael Powe michael at trollope.org
Wed Jan 5 20:25:05 CET 2005


On Wed, Jan 05, 2005 at 07:37:58AM -0000, Alan Gauld wrote:
> > This function removes HTML formatting codes from a text email 
 
> Using regex to remove HTML is usually the wrong approach unless 
> you can guarantee the format of the HTML in advance. The 
> HTMLparser is usually better and simpler. I think theres an example
> in the module doc of converting HTML to plain text.

Thanks.  This is one of those projects I've had in mind for a long
time, decided it was a good way to learn some python.  I will look at
the HTMLParser module.  But then once I get started on one of these
projects, it has a way of taking over.  ;-)

mp


More information about the Tutor mailing list