HTML Parser chokes on WordHTML...

Ian Bicking ianb at colorstudy.com
Fri May 2 19:35:45 EDT 2003


On Fri, 2003-05-02 at 13:14, Harald Massa wrote:
> So... is there any replacement for the HTMLParser from the python.lib
> which even can eat Microsoft Word HTML ? 

You might want to try running mxTidy on the HTML first:

http://www.lemburg.com/files/python/mxTidy.html

  Ian







More information about the Python-list mailing list