[XML-SIG] Parsing malformed XHTML

Brian Quinlan brian at sweetapp.com
Sun May 21 10:23:59 CEST 2006


Lars Kellogg-Stedman wrote:

> I need to parse this document into a DOM, make some changes, and then
> spit back out the modified file as (X?)HTML (ideally well-formed).  Am
> I going to be able to do this with PyXML?  If not, I'd love to hear
> your suggestions for the appropriate tools.
> 
> Thanks!
> 
> -- Lars
> 

You might want to look into Beautiful Soup. Another approach is to pass 
the document through HTML Tidy and then process the output.
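
As a rough illustration of the Beautiful Soup route, here is a minimal sketch. It assumes the current `beautifulsoup4` package (imported as `bs4`) and its `"html.parser"` backend, which are newer than this thread. The sample markup, the `repair_and_modify` and `is_well_formed` helpers, and the `class="fixed"` change are all hypothetical stand-ins for whatever document and edits you actually have.

```python
# Sketch only: assumes the third-party `beautifulsoup4` package is installed.
# The markup and the helper names below are made up for illustration.
from xml.dom import minidom
from xml.parsers.expat import ExpatError

from bs4 import BeautifulSoup

# Deliberately sloppy input: unclosed <p> and <b>, and a bare <br>.
malformed = (
    "<html><body><p>First paragraph<p>Second <b>bold text</p>"
    "<br></body></html>"
)


def repair_and_modify(markup):
    """Parse tag soup, make a small change, and serialize it again."""
    soup = BeautifulSoup(markup, "html.parser")
    # Example modification: tag every paragraph with a class attribute.
    for p in soup.find_all("p"):
        p["class"] = "fixed"
    # Serializing the tree closes every open element and writes <br/>.
    return str(soup)


def is_well_formed(text):
    """Return True if a strict XML parser accepts the text."""
    try:
        minidom.parseString(text)
    except ExpatError:
        return False
    return True


if __name__ == "__main__":
    result = repair_and_modify(malformed)
    print(result)
    print("well-formed:", is_well_formed(result))
```

The strict `minidom` check at the end is just a way to confirm that the serialized result can be handed to an ordinary XML parser afterwards. Note that the tree Beautiful Soup builds from broken input may nest elements differently from what a browser would do, so it is worth inspecting the output before relying on it.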

Cheers,
Brian

