[Web-SIG] HTML parsers and DOM; WWW::Mechanize work-alike

John J Lee jjl at pobox.com
Tue Dec 2 14:35:36 EST 2003


On Tue, 2 Dec 2003, Simon Willison wrote:
[...]
> Is there any way we could get a DOM tree from invalid HTML using pure
> Python tools? The HTML tools in the Python standard library at the
[...]

No chance.  A lot of work has gone into HTMLTidy / tidylib, reimplementing
it would be a lot of work for little benefit.


John



More information about the Web-SIG mailing list