htmllib.py and parsing malformed HTML

KC nskhcarlso at bellsouth.net
Fri Sep 5 10:48:19 EDT 2003


Jeremy Bowers wrote:
> On Thu, 04 Sep 2003 11:50:07 -0400, KC wrote:
> 
...

> that's bad in a lot of little ways, you'll find the "apply a hack to fix
> this file, apply a hack to fix that file" will start stepping on its own
> toes.
Oh yeah, I couldn't agree more.  Any more requests for "hacks" and 
HTMLTidy gets brought into the picture.
> 
> HTMLTidy represents a ***lot*** of grunt work and a ***lot*** of
> functionality that you can *not* replicate in a reasonable amount of time;
> it's one of those packages that isn't so much a program that "does
> something" as a program that represents many, many man-years of "knowledge
> acquired". 
> 
Agreed.  I like HTMLTidy very much and it's obvious it could save us 
developers a lot of effort.






More information about the Python-list mailing list