Python DIFF to track changes in HTML pages anyone?

François Pinard pinard at iro.umontreal.ca
Mon Jan 17 14:02:02 EST 2000


janhansen at my-deja.com writes:

> I need to track changes in HTML pages and to return meaningful (to humans)
> pieces of information.  In html pages there is often added a new piece
> (or packet) of information but the "string" concept of common diff sort of
> has no meaning.  crlf' s in html is not logical connected with "packets"
> of info in html.  anyone now of python code for this?

It is not Python code, and not especially tied to HTML, but maybe my
`wdiff' utility might help you.  Let me see...  The latest pretest is at:

   http://www.iro.umontreal.ca/contrib/~pinard/wdiff/wdiff-0.5g.tar.gz

Just remove the `g' if you prefer a "stable" version. :-)

-- 
François Pinard   http://www.iro.umontreal.ca/~pinard






More information about the Python-list mailing list