trying to parse non valid html documents with HTMLParser

Benji York benji at benjiyork.com
Wed Aug 3 09:10:39 EDT 2005

Previous message (by thread): trying to parse non valid html documents with HTMLParser
Next message (by thread): trying to parse non valid html documents with HTMLParser
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]

florent wrote:
> True, I just want to extract some data from html documents. But the 
> problem is the same. The parser looses the position he was in the string 
> when he encounters a bad tag.

Are you saying that Beautiful Soup can't parse the HTML?  If so, I'm 
sure the author would like an example so he can "fix" it.
--
Benji York

Previous message (by thread): trying to parse non valid html documents with HTMLParser
Next message (by thread): trying to parse non valid html documents with HTMLParser
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]

More information about the Python-list mailing list