BeautifulSoup vs. Microsoft

Duncan Booth duncan.booth at invalid.invalid
Thu Mar 29 04:08:11 EDT 2007


John Nagle <nagle at animats.com> wrote:

> Strictly speaking, it's Microsoft's fault.
> 
>      title="<!--http://www.microsoft.com/usability/information.mspx->"
> 
> is supposed to be an HTML comment.  But it's improperly terminated.
> It should end with "-->".  So all that following stuff is from what
> follows the next "-->" which terminates a comment.

It is an attribute value, and unescaped angle brackets are valid in 
attributes. It looks to me like a bug in BeautifulSoup.



More information about the Python-list mailing list