[ python-Bugs-975556 ] HTMLParser lukewarm on bogus bare attribute chars

SourceForge.net noreply at sourceforge.net
Fri Jun 18 15:33:18 EDT 2004


Bugs item #975556, was opened at 2004-06-18 14:33
Message generated for change (Tracker Item Submitted) made by Item Submitter
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=105470&aid=975556&group_id=5470

Category: Python Library
Group: Python 2.3
Status: Open
Resolution: None
Priority: 5
Submitted By: Mike Coleman (mkc)
Assigned to: Nobody/Anonymous (nobody)
Summary: HTMLParser lukewarm on bogus bare attribute chars

Initial Comment:
I tripped over the same problem mentioned in bug
#921657 (HTMLParser.py), except that my bogus attribute
char is '|' instead of '@'.

May I suggest that HTMLParser either require strict
compliance with the HTML spec or, alternatively, accept
everything reasonable?  The latter approach would be
much more useful.  Either way, it would be valuable to
have this decision documented.

In particular, the 'attrfind' pattern needs to be changed
so that, following the '=\s*', it accepts something like
the bare-value subpattern already given for
'locatestarttagend' (see the "bare value" line).
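
To illustrate the suggestion, here is a minimal sketch of the difference
between a whitelist-style bare value and a permissive one. Both patterns
are approximations written for this example (assumptions, not copied from
HTMLParser.py), and the names 'attrfind_strict', 'attrfind_lenient' and
'first_attr' are hypothetical. The permissive bare value is taken to be
"anything up to a quote, '>' or whitespace".

```python
import re

# ASSUMPTION: approximation of a whitelist-style pattern, where a bare
# (unquoted) value may only contain an explicit set of characters.
# '|' and '@' are not in that set.
attrfind_strict = re.compile(
    r'\s*([a-zA-Z_][-.:a-zA-Z_0-9]*)(\s*=\s*'
    r'(\'[^\']*\'|"[^"]*"|[-a-zA-Z0-9./,:;+*%?!&$\(\)_#=~]*))?')

# ASSUMPTION: the suggested change, with a permissive bare value that
# runs up to the next quote, '>' or whitespace character.
attrfind_lenient = re.compile(
    r'\s*([a-zA-Z_][-.:a-zA-Z_0-9]*)(\s*=\s*'
    r'(\'[^\']*\'|"[^"]*"|[^\'">\s]+))?')


def first_attr(pattern, text):
    """Return (name, value) for the first attribute matched in text."""
    m = pattern.match(text)
    return m.group(1), m.group(3)


sample = ' width=10|20 height=5'
strict = first_attr(attrfind_strict, sample)    # value stops before '|'
lenient = first_attr(attrfind_lenient, sample)  # value keeps the '|'
print(strict, lenient)
```

With the whitelist pattern the value is cut off at the '|', so the rest
of the attribute no longer lines up with what the tag-end pattern
consumed; the permissive pattern keeps the whole bare value.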


----------------------------------------------------------------------
