sgmllib missing an end tag?

Bengt Richter bokr at oz.net
Sat Jun 14 21:17:36 EDT 2003


On Sat, 14 Jun 2003 22:14:39 GMT, Chris Curvey <ccurvey at earthlink.net> wrote:

>I hope this doesn't post twice....
>
>Can anyone see why sgmllib would miss the </a> tag just after "Log in" 
>in the following HTML fragment?  I've been looking at this so long that 
>the characters are starting to swim around....
>
>                     <a 
>href="http://chriscurvey.com:8080/curveyConsulting/login_form">
>                 <img 
>src="http://chriscurvey.com:8080/curveyConsulting/linkOpaque.gif" alt="" 
>title="" height="11" width="6" border="0" />
                                           ^---what happens if you remove this?
I.e., I wonder if sgmllib recognizes empty (self-closing) tags. They are XHTML style.
I'm guessing you may need the HTMLParser module.


>
>                 Log in
>             </a> <!-- this is the tag that's being missed -->
>
>
>             <a 
>href="http://chriscurvey.com:8080/curveyConsulting/portal_form/join_form">
>                 <img 
>src="http://chriscurvey.com:8080/curveyConsulting/linkOpaque.gif" alt="" 
>title="" height="11" width="6" border="0" />
>
>                 Join
>             </a>
>

Regards,
Bengt Richter




More information about the Python-list mailing list