sgmllib missing an end tag?
Bengt Richter
bokr at oz.net
Sat Jun 14 21:17:36 EDT 2003
On Sat, 14 Jun 2003 22:14:39 GMT, Chris Curvey <ccurvey at earthlink.net> wrote:
>I hope this doesn't post twice....
>
>Can anyone see why sgmllib would miss the </a> tag just after "Log in"
>in the following HTML fragment? I've been looking at this so long that
>the characters are starting to swim around....
>
> <a
>href="http://chriscurvey.com:8080/curveyConsulting/login_form">
> <img
>src="http://chriscurvey.com:8080/curveyConsulting/linkOpaque.gif" alt=""
>title="" height="11" width="6" border="0" />
^---what happens if you remove this?
I.e., I wonder if sgmllib recognizes empty (self-closing) tags. They are XHTML style.
I'm guessing you may need the HTMLParser module.
>
> Log in
> </a> <!-- this is the tag that's being missed -->
>
>
> <a
>href="http://chriscurvey.com:8080/curveyConsulting/portal_form/join_form">
> <img
>src="http://chriscurvey.com:8080/curveyConsulting/linkOpaque.gif" alt=""
>title="" height="11" width="6" border="0" />
>
> Join
> </a>
>
Regards,
Bengt Richter
More information about the Python-list
mailing list