htmllib: CR in CDATA

Mark Nottingham mnot at pobox.com
Tue Jun 22 05:31:04 EDT 1999


Whoops, never mind, I misread the spec -- carriage returns are turned into
spaces (which is what htmllib does); it is *line feeds* that should be ignored...
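
For anyone following along, here is a rough sketch of the CDATA attribute
rules as I now read them in the HTML 4.0 spec: line feeds are dropped, and
each carriage return or tab becomes a single space. The function name
normalize_cdata is made up for illustration and is not part of htmllib, and
the entity-replacement step the spec also lists is left out.

```python
def normalize_cdata(value):
    """Sketch of HTML 4.0 CDATA attribute handling (entity replacement omitted).

    normalize_cdata is a hypothetical helper, not an htmllib function.
    """
    # Line feeds are ignored outright.
    value = value.replace("\n", "")
    # Each carriage return or tab is replaced with a single space.
    return value.replace("\r", " ").replace("\t", " ")
```

Under these rules an href split by a bare line feed, as in "foo.\ngif",
comes back as "foo.gif", while one split by a carriage return gets a space
where the break was.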

--
"Get me the phone book."
  "Which one?"
"Doesn't matter."


----- Original Message -----
From: Mark Nottingham <mnot at pobox.com>
To: Python <python-list at cwi.nl>
Sent: Tuesday, June 22, 1999 12:55
Subject: htmllib: CR in CDATA


> It appears that htmllib doesn't ignore returns in CDATA fields, as HTML 4.0
> says it should:
> http://www.w3.org/TR/REC-html40/types.html#type-cdata
> http://www.w3.org/TR/REC-html40/sgml/dtd.html
>
> As a result, htmllib improperly parses any CDATA element that wraps across a
> line; this affects elements like
>
> <A href="foo.
> gif">
>
> I'm happy to work up a patch, but I thought I'd ask around first. It may be
> a bit involved to fix it properly; every CDATA should be handled this way,
> which practically means almost every tag attribute.
>
> Regards,
>
>
> Mark Nottingham, Melbourne Australia
> mnot at pobox.com  http://www.mnot.net/
>
>
>




