htmllib: CR in CDATA

Fredrik Lundh fredrik at pythonware.com
Tue Jun 22 09:39:27 EDT 1999


> Now, htmllib certainly:
> * is a component
> * part of a distributed system (i.e., the Web)
> * presents an interface (programmatic)
> * processes requests on behalf of a user

> I'm curious... if it's not a user agent in the quoted context, what is it?

I'm pretty sure you know what I meant, but
alright...

htmllib is a parser, just like the documentation says.
You have to add an application to get an HTML user
agent (see section 1.2.3 of the HTML 2.0 spec for more
info on user agents).

IMHO, it's pretty reasonable for an SGML parser to
behave like an XML parser: split the document up
into pieces, but pass them all to the application as
untouched as possible. If you wish to implement
additional behaviour, do that at the application
level. Otherwise, you'll end up in a situation where
some users cannot use the standard library...
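
A minimal sketch of that layering, assuming a modern Python where
html.parser.HTMLParser stands in for htmllib/sgmllib (both were removed
in Python 3). The names EventForwarder, collect and normalize_newlines
are hypothetical, invented for this illustration. The parser layer only
splits the input and forwards the pieces; whether a CR/LF pair in
character data gets folded is decided by the application.

```python
from html.parser import HTMLParser


class EventForwarder(HTMLParser):
    """Hypothetical thin parser layer: split the input, forward the pieces."""

    def __init__(self, handler):
        # convert_charrefs=False keeps character references out of the data
        # chunks instead of decoding them inside the parser.
        super().__init__(convert_charrefs=False)
        self.handler = handler

    def handle_starttag(self, tag, attrs):
        # Forward the raw start tag text rather than a processed form.
        self.handler("start", self.get_starttag_text())

    def handle_endtag(self, tag):
        self.handler("end", tag)

    def handle_data(self, data):
        # Data is passed through unchanged, including any CR characters.
        self.handler("data", data)


def collect(document, normalize_newlines=False):
    """Hypothetical application layer: policy decisions live here."""
    events = []

    def handler(kind, text):
        if normalize_newlines and kind == "data":
            # Application-level choice, not something the parser imposes.
            text = text.replace("\r\n", "\n").replace("\r", "\n")
        events.append((kind, text))

    parser = EventForwarder(handler)
    parser.feed(document)
    parser.close()
    return events
```

An application that needs the bytes as written calls collect(doc);
one that wants folded line ends calls collect(doc, normalize_newlines=True).
Neither has to work around the parser.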

(like I did only a few hours ago, trying to use sgmllib
to parse SGML data with case-sensitive tags. sigh...)
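
To illustrate the case-folding problem, here is a sketch that again
uses html.parser.HTMLParser as a stand-in (an assumption; sgmllib
itself is not available in Python 3). HTMLParser is documented to pass
tag names to handle_starttag converted to lower case, so an application
that needs the original case has to recover it from the raw tag text.
CasePreservingParser and its attributes are hypothetical names.

```python
import re
from html.parser import HTMLParser


class CasePreservingParser(HTMLParser):
    """Hypothetical workaround: recover tag case the parser folded away."""

    def __init__(self):
        super().__init__()
        self.folded = []    # tag names as the parser reports them
        self.original = []  # tag names as written in the input

    def handle_starttag(self, tag, attrs):
        self.folded.append(tag)
        # get_starttag_text() returns the start tag as it appeared in the
        # input, so the original spelling can be pulled back out of it.
        raw = self.get_starttag_text()
        match = re.match(r"<\s*([^\s/>]+)", raw)
        self.original.append(match.group(1) if match else tag)


parser = CasePreservingParser()
parser.feed("<Item><item>")
parser.close()
```

After the feed, parser.folded holds two identical names while
parser.original still distinguishes them. The extra regex is the kind
of application-side workaround a pass-through parser would make
unnecessary.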

</F>




