sgmllib has problems with dots in tag names

Andreas Jung ajung at sz-sb.de
Fri Jul 16 06:52:30 EDT 1999


The SGML parsers from the standard sgmllib and the XML sgmllib are both
unable to parse SGML tags with dots in the tag name, like <TI.DOC>. The
parsers' callback functions only receive the first part of the tag name
(before the dot) as their argument (in this case 'TI'). Because these are
valid SGML tags, this is a bit annoying. OK, one could work around it by
replacing all dots in tag names with underscores, but that's not a clean
solution :-)
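
For illustration, the underscore workaround could be sketched roughly as
below. This is only a preprocessing step run on the text before it is fed
to the parser, not a fix to sgmllib itself. The function name
underscore_tag_names and the regular expression are made up for this
example, and the pattern assumes that tag names start with a letter and
contain only letters, digits, dots, hyphens and underscores.

```python
import re

# Matches the start of an opening or closing tag plus its name,
# e.g. "<TI.DOC" or "</TI.DOC".
# Assumption: names start with a letter and contain only letters,
# digits, dots, hyphens and underscores.
_TAG_NAME = re.compile(r"(</?)([A-Za-z][A-Za-z0-9._-]*)")


def underscore_tag_names(text):
    """Replace dots inside tag names with underscores (hypothetical helper)."""
    return _TAG_NAME.sub(
        lambda m: m.group(1) + m.group(2).replace(".", "_"),
        text,
    )


sample = "<TI.DOC>Version 1.0</TI.DOC>"
print(underscore_tag_names(sample))  # prints <TI_DOC>Version 1.0</TI_DOC>
```

Dots in ordinary character data are left alone, but the pattern is naive:
anything that looks like the start of a tag (inside a comment or an
attribute value, say) would be rewritten too, and the handlers then see
TI_DOC instead of the real name. That is part of why it is not a clean
solution.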

Any other ideas?

Thanks,
Andreas




