[XML-SIG] How do I process CDATA when the characters look like XML

Wed, 14 May 2003 11:58:24 -0600

Hi,

I'm working with the basic Python SAX parser (xml.sax.handler, etc.) and 
need to process some XML that has "<" and ">" characters in its CDATA 
fields.

For example, the model I want to have is this:

<A>X</A>  where "X" is some kind of character data.

but if X = <B>, I end up with this situation:

<A><B></A>

A parser is going to think that "<B>" is a new XML tag, and it'll die saying 
that I have mismatched tags.  The field may contain no <*> type entries at 
all, or it may contain many blocks like that.
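
To make the failure concrete, here is a minimal sketch of what I mean, 
assuming the standard xml.sax.parseString entry point.  The TextCollector 
class and the try_parse helper are names made up for this illustration, not 
part of the library.

```python
import xml.sax
from xml.sax.handler import ContentHandler


class TextCollector(ContentHandler):
    """Hypothetical handler that gathers all character data it is given."""

    def __init__(self):
        super().__init__()
        self.chunks = []

    def characters(self, content):
        # The parser may deliver text in several pieces, so accumulate them.
        self.chunks.append(content)


def try_parse(document):
    """Return (text, None) on success or (None, error message) on failure."""
    handler = TextCollector()
    try:
        xml.sax.parseString(document, handler)
    except xml.sax.SAXParseException as exc:
        return None, exc.getMessage()
    return "".join(handler.chunks), None


# Plain character data parses fine.
ok_text, ok_err = try_parse(b"<A>X</A>")

# With X = <B>, the parser treats <B> as an unclosed element and gives up.
bad_text, bad_err = try_parse(b"<A><B></A>")

print(ok_text, ok_err)
print(bad_text, bad_err)
```

The first call hands back the text "X"; the second never reaches 
characters() with "<B>" at all, because the parse error is raised first.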

Is there a way to set a processing flag to specify that I want to grab 
everything between <A> and </A> as a character buffer?  If there's no 
built-in way to grab that, is there a trick I can use to get the same 
effect?

I can't just flip a state in the parser and 'build' the string from 
startElement and endElement names, because the character data won't be 
well-formed XML (there are no closing </> style elements).

Any help is greatly appreciated!
  -William
