sax barfs on unicode filenames: workaround

"Martin v. Löwis" martin at v.loewis.de
Wed Oct 4 16:39:18 EDT 2006


Edward K. Ream schrieb:
> Happily, the workaround is easy.  Replace theFile with:
> 
> # Use cStringIo to avoid a crash in sax when inputFileName has unicode 
> characters.
> s = theFile.read()
> theFile = cStringIO.StringIO(s)
> 
> My first attempt at a workaround was to use:
> 
> s = theFile.read()
> parser.parseString(s)
> 
> but the expat parser does not support parseString...

Right - you would have to use xml.sax.parseString (which is a global
function, not a method).

Of course, parseString just does what you did: create a cStringIO
object and operate on that.

Regards,
Martin



More information about the Python-list mailing list