[XML-SIG] unicode problems in elementtree

Jarek Zgoda jzgoda at o2.pl
Sat May 27 12:26:32 CEST 2006


Bryan Lawrence wrote:

> I suspect the problem is that I'm not actually passing an xml document (with a 
> character encoding definition) to ET ... I'm just passing some stuff which is 
> an xml fragment (from a web service interface to a database).
> 
> Does elementtree and/or expat need to know the encoding to get this right? 
> (which may be a problem coz this could be from anyone's document in any 
> encoding ...)

Per the XML spec, data with no encoding declaration and no byte order
mark is assumed to be UTF-8 (AFAIK expat follows this). Check whether
your data is valid UTF-8. Expat natively accepts only UTF-8, UTF-16,
ISO-8859-1 and US-ASCII data, and without an encoding declaration it
treats everything as UTF-8 (or as UTF-16 if a byte order mark is present).
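
A minimal sketch of the check suggested above, assuming a current Python
where ElementTree ships as xml.etree.ElementTree and the fragment arrives
as bytes with no XML declaration of its own. The helper names
is_valid_utf8 and parse_fragment, and the ISO-8859-1 fallback, are
hypothetical choices for illustration only; the real source encoding has
to come from whoever produced the data.

```python
import xml.etree.ElementTree as ET


def is_valid_utf8(data: bytes) -> bool:
    """Return True if the raw bytes decode cleanly as UTF-8."""
    try:
        data.decode("utf-8")
    except UnicodeDecodeError:
        return False
    return True


def parse_fragment(data: bytes, fallback_encoding: str = "iso-8859-1"):
    """Parse an XML fragment that carries no encoding declaration.

    Valid UTF-8 goes to the parser unchanged. Anything else is decoded
    with fallback_encoding (an assumption, not something the parser can
    detect) and re-encoded as UTF-8 before parsing.
    """
    if is_valid_utf8(data):
        return ET.fromstring(data)
    text = data.decode(fallback_encoding)
    return ET.fromstring(text.encode("utf-8"))


if __name__ == "__main__":
    latin1_fragment = "<name>Müller</name>".encode("iso-8859-1")
    print(is_valid_utf8(latin1_fragment))
    print(parse_fragment(latin1_fragment).text)
```

If the fallback guess is wrong, the parse still succeeds but the text
comes out garbled, so this only helps when the real encoding is known
from some other source.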

-- 
Jarek Zgoda
http://jpa.berlios.de/


