[Expat-discuss] Encoding problems on Mac OS X

Fred L. Drake, Jr. fdrake@acm.org
Mon Jun 10 09:34:04 2002


Naveen Viswanatha writes:
 > I am parsing xml encoded in ISO-8859-1 charset on Mac
 > OS 10.1 using expat v1.95.1 and v1.95.3.  On some XML
 > files everything works ok, however, on others, the
 > CharacterDataHandler seems to parse garbage (weird
 > unrecognizable characters). 

Is the encoding properly specified in the XML declaration at the top
of the file?

Do you realize that the text passed to the character data handler will
always be encoded in UTF-8 in the default build of Expat, regardless
of the input encoding?
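
A small sketch of what this means in practice, using Python's pyexpat
binding as a stand-in for the C API. The sample document and the name
collect_text are made up for illustration and are not from the original
report. pyexpat hands the handler already-decoded strings, so the UTF-8
bytes a C handler would receive are reproduced here by re-encoding:

```python
import xml.parsers.expat


def collect_text(xml_bytes):
    """Parse xml_bytes and return all character data as one string."""
    chunks = []
    parser = xml.parsers.expat.ParserCreate()
    # The handler may be called several times per text node, so collect pieces.
    parser.CharacterDataHandler = chunks.append
    parser.Parse(xml_bytes, True)
    return "".join(chunks)


# Hypothetical input: "caf\u00e9" stored as ISO-8859-1, with the encoding declared.
doc = '<?xml version="1.0" encoding="ISO-8859-1"?><p>caf\u00e9</p>'.encode("iso-8859-1")

text = collect_text(doc)

# Bytes a C character data handler would see in a default (UTF-8) Expat build.
utf8_bytes = text.encode("utf-8")

# Treating those UTF-8 bytes as ISO-8859-1 yields the "garbage" characters.
latin1_view = utf8_bytes.decode("iso-8859-1")

print(repr(utf8_bytes))
print(repr(latin1_view))
```

If the application needs ISO-8859-1 output, it has to convert the
handler's text back itself; Expat will not do that for you.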


  -Fred

-- 
Fred L. Drake, Jr.  <fdrake at acm.org>
PythonLabs at Zope Corporation