[Expat-discuss] (no subject)

Mark 11mjazbdg02 at sneakemail.com
Mon Jun 7 12:07:31 EDT 2004


Hi,



[expat 1.95.6]



I have an XML file which uses iso-8859-1 encoding. Therefore the
file begins with:

   <?xml version="1.0" encoding="iso-8859-1"?>



In the file I have some character data which includes an accented
character (u umlaut - byte 0xFC).  When I parse the file, the character
code gets translated (by expat) to two bytes (0xC3, 0xBC). I can
see this in my CharacterDataHandler function.
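
For reference, the two bytes 0xC3, 0xBC are what U+00FC (u umlaut) looks
like when encoded as UTF-8, so the handler appears to be receiving UTF-8
rather than the file's original single-byte encoding. Below is a minimal
sketch of that arithmetic. The helper names latin1_to_utf8 and
utf8_pair_to_latin1 are hypothetical, written for illustration only; they
are not expat functions, and the sketch assumes only code points
U+0000..U+00FF are involved.

```c
#include <stddef.h>

/* Hypothetical helper (not part of expat): encode one ISO-8859-1 byte,
 * which is also the Unicode code point U+0000..U+00FF, as UTF-8.
 * Writes 1 or 2 bytes into out and returns how many were written. */
static size_t latin1_to_utf8(unsigned char c, unsigned char out[2])
{
    if (c < 0x80) {
        /* ASCII range is a single byte, unchanged */
        out[0] = c;
        return 1;
    }
    /* Two-byte form: 110xxxxx 10xxxxxx */
    out[0] = (unsigned char)(0xC0 | (c >> 6));
    out[1] = (unsigned char)(0x80 | (c & 0x3F));
    return 2;
}

/* Hypothetical inverse: turn a two-byte UTF-8 sequence that encodes a
 * code point in U+0080..U+00FF back into the single ISO-8859-1 byte.
 * Assumes the caller has already checked that b0 is 0xC2 or 0xC3 and
 * that b1 is a continuation byte (0x80..0xBF). */
static unsigned char utf8_pair_to_latin1(unsigned char b0, unsigned char b1)
{
    return (unsigned char)(((b0 & 0x1F) << 6) | (b1 & 0x3F));
}
```

With these, latin1_to_utf8(0xFC, out) fills out with 0xC3, 0xBC, and
utf8_pair_to_latin1(0xC3, 0xBC) gives back 0xFC, which matches the bytes
seen in the handler.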



What am I doing wrong?  Do I have to preprocess the file to
change this to a character reference?



Thanks, Mark


