[Expat-discuss] (no subject)

Mon Jun 7 12:07:31 EDT 2004

Hi,

[expat 1.95.6]

I have an XML file which uses iso-8859-1 encoding. Therefore the

file begins with:

   <?xml version="1.0" encoding="iso-8859-1"?>

In the file I have some character data which includes an accented

character (u umlaut - byte 0xFC).  When I parse the file

the charater code gets translated (by expat) to two bytes (0xC3, 0xBC). I can

see this in my CharacterDataHandler function.

What I am doing wrong?  Do I have to preprocess the file to

change this to a character reference?

Thanks, Mark