[ expat-Bugs-514281 ] french accents errors?
noreply@sourceforge.net
noreply@sourceforge.net
Thu Feb 7 10:38:02 2002
Bugs item #514281, was opened at 2002-02-07 06:45
You can respond by visiting:
http://sourceforge.net/tracker/?func=detail&atid=110127&aid=514281&group_id=10127
Category: None
Group: Not a Bug
Status: Open
Resolution: None
Priority: 5
Submitted By: Vincent Fortier (fortierv)
Assigned to: Nobody/Anonymous (nobody)
Summary: french accents errors?
Initial Comment:
I'm having errors with french accents in iso-8859-1 and utf-8 format
only at a specific place..
Exemple:
<CLIENT>
<COMPAGNIE>Ville Charlesbourg</COMPAGNIE>
<CONTACT>
<NOM>Martin Labbé</NOM>
<COMM
Type='Phone'>(xxx)xxx</COMM>
</CONTACT>
</CLIENT>
It
always stops at <NOM>Martin Labbé</NOM> .. If I either change "é"
for an "e" it works.. if I simply add 2 chars after the "é" like:
<NOM>Martin LabbéXX</NOM> it still works... But it always stops
when it finishes with "é" or "éX"....
I've tried a few patches..
one for expat.h and xmlparse.c wich was unicode changes (#476931,
#464837).. and another one wich was about xmltok.c (#477667)...
but with no success.. I always get the error message "mismatched tag
at line xx"...
I have a lot of "éèàçêÈ" everywhere.. but it seems
to only stop when it's placed right at the end of a tag..
Anybody
can help me out?
- vin
----------------------------------------------------------------------
>Comment By: Vincent Fortier (fortierv)
Date: 2002-02-07 10:37
Message:
Logged In: YES
user_id=451869
I've found a way to solve my problem.. The problem was actually with the iso-
8859-1 format making errors with "é" at really specefic places.. I've
found a converter in C wich transform a text file to utf-8 and the my problems
where over.. (see attachement)
Here is the web site where I got
it:
http://developer.iplanet.com/tech/directory/utf8ltn1.html
----------------------------------------------------------------------
Comment By: Vincent Fortier (fortierv)
Date: 2002-02-07 07:40
Message:
Logged In: YES
user_id=451869
Note: BTW.. xmlwf works fine on the file.. so it's supposed to be well-
formed in iso-8859-1 format ... and the value of "é" is E9 and 351 (used octal
dump..)..
thnx.
----------------------------------------------------------------------
You can respond by visiting:
http://sourceforge.net/tracker/?func=detail&atid=110127&aid=514281&group_id=10127