[ expat-Bugs-514281 ] french accents errors?

noreply@sourceforge.net noreply@sourceforge.net
Thu Feb 7 10:38:02 2002


Bugs item #514281, was opened at 2002-02-07 06:45
You can respond by visiting: 
http://sourceforge.net/tracker/?func=detail&atid=110127&aid=514281&group_id=10127

Category: None
Group: Not a Bug
Status: Open
Resolution: None
Priority: 5
Submitted By: Vincent Fortier (fortierv)
Assigned to: Nobody/Anonymous (nobody)
Summary: french accents errors?

Initial Comment:
I'm having errors with french accents in iso-8859-1 and utf-8 format 
only at a specific place..

Exemple:
<CLIENT>
   
<COMPAGNIE>Ville Charlesbourg</COMPAGNIE>
   <CONTACT>
      
<NOM>Martin Labbé</NOM>
      <COMM 
Type='Phone'>(xxx)xxx</COMM>
   </CONTACT>
</CLIENT>

It 
always stops at <NOM>Martin Labbé</NOM> .. If I either change "é" 
for an "e" it works.. if I simply add 2 chars after the "é" like: 
<NOM>Martin LabbéXX</NOM> it still works...  But it always stops 
when it finishes with "é" or "éX"....

I've tried a few patches.. 
one for expat.h and xmlparse.c wich was unicode changes (#476931, 
#464837).. and another one wich was about xmltok.c (#477667)... 
but with no success..  I always get the error message "mismatched tag 
at line xx"...

I have a lot of "éèàçêÈ" everywhere.. but it seems 
to only stop when it's placed right at the end of a tag..

Anybody 
can help me out?

- vin

----------------------------------------------------------------------

>Comment By: Vincent Fortier (fortierv)
Date: 2002-02-07 10:37

Message:
Logged In: YES 
user_id=451869

I've found a way to solve my problem.. The problem was actually with the iso-
8859-1 format making errors with "é" at really specefic places.. I've 
found a converter in C wich transform a text file to utf-8 and the my problems 
where over.. (see attachement)

Here is the web site where I got 
it:
http://developer.iplanet.com/tech/directory/utf8ltn1.html



----------------------------------------------------------------------

Comment By: Vincent Fortier (fortierv)
Date: 2002-02-07 07:40

Message:
Logged In: YES 
user_id=451869

Note:  BTW.. xmlwf works fine on the file.. so it's supposed to be well-
formed in iso-8859-1 format ... and the value of "é" is E9 and 351 (used octal 
dump..)..

thnx.

----------------------------------------------------------------------

You can respond by visiting: 
http://sourceforge.net/tracker/?func=detail&atid=110127&aid=514281&group_id=10127