[Expat-discuss] Character Encoding 4 bytes Limitation

chandan kumar chandu_ns at yahoo.com
Mon Aug 7 13:07:44 CEST 2006


Hi All,
   
   
  The Expat doc/reference.html mentions these limitations for character encodings.
  -----
  Expat places restrictions on character encodings that it can support by filling in the XML_Encoding structure:
   
  2. Characters must be encoded in 4 bytes or less.
  3. All characters encoded must have Unicode scalar values less than or equal to 65535 (0xFFFF). This does not apply to the built-in support for UTF-16 and UTF-8.
  ------
   
  Some Chinese characters have Unicode scalar values beyond this range. Does this mean that Expat cannot parse all Chinese characters?
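   
  To make the range concrete, here is a minimal sketch of my understanding. The helpers utf8_encode and fits_restriction_3 are hypothetical names of my own and are not part of Expat. As far as I can tell, a common character such as U+4E2D stays within 0xFFFF, while a CJK Extension B character such as U+20000 does not, although it still fits in 4 bytes of UTF-8:

```c
#include <stddef.h>

/* Hypothetical helper, NOT part of Expat: encode one Unicode scalar value
   as UTF-8. Returns the number of bytes written (1 to 4), or 0 if cp is
   not a valid scalar value. */
size_t utf8_encode(unsigned long cp, unsigned char out[4])
{
    if (cp < 0x80UL) {
        out[0] = (unsigned char)cp;
        return 1;
    }
    if (cp < 0x800UL) {
        out[0] = (unsigned char)(0xC0 | (cp >> 6));
        out[1] = (unsigned char)(0x80 | (cp & 0x3F));
        return 2;
    }
    if (cp < 0x10000UL) {
        if (cp >= 0xD800UL && cp <= 0xDFFFUL)
            return 0; /* surrogates are not scalar values */
        out[0] = (unsigned char)(0xE0 | (cp >> 12));
        out[1] = (unsigned char)(0x80 | ((cp >> 6) & 0x3F));
        out[2] = (unsigned char)(0x80 | (cp & 0x3F));
        return 3;
    }
    if (cp <= 0x10FFFFUL) {
        out[0] = (unsigned char)(0xF0 | (cp >> 18));
        out[1] = (unsigned char)(0x80 | ((cp >> 12) & 0x3F));
        out[2] = (unsigned char)(0x80 | ((cp >> 6) & 0x3F));
        out[3] = (unsigned char)(0x80 | (cp & 0x3F));
        return 4;
    }
    return 0; /* beyond the Unicode code space */
}

/* Hypothetical check mirroring restriction 3 as quoted above:
   nonzero if the scalar value is at most 65535 (0xFFFF). */
int fits_restriction_3(unsigned long cp)
{
    return cp <= 0xFFFFUL;
}
```

  With these helpers, utf8_encode(0x4E2D, buf) writes the 3 bytes E4 B8 AD and fits_restriction_3(0x4E2D) is true, whereas utf8_encode(0x20000, buf) writes the 4 bytes F0 A0 80 80 and fits_restriction_3(0x20000) is false. So U+20000 would satisfy restriction 2 but not restriction 3.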
   
  Is there any Expat document that lists the supported characters?
   
   
  Thanks,
  Chandu
   
   
   
   

 		

