[Tutor] extra characters in XML

Ben Vinger benvinger at yahoo.co.uk
Thu Jan 12 12:26:12 CET 2006


Thanks - it was exactly as you said 

--- Kent Johnson <kent37 at tds.net> wrote:

> Most likely your XML file is 16-bit unicode, not
> utf-8. When ascii text 
> is represented as unicode, every other byte will be
> a null byte. That is 
> the extra character that shows up as a space or box
> depending on who is 
> interpreting it. The utf-8 codec must be swallowing
> the null bytes.
> 
> In your code above, instead of utf-8 try utf_16_be
> and utf_16_le, one of 
> them should work.





		
___________________________________________________________ 
To help you stay safe and secure online, we've developed the all new Yahoo! Security Centre. http://uk.security.yahoo.com


More information about the Tutor mailing list