Unicode chr(150) en dash

Richard Brodie R.Brodie at rl.ac.uk
Thu Apr 17 11:55:37 EDT 2008


<marexposed at googlemail.com> wrote in message 
news:mailman.595.1208445083.17997.python-list at python.org...

> I think I understand the basic principles of Unicode; what confuses me is
> how different applications make use of it.
>
> For example, I got that EN DASH out of a web page which states
> <?xml version="1.0" encoding="ISO-8859-1"?> at the beginning. That's why I
> went for that encoding. But if the browser can properly decode that
> character using that encoding, how come other applications can't?

Browsers tend to guess what the author intended a lot. In particular, they
fudge the difference between ISO-8859-1 and Windows-1252.
http://en.wikipedia.org/wiki/Windows-1252
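
To make the difference concrete, here is a minimal sketch (not part of the
original exchange) that decodes byte 150 (0x96) both ways. It assumes a
Python 3 interpreter, where bytes and text are separate types; the variable
names are only illustrative. In ISO-8859-1 that byte maps to an unnamed C1
control character, while in Windows-1252 it maps to EN DASH, which is
presumably what the page's author actually typed.

```python
import unicodedata

# Byte value 150 (0x96), as it might arrive from a page labelled ISO-8859-1.
raw = b"\x96"

# Strict ISO-8859-1 maps every byte to the code point with the same number,
# so 0x96 becomes U+0096, a C1 control character with no Unicode name.
as_latin1 = raw.decode("iso-8859-1")

# Windows-1252 reuses most of the 0x80-0x9F range for printable characters;
# 0x96 is EN DASH (U+2013).
as_cp1252 = raw.decode("cp1252")

for label, ch in (("iso-8859-1", as_latin1), ("cp1252", as_cp1252)):
    name = unicodedata.name(ch, "<no name: control character>")
    print(label, "U+%04X" % ord(ch), unicodedata.category(ch), name)
```

If a document claims ISO-8859-1 but displays a dash at that position, decoding
it as cp1252 instead is usually the pragmatic fix, and it is roughly what
browsers do on their own.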




