[OT] does the charset lie?

David Goodger goodger at python.org
Sun May 2 12:39:45 EDT 2004


Skip Montanaro wrote:
> Given an HTML file with this meta tag:
> 
>     <meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1">
> 
> and all individual characters being legitimately in the iso-8859-1 character
> set, but with this numeric entity:
> 
>> 
> is the charset correct or should it have been utf-8?

The charset is correct.  "&" "#" "8" etc. are all in iso-8859-1.
"’" is just a way of indicating a character outside of the
text's charset.  Internally, the browser may use Unicode or
some other encoding, but that's not your problem.

-- David Goodger





More information about the Python-list mailing list