[OT] does the charset lie?
David Goodger
goodger at python.org
Sun May 2 12:39:45 EDT 2004
Skip Montanaro wrote:
> Given an HTML file with this meta tag:
>
> <meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1">
>
> and all individual characters being legitimately in the iso-8859-1 character
> set, but with this numeric entity:
>
> ’
>
> is the charset correct or should it have been utf-8?
The charset is correct. "&" "#" "8" etc. are all in iso-8859-1.
"’" is just a way of indicating a character outside of the
text's charset. Internally, the browser may use Unicode or
some other encoding, but that's not your problem.
-- David Goodger
More information about the Python-list
mailing list