[OT] does the charset lie?

Skip Montanaro skip at pobox.com
Sun May 2 12:20:11 EDT 2004


I know this is off-topic for this group, but I am analyzing this text with
Python... ;-)

Given an HTML file with this meta tag:

    <meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1">

and all individual characters being legitimately in the iso-8859-1 character
set, but with this numeric entity:

    ’

is the charset correct or should it have been utf-8?

Thx,

Skip




More information about the Python-list mailing list