> ... print ord(c), unicodedata.name(c) > ... > 65 LATIN CAPITAL LETTER A > 110 LATIN SMALL LETTER N > 97 LATIN SMALL LETTER A > 239 LATIN SMALL LETTER I WITH DIAERESIS > 115 LATIN SMALL LETTER S Looks like I need to explore the unicodedata class. Thanks!