raw_input() and utf-8 formatted chars

kyosohma at gmail.com
Fri Oct 12 15:18:05 EDT 2007


On Oct 12, 1:53 pm, 7stud <bbxx789_0... at yahoo.com> wrote:
> s = 'A\xcc\x88'   #capital A with umlaut
> print s           #displays capital A with umlaut
>
> s = raw_input('Enter: ')   #A\xcc\x88
> print s                    #displays A\xcc\x88
>
> print len(s)               #9
>
> It looks like every character of the string I enter in utf-8 is being
> interpreted literally as 9 separate characters rather than one
> character.  How do I enter a capital A with an umlaut so that python
> treats it as one character?

I don't know. This works for me:

>>> x = raw_input('Enter: ')
Enter: ä
>>> len(x)
1
>>>

I'm using Python 2.4 with Default Source Encoding set to None on
Windows XP SP2.
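
One thing that might explain the 9: raw_input() does not interpret
backslash escapes, so typing A\xcc\x88 at the prompt hands back those nine
literal characters (A, \, x, c, c, \, x, 8, 8) rather than three bytes.
Below is a rough sketch of turning such typed escapes back into the
character they spell. It assumes a current Python 3 interpreter rather than
the 2.4 mentioned above, the variable `typed` is only a stand-in for what
the prompt would return, and `interpret_escapes` is a made-up helper name,
not anything from the standard library.

```python
import codecs
import unicodedata

# Hypothetical stand-in for the text returned by the prompt when the user
# types the nine characters  A \ x c c \ x 8 8  literally.
typed = r'A\xcc\x88'


def interpret_escapes(text):
    # Illustrative helper (an assumption, not a library function):
    # 'unicode_escape' turns each literal \xNN into the code point U+00NN,
    unescaped = codecs.decode(text, 'unicode_escape')
    # 'latin-1' maps each U+00NN back to the single byte 0xNN,
    raw_bytes = unescaped.encode('latin-1')
    # and those bytes can then be read as UTF-8.
    return raw_bytes.decode('utf-8')


decoded = interpret_escapes(typed)

# The UTF-8 bytes 41 CC 88 spell 'A' followed by a combining diaeresis
# (U+0308); NFC normalization merges the pair into the single U+00C4.
composed = unicodedata.normalize('NFC', decoded)

print(len(typed))     # 9
print(len(decoded))   # 2
print(len(composed))  # 1
```

Note that even after decoding, the length is 2 rather than 1, because
A\xcc\x88 is the decomposed form (letter plus combining mark). Only the
normalized string has length 1, which matches what I see when I type the
precomposed character directly.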

Mike



