How to Split Chinese Character with backslash representation?

Cameron Walsh cameron.walsh at gmail.com
Fri Oct 27 00:03:31 EDT 2006


Wijaya Edward wrote:
> Hi all,
> 
> I was trying to split a string that 
> represent chinese characters below:
>  
>  
>>>> str = '\xc5\xeb\xc7\xd5\xbc'
>>>> print str2,
> ???
>>>> fields2 = split(r'\\',str)
>>>> print fields2,
> ['\xc5\xeb\xc7\xd5\xbc']
> 
> But why the split function here doesn't seem
> to do the job for obtaining the desired result:
>  
> ['\xc5','\xeb','\xc7','\xd5','\xbc']
>  

Depends on what you want to do with them:

 >>> string = '\xc5\xeb\xc7\xd5\xbc'
 >>> for char in string:
         print char


Å
ë
Ç
Õ
¼
 >>> list_of_characters = list(string)
 >>> list_of_characters
['\xc5', '\xeb', '\xc7', '\xd5', '\xbc']
 >>> for char in string:
         char


'\xc5'
'\xeb'
'\xc7'
'\xd5'
'\xbc'
 >>> for char in list_of_characters:
         print char


Å
ë
Ç
Õ
¼
 >>> string[3]
'\xd5'
 >>> string[1:3]
'\xeb\xc7'

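Basically, your characters are already separated: a string is effectively 
a list of characters (but with a few more methods applicable only to 
lists of characters, not to other lists). The backslashes in '\xc5' are 
escape notation for a single byte each, not literal backslash characters 
stored in the string, so splitting on a backslash finds nothing to split.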
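
For anyone reading this on Python 3, here is a minimal sketch of the same 
point, assuming the data is held as a bytes object rather than a Python 2 
str. The variable names (data, pieces, raw and so on) are only 
illustrative and are not from the original post.

```python
# Python 3 sketch: the same five bytes written as a bytes literal.
data = b'\xc5\xeb\xc7\xd5\xbc'

# The backslashes are escape notation in the source, not stored data,
# so the object holds five bytes and none of them is a backslash.
has_backslash = b'\\' in data
length = len(data)

# Splitting on a backslash therefore returns the input as a single field.
unsplit = data.split(b'\\')

# Slicing one position at a time gives one single-byte object per byte.
pieces = [data[i:i + 1] for i in range(len(data))]

# A raw string literal, by contrast, really does contain backslashes,
# and only in that case does splitting on '\\' produce several fields.
raw = r'\xc5\xeb\xc7\xd5\xbc'
raw_fields = raw.split('\\')

print(has_backslash, length)
print(unsplit)
print(pieces)
print(raw_fields)
```

Note that in Python 3, iterating over a bytes object directly yields 
integers rather than one-byte strings, which is why the sketch slices 
instead of calling list(data).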


