[New-bugs-announce] [issue11957] re.sub problem with unicode string
Mindaugas
report at bugs.python.org
Fri Apr 29 20:27:10 CEST 2011
New submission from Mindaugas <mindaugas.rucys at gmail.com>:
re.sub don't substitute not ASCII characters:
Python 2.7.1 (r271:86832, Apr 15 2011, 12:11:58) Arch Linux
>>>import re
>>>a=u'aaa'
>>>print re.search('(\w+)',a,re.U).groups()
(u'aaa')
>>>print re.sub('(\w+)','x',a,re.U)
x
BUT:
>>>a=u'ąąą'
>>>print re.search('(\w+)',a,re.U).groups()
(u'\u0105\u0105\u0105')
>>>print re.sub('(\w+)','x',a,re.U)
ąąą
----------
components: Regular Expressions, Unicode
messages: 134806
nosy: mindauga
priority: normal
severity: normal
status: open
title: re.sub problem with unicode string
versions: Python 2.7
_______________________________________
Python tracker <report at bugs.python.org>
<http://bugs.python.org/issue11957>
_______________________________________
More information about the New-bugs-announce
mailing list