[New-bugs-announce] [issue11957] re.sub problem with unicode string

Mindaugas report at bugs.python.org
Fri Apr 29 20:27:10 CEST 2011


New submission from Mindaugas <mindaugas.rucys at gmail.com>:

re.sub doesn't substitute non-ASCII characters:

Python 2.7.1 (r271:86832, Apr 15 2011, 12:11:58) Arch Linux

>>> import re

>>> a=u'aaa'
>>> print re.search('(\w+)',a,re.U).groups()
(u'aaa',)
>>> print re.sub('(\w+)','x',a,re.U)
x

But:

>>> a=u'ąąą'
>>> print re.search('(\w+)',a,re.U).groups()
(u'\u0105\u0105\u0105',)
>>> print re.sub('(\w+)','x',a,re.U)
ąąą
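
One thing worth checking in the session above, offered as a sketch and not as a confirmed diagnosis: the documented signature is re.sub(pattern, repl, string, count=0, flags=0), so a fourth positional argument is taken as count and not as flags. If that is what happens here, re.U (integer value 32) only limits the number of replacements to 32 and the UNICODE flag is never set, which would explain why the ASCII string is replaced and the non-ASCII one is not. The variable names below are illustrative and not part of the report.

```python
# -*- coding: utf-8 -*-
import re

text = u'\u0105\u0105\u0105'  # the string u'ąąą' from the report

# re.U is an integer flag; its numeric value is 32.
flag_value = int(re.U)

# Equivalent of re.sub('(\w+)', 'x', a, re.U): the fourth positional
# argument is count, so no flag is set. On Python 2.7 \w is then
# ASCII-only and the text comes back unchanged; on Python 3 str patterns
# are Unicode-aware by default, so the result differs by version.
as_count = re.sub(r'(\w+)', u'x', text, count=flag_value)

# Passing the flag by keyword (supported since Python 2.7) applies it.
as_flag = re.sub(r'(\w+)', u'x', text, flags=re.U)

print(flag_value)
print(as_flag)
```

With flags=re.U passed by keyword, the whole string is replaced by x on both Python 2.7 and Python 3.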

----------
components: Regular Expressions, Unicode
messages: 134806
nosy: mindauga
priority: normal
severity: normal
status: open
title: re.sub problem with unicode string
versions: Python 2.7

_______________________________________
Python tracker <report at bugs.python.org>
<http://bugs.python.org/issue11957>
_______________________________________

