sre is broken in SuSE 9.2

Thu Feb 10 07:20:13 EST 2005

On Thu, 10 Feb 2005 13:00:42 +0300
"Denis S. Otkidach" <ods at strana.ru> wrote:

> On all platfroms \w matches all unicode letters when used with flag
> re.UNICODE, but this doesn't work on SuSE 9.2:
> 
> Python 2.3.4 (#1, Dec 17 2004, 19:56:48) 
> [GCC 3.3.4 (pre 3.3.5 20040809)] on linux2
> Type "help", "copyright", "credits" or "license" for more information.
> >>> import re
> >>> re.compile(ur'\w+', re.U).match(u'\xe4')
> >>> 
> 
> BTW, is correctly recognize this character as lowercase letter:
> >>> import unicodedata
> >>> unicodedata.category(u'\xe4')
> 'Ll'
> 
> I've looked through all SuSE patches applied, but found nothing
> related. What is the reason for broken behavior?  Incorrect configure
> options?

Just a bit more information. test_re.py fails in SuSE 9.2 with the
following errors:

<snip>
Running re_tests test suite
=== Failed incorrectly ('(?u)\\b.\\b', u'\xc4', 0, 'found', u'\xc4')
=== Failed incorrectly ('(?u)\\w', u'\xc4', 0, 'found', u'\xc4')

-- 
Denis S. Otkidach
http://www.python.ru/      [ru]