[issue1693050] \w not helpful for non-Roman scripts
Matthew Barnett
report at bugs.python.org
Wed May 29 19:31:08 CEST 2013
Matthew Barnett added the comment:
You could've obtained it from msg76556 or msg190100:
>>> print(ascii('हिन्दी'))
'\u0939\u093f\u0928\u094d\u0926\u0940'
>>> import re, regex
>>> print(ascii(re.match(r"\w+", '\u0939\u093f\u0928\u094d\u0926\u0940').group()))
'\u0939'
>>> print(ascii(regex.match(r"\w+", '\u0939\u093f\u0928\u094d\u0926\u0940').group()))
'\u0939\u093f\u0928\u094d\u0926\u0940'
----------
_______________________________________
Python tracker <report at bugs.python.org>
<http://bugs.python.org/issue1693050>
_______________________________________
More information about the Python-bugs-list
mailing list