[issue12731] python lib re uses obsolete sense of \w in full violation of UTS#18 RL1.2a

Henry S. Thompson report at bugs.python.org
Fri Jan 31 08:31:30 EST 2020


Henry S. Thompson <ht at inf.ed.ac.uk> added the comment:

[One year and 2 days later... :-[

Is this fixed in 3.9?  If not, the Versions list above should be updated.

The failure of lower() to preserve 'alpha-ness' is a serious bug, it causes significant failures in e.g. Turkish NLP, and it's _not_ just a failure of the documentation!

Please can we move this to category Unicode and get at least this aspect of the problem fixed?  Should I raise a separate issue on isalpha() etc.?

----------

_______________________________________
Python tracker <report at bugs.python.org>
<https://bugs.python.org/issue12731>
_______________________________________


More information about the Python-bugs-list mailing list