[issue12731] python lib re uses obsolete sense of \w in full violation of UTS#18 RL1.2a
Henry S. Thompson
report at bugs.python.org
Fri Jan 31 08:31:30 EST 2020
Henry S. Thompson <ht at inf.ed.ac.uk> added the comment:
[One year and 2 days later... :-[
Is this fixed in 3.9? If not, the Versions list above should be updated.
The failure of lower() to preserve 'alpha-ness' is a serious bug, it causes significant failures in e.g. Turkish NLP, and it's _not_ just a failure of the documentation!
Please can we move this to category Unicode and get at least this aspect of the problem fixed? Should I raise a separate issue on isalpha() etc.?
----------
_______________________________________
Python tracker <report at bugs.python.org>
<https://bugs.python.org/issue12731>
_______________________________________
More information about the Python-bugs-list
mailing list