[issue32198] \b reports false-positives in Indic strings involving combining marks

Serhiy Storchaka report at bugs.python.org
Sat Dec 2 13:28:37 EST 2017


Serhiy Storchaka <storchaka+cpython at gmail.com> added the comment:

This is a known issue. See also issue1693050, issue12731, issue25743. I hope it will be solved in 3.7 and maybe the solution will be backported to 2.7 and 3.6 (but not to 3.5, 3.5 takes only security fixes).

As a workaround I suggest you to use the third-party regex module. This is a mature module mostly compatible with re, but with better support of Unicode and additional features.

----------
assignee:  -> serhiy.storchaka
nosy: +serhiy.storchaka
versions: +Python 2.7, Python 3.6, Python 3.7 -Python 3.5

_______________________________________
Python tracker <report at bugs.python.org>
<https://bugs.python.org/issue32198>
_______________________________________


More information about the Python-bugs-list mailing list