[Python-Dev] Regular expressions: splitting on zero-width patterns

MRAB python at mrabarnett.plus.com
Tue Nov 28 17:23:04 EST 2017


On 2017-11-28 20:04, Serhiy Storchaka wrote:
> The two largest problems in the re module are splitting on zero-width
> patterns and complete and correct support of the Unicode standard. These
> problems are solved in regex. regex has many other features, but they
> are less important.
> 
> I want to tell the problem of splitting on zero-width patterns. It
> already was discussed on Python-Dev 13 years ago [3] and maybe later.
> See also issues: [4], [5], [6], [7], [8].
> 
[snip]
After some thought, I've decided that if this happens in the re module 
in Python 3.7, then, for the sake of compatibility (and because the edge 
cases are debatable anyway), I'll have the regex module do the same when 
used on Python 3.7.


More information about the Python-Dev mailing list