[Python-Dev] Regular expressions: splitting on zero-width patterns

Guido van Rossum guido at python.org
Tue Nov 28 17:27:22 EST 2017


On Tue, Nov 28, 2017 at 2:23 PM, MRAB <python at mrabarnett.plus.com> wrote:

> On 2017-11-28 20:04, Serhiy Storchaka wrote:
>
>> The two largest problems in the re module are splitting on zero-width
>> patterns and complete and correct support of the Unicode standard. These
>> problems are solved in regex. regex has many other features, but they
>> are less important.
>>
>> I want to tell the problem of splitting on zero-width patterns. It
>> already was discussed on Python-Dev 13 years ago [3] and maybe later.
>> See also issues: [4], [5], [6], [7], [8].
>>
>> [snip]
> After some thought, I've decided that if this happens in the re module in
> Python 3.7, then, for the sake of compatibility (and because the edge cases
> are debatable anyway), I'll have the regex module do the same when used on
> Python 3.7.
>

Maybe it should also be selectable with a version flag?

-- 
--Guido van Rossum (python.org/~guido)
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mail.python.org/pipermail/python-dev/attachments/20171128/e0c1ae4d/attachment.html>


More information about the Python-Dev mailing list