[Python-Dev] Regular expressions, Unicode etc.

Mike Klaas mike.klaas at gmail.com
Wed Aug 8 20:56:58 CEST 2007


On 8-Aug-07, at 2:28 AM, Nick Maclaren wrote:

> I have needed to push my stack to teach REs (don't ask), and am
> taking a look at the RE code.  I may be able to extend it to support
> RFE 694374 and (more importantly) atomic groups and possessive
> quantifiers.  While I regard such things as revolting beyond belief,
> they make a HELL of a difference to the efficiency of recognising
> things like HTML tags in a morass of mixed text.

+1.  I would use such a feature.
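
As an illustration of what Nick is describing: an atomic group matches like an ordinary group, but the engine never backtracks into it once it has matched. The sketch below is not from the thread. It assumes the well-known emulation of an atomic group with a capturing lookahead followed by a backreference, which works in the existing re module, and the helper name `matches` is purely illustrative.

```python
import re

# Ordinary greedy quantifier: a+ first takes "aaa", then backtracks one
# character so that the trailing literal "a" can match.
backtracking = re.compile(r"a+a")

# Emulated atomic group: the lookahead captures the longest run of "a",
# and the backreference \1 consumes exactly that run. The engine does not
# re-enter a lookahead that has already succeeded, so nothing is given back.
atomic = re.compile(r"(?=(a+))\1a")


def matches(pattern, text):
    """Return True if pattern matches at the start of text (hypothetical helper)."""
    return pattern.match(text) is not None


print(matches(backtracking, "aaa"))  # True: a+ gives back one "a"
print(matches(atomic, "aaa"))        # False: the atomic part keeps all three
```

The efficiency gain comes from the same property: on a failing match the atomic version gives up at once instead of retrying every shorter split.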

> The other approach, which is to stick to true regular expressions,
> and wholly or partially convert to DFAs, has already been rendered
> impossible by even the limited Perl/PCRE extensions that Python
> has adopted.

Impossible?  Surely a sufficiently competent re engine could detect
when it is possible to construct a DFA?

-Mike
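
The kind of detection Mike asks about could be sketched roughly as follows. This is not how any existing engine works; it is a conservative illustration that parses a pattern and reports whether it uses constructs a straightforward DFA construction does not handle (backreferences, conditional groups, lookaround assertions). It relies on the re module's internal parser, which is an undocumented implementation detail and may change, and the names `NON_DFA_OPS`, `iter_ops` and `dfa_friendly` are hypothetical.

```python
try:
    from re import _parser as sre_parse   # Python 3.11 and later
except ImportError:
    import sre_parse                       # earlier Python 3 releases

# Assumption: these parse-tree opcodes are treated as ruling out a simple DFA.
NON_DFA_OPS = {
    sre_parse.GROUPREF,         # backreference such as \1
    sre_parse.GROUPREF_EXISTS,  # conditional (?(1)yes|no)
    sre_parse.ASSERT,           # lookahead / lookbehind
    sre_parse.ASSERT_NOT,       # negative lookahead / lookbehind
}


def iter_ops(node):
    """Yield every opcode found anywhere in a parsed pattern."""
    if isinstance(node, sre_parse.SubPattern):
        for op, av in node:
            yield op
            yield from iter_ops(av)
    elif isinstance(node, (tuple, list)):
        # Arguments nest sub-patterns inside tuples and lists.
        for item in node:
            yield from iter_ops(item)


def dfa_friendly(pattern):
    """Return True if the pattern uses none of the opcodes in NON_DFA_OPS."""
    return not any(op in NON_DFA_OPS for op in iter_ops(sre_parse.parse(pattern)))


print(dfa_friendly(r"<[a-z]+>"))   # plain character class and repeat
print(dfa_friendly(r"(a+)\1"))     # uses a backreference
```

An engine taking this route would fall back to the backtracking matcher whenever the check fails, so the check only needs to be safe, not complete.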

