How to write Regular Expression for recursive matching?

J. Clifford Dyer jcd at sdf.lonestar.org
Mon Nov 26 12:34:59 EST 2007


On Mon, Nov 26, 2007 at 06:04:54PM +0100, Diez B. Roggisch wrote regarding Re: How to write Regular Expression for recursive matching?:
> 
> lisong wrote:
> 
> > Hi All,
> > 
> > I have problem to split a string like this:
> > 
> > 'abc.defg.hij.klmnop'
> > 
> > and I want to get all substrings with only one '.' in mid. so the
> > output I expect is :
> > 
> > 'abc.defg', 'defg.hij', 'hij.klmnop'
> > 
> > a simple regular expression '\w+.\w' will only return:
> > 'abc.defg', 'hij.klmnop'
> > 
> > is there a way to get 'defg.hij' using regular expression?
> 
> Nope. Regular expressions can't get back in their input-stream, at least not
> for such stuff.
> 
> The problem at hand is easily solved using
> 
> s = 'abc.defg.hij.klmnop'
> 
> pairs = [".".join(v) for v in zip(s.split(".")[:-1], s.split(".")[1:])]
> 

which is veritably perlesque in its elegance and simplicity!

A slightly more verbose version.

l = s.split('.')
pairs = []
for x in xrange(len(l)-1):
	pairs.append('.'.join(l[x:x+2]))

Cheers,
Cliff



More information about the Python-list mailing list