python regular expression help
attn.steven.kuo at gmail.com
attn.steven.kuo at gmail.com
Thu Apr 12 01:15:23 EDT 2007
On Apr 11, 9:50 pm, "Gabriel Genellina" <gagsl-... at yahoo.com.ar>
wrote:
> En Wed, 11 Apr 2007 23:14:01 -0300, Qilong Ren <qilong_... at yahoo.com>
> escribió:
>
> > Thanks for reply. That actually is not what I want. Strings I am dealing
> > with may look like this:
> > s = 'a = 4.5 b = 'h' 'd' c = 4.5 3.5'
> > What I want is
> > a = 4.5
> > b = 'h' 'd'
> > c = 4.5 3.5
>
> That's a bit tricky. You have LHS = RHS where RHS includes all the
> following text *except* the very next word before the following = (which
> is the LHS of the next expression). Or something like that :)
>
> py> import re
> py> s = "a = 4.5 b = 'h' 'd' c = 4.5 3.5"
> py> r = re.compile(r"\w+\s*=\s*.*?(?=\w+\s*=|$)")
> py> for item in r.findall(s):
> ... print item
> ...
> a = 4.5
> b = 'h' 'd'
> c = 4.5 3.5
>
Another way is to use split:
import re
lhs = re.compile(r'\s*(\b\w+\s*=)')
for s in [ "a = 4 b =3.4 5.4 c = 4.5",
"a = 4.5 b = 'h' 'd' c = 4.5 3.5"]:
tokens = lhs.split(s)
results = [tokens[_] + tokens[_+1] for _ in range(1,len(tokens),
2)]
print s
print results
--
Regards,
Steven
More information about the Python-list
mailing list