fast regex

Patrick Maupin pmaupin at gmail.com
Fri May 7 10:58:30 EDT 2010


On May 6, 9:44 pm, james_027 <cai.hai... at gmail.com> wrote:
> On May 6, 11:33 pm, John Bokma <j... at castleamber.com> wrote:
>
> > james_027 <cai.hai... at gmail.com> writes:
> > > I was working with regex on a very large text, really large, but I am
> > > time-constrained. Does Python have any other regex library or string
> > > manipulation library that works really fast?
>
> > Hard to answer without seeing your regex and requirements first.
> > Your question is like: I had a meal yesterday and it upset my
> > stomach. Can you suggest a different meal for today?
>
> > --
> > John Bokma                                                               j3b
>
> > Hacking & Hiking in Mexico - http://johnbokma.com/
> > http://castleamber.com/ - Perl & Python Development
>
> I am doing something like this
>
> for key, value in words_list.items():
>     compile = re.compile(r"""\b%s\b""" % key, re.IGNORECASE)
>     search = compile.sub(value, content)
>
> where the content is a large text of about 500,000 characters and the
> word list has about 5,000 entries
>
> Any optimization for the code above?

Sure.

for key, value in words_list.items():
    pass
compile = re.compile(r"""\b%s\b""" % key, re.IGNORECASE)
search = compile.sub(value, content)
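
The quoted loop overwrites `search` on every pass and never feeds the result back into `content`, so only the last key's substitution survives; that is why hoisting the two lines out of the loop gives the same result. If the intent was instead to apply every replacement, one common approach is to build a single alternation and make one pass over the text. What follows is only a sketch under assumptions that are not confirmed in the thread: keys are literal words rather than regex patterns (so they are escaped), values are literal replacement text, and replaced text is not rescanned. The name `replace_words` is hypothetical.

```python
import re


def replace_words(content, words_list):
    """Replace each whole-word key of words_list in content, ignoring case."""
    if not words_list:
        return content
    # Lower-case the keys so a match can be looked up whatever its case.
    lookup = {key.lower(): value for key, value in words_list.items()}
    # Try longer keys first so a longer phrase wins over a prefix of it.
    keys = sorted(lookup, key=len, reverse=True)
    # Assumption: keys are plain text, so escape any regex metacharacters.
    pattern = re.compile(
        r"\b(?:%s)\b" % "|".join(re.escape(key) for key in keys),
        re.IGNORECASE,
    )
    # A callable replacement is inserted literally (no backslash processing).
    return pattern.sub(lambda match: lookup[match.group(0).lower()], content)
```

This scans the 500,000 characters once instead of 5,000 times. Whether it is actually faster for a given word list is something to measure rather than assume.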



