Regex Generator From Multiple Files

Jeremy.Chen you680 at gmail.com
Tue Jan 6 08:17:01 EST 2009


On Jan 6, 8:48 am, MRAB <goo... at mrabarnett.plus.com> wrote:
> James Pruitt wrote:
> > I am looking for a way given a number of files, say 3, that represent
> > technical support tickets in the same format to generate regular
> > expressions for the different fields automatically.
>
> > An example from of one line from each file:
> > Date: 12/30/2008 Room: 457 Building: Main
> > Date: 12/31/2008 Room: A21 Building: Annex
> > Date: 1/4/2009 Room: L69 Building: Library
>
> > The program would then, possibly using the python diff library, generate
> > the regular expression needed to parse out different fields. In this
> > case it might return a tuple like
> > ("^Date:[\w]+(.*)[\w]+Room","Room:[\w]+(.*)[\w]+Building","Building:[\w]+(.­*)[\w]+$")
> > that would match each of the fields based on the common data and sort of
> > assume that what doesn't change between them is data we are looking for.
>
> Why not just assume that each field consists of a word terminated by a
> colon, then some text, then the next field or the end of the line?- Hide quoted text -
>
> - Show quoted text -

do you mean the sub method?
-------------
re.sub( r'(?i)(example)',self.captureRegxp,content )



More information about the Python-list mailing list