Regular Expression

Pippo adm2303.2304 at gmail.com
Sun Apr 12 19:57:00 EDT 2015


On Sunday, 12 April 2015 19:47:09 UTC-4, Pippo  wrote:
> On Sunday, 12 April 2015 19:44:05 UTC-4, Pippo  wrote:
> > On Sunday, 12 April 2015 19:28:44 UTC-4, MRAB  wrote:
> > > On 2015-04-12 23:49, Pippo wrote:
> > > > I have a text as follows:
> > > >
> > > > "#D{#C[Health] #P[Information] -
> > > > means any information, including #ST[genetic information],
> > > > whether #C[oral | (recorded in (any form | medium))], that
> > > > (1)#C[Is created or received by] a
> > > > #A[health care provider | health plan | public health authority | employer | life insurer | school | university | or health care clearinghouse];
> > > > (2)#C[Relates to] #C[the past, present, or future physical | mental health | condition of an individual] |
> > > > #C[the provision of health care to an individual] |
> > > > #C[the past, present, or future payment for the provision of health care to an individual].}"
> > > >
> > > > I want to get all elements that start with #C and are []  and put it in an array. For example #C[Health], I try with regex but it doesn't work:
> > > >
> > > "... it doesn't work"? In what way doesn't it work?
> > > 
> > > > import re
> > > > import tkinter.filedialog
> > > > import readfile
> > > >
> > > >
> > > >
> > > > j = 0
> > > >
> > > > text = [ ]
> > > >
> > > >
> > > > content = readfile.pattread()
> > > >
> > > > while j < len(content):
> > > >
> > > There's a syntax error here:
> > > 
> > > >      constraint = re.compile(r'(#C\[\w*\]'))
> > > >      result = constraint.search(content[j],re.MULTILINE)
> > > >      text.append(result)
> > > >      print(text)
> > > >      j = j+1
> > > >
> > 
> > result is empty! Although it should have a content.
> > 
> > What is the syntax error?
> 
> I fixed the syntax error but the result shows:
> 
> >>> 
> [None]
> [None, None]
> [None, None, None]
> [None, None, None, None]
> [None, None, None, None, None]
> [None, None, None, None, None, None]
> [None, None, None, None, None, None, None]
> [None, None, None, None, None, None, None, None]
> >>> 
> 
> 
> No error but if I don't call the content I posted up and call this as a content: #content = "#C[Health] #P[Information]"
> 
> result gives me #C[Health]

what is the best way to separate the elements in the [] from the text?



More information about the Python-list mailing list