[Tutor] Using Regular Expressions to extract strings in brackets from a list

Bod Soutar bodsda at googlemail.com
Mon Dec 30 07:39:38 CET 2013


Steven's answer is probably a lot more robust, but I would use a simple
split.

mylist = ['hypothetical protein BRAFLDRAFT_208408 [Branchiostoma floridae]\n',
          'hypoxia-inducible factor 1-alpha [Mus musculus]\n',
          'hypoxia-inducible factor 1-alpha [Gallus gallus]\n']
for item in mylist:
    # Keep the text after the first "[" and before the next "]".
    print(item.split("[")[1].split("]")[0])
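One caveat: the split raises IndexError on any item that has no "[" in it.
If that can happen in the real data, a regular expression version along
these lines may be safer. This is only a sketch and not Steven's code
(which isn't quoted in this message); the names BRACKETED and
extract_bracketed are made up for illustration.

```python
import re

# Hypothetical pattern name: captures the text between the first "["
# and the next "]" in a string.
BRACKETED = re.compile(r"\[([^\]]*)\]")


def extract_bracketed(items):
    """Return the first bracketed substring of each item.

    Items that contain no bracketed text are skipped rather than
    raising an error.
    """
    result = []
    for item in items:
        match = BRACKETED.search(item)
        if match:
            result.append(match.group(1))
    return result


mylist = ['hypothetical protein BRAFLDRAFT_208408 [Branchiostoma floridae]\n',
          'hypoxia-inducible factor 1-alpha [Mus musculus]\n',
          'hypoxia-inducible factor 1-alpha [Gallus gallus]\n']

species = extract_bracketed(mylist)
print(species)
# ['Branchiostoma floridae', 'Mus musculus', 'Gallus gallus']
```

Skipping items without brackets is a design choice for this sketch; if a
missing bracket should count as an error in your data, raise instead of
skipping.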

-- Bodsda


On 30 December 2013 03:28, Jing Ai <jai633 at g.rwu.edu> wrote:

> Thank you all for the suggestions! I decided to use Steven's re loop in
> the end.
>
> Joel, what I meant earlier was that the link you sent seems to suggest
> replacing some characters in the list, and I'm not sure how that would work...
>
> On Sun, Dec 29, 2013 at 9:24 PM, Joel Goldstick <joel.goldstick at gmail.com> wrote:
>
>>
>>
>>
>> On Sun, Dec 29, 2013 at 9:00 PM, Jing Ai <jai633 at g.rwu.edu> wrote:
>>
>>> Thanks, but I don't think I can get the string out of the brackets by
>>> only replacing other items (there are too many things to replace, and the
>>> replacements may interfere with the items within the string).
>>>
>>>
>>>
>>
>> I am not sure what you mean by your previous sentence.  Check out
>> Steven's excellent answer.  Also, remember to reply to the list, or no one
>> will see your question.
>>
>> Good luck
>>
>>>
>>>
>>>
>>> On Sun, Dec 29, 2013 at 7:08 PM, Joel Goldstick <
>>> joel.goldstick at gmail.com> wrote:
>>>
>>>>
>>>>
>>>>
>>>> On Sun, Dec 29, 2013 at 4:02 PM, Jing Ai <jai633 at g.rwu.edu> wrote:
>>>>
>>>>> Hello,
>>>>>
>>>>> I am trying to rewrite the contents of a long list whose items contain
>>>>> words both inside and outside brackets, and I'm having trouble
>>>>> extracting the words within the brackets, especially since I also have
>>>>> to append the results to a list.  Does anyone have any suggestions? Thank
>>>>> you!
>>>>>
>>>>> *An example of list*:
>>>>>
>>>>> ['hypothetical protein BRAFLDRAFT_208408 [Branchiostoma floridae]\n',
>>>>> 'hypoxia-inducible factor 1-alpha [Mus musculus]\n', 'hypoxia-inducible
>>>>> factor 1-alpha [Gallus gallus]\n' ]
>>>>>
>>>>
>>>> Is the above a Python list, or is it what you get when you read a
>>>> line of a data file?  The reason I ask is that if it is a list, you can
>>>> process it by looping over each list item.  Then maybe try some of these
>>>> ideas:
>>>>
>>>>
>>>> http://stackoverflow.com/questions/10017147/python-replace-characters-in-string
>>>>
>>>>> *What I'm trying to extract out of this*:
>>>>>
>>>>> ['Branchiostoma floridae', 'Mus musculus', 'Gallus gallus']
>>>>>
>>>>>
>>>>>
>>>>>
>>>>>
>>>>> _______________________________________________
>>>>> Tutor maillist  -  Tutor at python.org
>>>>> To unsubscribe or change subscription options:
>>>>> https://mail.python.org/mailman/listinfo/tutor
>>>>>
>>>>>
>>>>
>>>>
>>>> --
>>>> Joel Goldstick
>>>> http://joelgoldstick.com
>>>>
>>>
>>>
>>
>>
>>
>
>