Parsing Email 'References' header.

David wizzardx at gmail.com
Thu May 8 09:14:19 EDT 2008


On Thu, May 8, 2008 at 2:53 PM, Aspersieman <aspersieman at gmail.com> wrote:
> Hi
>
>  I have a python script that parses email headers to extract information
> from them. I need to get the _last_ messageid in the 'References' field
> (http://cr.yp.to/immhf/thread.html) to create a threaded view of these
> emails (these messageids are stored in a database).
>
>  Now, I can easily access the 'References' field using the python 'email'
> module, but I need a regular expression to get the last messageid in the
> 'References' field.
>
>  Here's what I have so far:
>  <code>
>  # parentid's value is e.g.
>  # "<1 at mail.gmail.com><2 at mail.gmail.com><3 at mail.gmail.com><4 at mail.gmail.com><5 at mail.gmail.com>"
>  rx_lastmesgid = re.compile(r"(<.+>$)")
>  lastmesgid = "".join( filter( rx_lastmesgid.match, parentid ) )
>  </code>
>
>  I need it to return "<5 at mail.gmail.com>"
>
>  Can anyone help?
>

"<.*>(<.*?>)"

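A rough sketch of how that pattern could be used: the greedy "<.*>" swallows the earlier ids, so group 1 is left holding the last one. It needs at least two ids, with the last two directly adjacent as in your example. The findall() variant (rx_any and last_msgid are names I made up for illustration, not something from your script) is an alternative that should also cope with a single id or with whitespace between ids. The sample values reuse the "at" spelling shown above.

```python
import re

# Pattern from the reply: greedy prefix, then the last <...> in group 1.
rx_last = re.compile(r"<.*>(<.*?>)")

# Alternative (an assumption, not from the thread): collect every <...>
# token and keep the final one.
rx_any = re.compile(r"<[^<>]+>")


def last_msgid(references):
    """Return the last <...> token in a References value, or None."""
    ids = rx_any.findall(references)
    return ids[-1] if ids else None


parentid = "<1 at mail.gmail.com><2 at mail.gmail.com><5 at mail.gmail.com>"

m = rx_last.match(parentid)
print(m.group(1))            # last id via the greedy pattern
print(last_msgid(parentid))  # last id via findall()

# Whitespace between ids: the greedy pattern finds nothing here,
# while the findall() variant still returns the last id.
spaced = "<a at example.com> <b at example.com>"
print(rx_last.match(spaced))
print(last_msgid(spaced))
```

Note that match() returns None when the pattern does not apply, so check the result before calling group(1).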

