[Spambayes-checkins] spambayes tokenizer.py,1.41,1.42
Neil Schemenauer
nascheme@users.sourceforge.net
Sat, 28 Sep 2002 11:48:54 -0700
Update of /cvsroot/spambayes/spambayes
In directory usw-pr-cvs1:/tmp/cvs-serv8826
Modified Files:
tokenizer.py
Log Message:
Remove mine_message_ids option since it shouldn't hurt to always have it
enabled.
Index: tokenizer.py
===================================================================
RCS file: /cvsroot/spambayes/spambayes/tokenizer.py,v
retrieving revision 1.41
retrieving revision 1.42
diff -C2 -d -r1.41 -r1.42
*** tokenizer.py 27 Sep 2002 04:06:12 -0000 1.41
--- tokenizer.py 28 Sep 2002 18:48:52 -0000 1.42
***************
*** 984,996 ****
yield 'received:' + tok
! if options.mine_message_ids:
! msgid = msg.get("message-id", "")
! m = message_id_re.match(msgid)
! if not m:
! # might be weird instead of invalid but who cares?
! yield 'message-id:invalid'
! else:
! # looks okay, return the hostname only
! yield 'message-id:@%s' % m.group(1)
# As suggested by Anthony Baxter, merely counting the number of
--- 984,997 ----
yield 'received:' + tok
! # Message-Id: This seems to be a small win and should no
! # adversely affect a mixed source corpus so it's always enabled.
! msgid = msg.get("message-id", "")
! m = message_id_re.match(msgid)
! if m:
! # looks okay, return the hostname
! yield 'message-id:@%s' % m.group(1)
! else:
! # might be weird instead of invalid but who cares?
! yield 'message-id:invalid'
# As suggested by Anthony Baxter, merely counting the number of