[Spambayes-checkins] spambayes tokenizer.py,1.59,1.60
Sjoerd Mullender
sjoerd@users.sourceforge.net
Fri Nov 1 16:10:18 2002
Update of /cvsroot/spambayes/spambayes
In directory usw-pr-cvs1:/tmp/cvs-serv13555
Modified Files:
tokenizer.py
Log Message:
Switch " and ' in the url_re character class and add a # ' comment after
the re to resync python-mode.
Index: tokenizer.py
===================================================================
RCS file: /cvsroot/spambayes/spambayes/tokenizer.py,v
retrieving revision 1.59
retrieving revision 1.60
diff -C2 -d -r1.59 -r1.60
*** tokenizer.py 31 Oct 2002 15:43:55 -0000 1.59
--- tokenizer.py 1 Nov 2002 16:10:13 -0000 1.60
***************
*** 604,609 ****
# be in HTML, may or may not be in quotes, etc. If it's full of %
# escapes, cool -- that's a clue too.
! ([^\s<>'"\x7f-\xff]+) # capture the guts
! """, re.VERBOSE)
urlsep_re = re.compile(r"[;?:@&=+,$.]")
--- 604,609 ----
# be in HTML, may or may not be in quotes, etc. If it's full of %
# escapes, cool -- that's a clue too.
! ([^\s<>"'\x7f-\xff]+) # capture the guts
! """, re.VERBOSE) # '
urlsep_re = re.compile(r"[;?:@&=+,$.]")
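
Reordering characters inside a negated character class should not change what it matches, so the swap above affects only how an editor pairs up the quotes. A minimal sketch to check this, assuming only the capture group visible in the diff (the rest of url_re is not shown in this message and is left out; the names old_guts, new_guts, guts, and the sample strings are hypothetical):

```python
import re

# Only the capture group from the diff is reproduced; the surrounding
# parts of url_re do not appear in this message and are omitted.
old_guts = re.compile(r"""
    ([^\s<>'"\x7f-\xff]+)   # capture the guts (revision 1.59 ordering)
""", re.VERBOSE)
new_guts = re.compile(r"""
    ([^\s<>"'\x7f-\xff]+)   # capture the guts (revision 1.60 ordering)
""", re.VERBOSE)  # '

# Hypothetical sample inputs, chosen only for illustration.
samples = [
    "www.example.com/a%20b?x=1",
    "example.com/page'>click",
    'example.com/page">click',
    "plain text",
]


def guts(pattern, text):
    """Return the captured run at the start of text, or None if no match."""
    m = pattern.match(text)
    return m.group(1) if m else None


# Both orderings should capture the same run for every sample.
results = [(guts(old_guts, s), guts(new_guts, s)) for s in samples]
for old, new in results:
    assert old == new
```

The trailing # ' after re.VERBOSE) is an ordinary Python comment and has no effect at run time; per the log message it is there to resync python-mode.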