[Spambayes] More HTML strippage.

Neil Schemenauer nas@python.ca
Sat, 28 Sep 2002 20:45:28 -0700


Tim Peters wrote:
> [Mark M. Hoffman]
> > A modest proposal:  tokenize the number of message recipients e.g.
> > header:num_recip_21 for the above.
> 
> Yes, we don't do anything with To, Cc, etc, now, except count the number of
> times (>= 1) each header line appears.

I think it would be better to generate:

    header:num_recip_1
    header:num_recip_2
    header:num_recip_3
    header:num_recip_4
    ...
    header:num_recip_21

We should probably do the same thing when counting headers.  I'll give
it a try.

  Neil