[Spambayes] Non-email use of the spambayes project

Fredrik Rodland spambayes at rodland.no
Fri Mar 28 07:55:00 EST 2003


> -----Original Message-----
> From: spambayes-bounces at python.org
> [mailto:spambayes-bounces at python.org]On Behalf Of Skip Montanaro
> Sent: 28. mars 2003 13:20
> To: Moore, Paul
> Cc: python-list at python.org; spambayes at python.org
> Subject: RE: [Spambayes] Non-email use of the spambayes project
>
> important I think is to document how to use it as I did.  So much
> of what is
> there now is so strongly tied to classifying email messages that it's easy
> to lose sight of how well it can be applied to other
> classifcation problems.


Totally agree!

also, for us who're not completely into Python, it would be great with some
sort of cookbook/skeletons/APIs available and documented.  I tried to read
your original code, but gave up after a while.  I have a similar situation,
having a database with 100.000 people in it, with quite a few rows not being
real persons.  It'd be gresat to try to use the spambayes code on this.

The concept should be fairly common so that one could write a script/program
in any language.

At least what I'm picturing is write a script wich loops over the dataset,
construct some kind of concatinated string, and passing this as argument to
one of three procedures/methods/scripts:

A. classify as spam
B. classify as ham
C. get_score



Fredrik


--
Fredrik Rodland	Technical Architect, Stocknet, Oslo, Norway
Stocknet:		http://www.stocknet.com		phone: +47 23 28 40 17
Private:		http://rodland.no			phone: +47 99 21 98 17






More information about the Python-list mailing list