[Spambayes] Feedback on sb_dbexpimp.py

Christophe Deprez christophe.deprez at bri.nrc.ca
Thu May 11 15:04:09 CEST 2006


Hi there,

Thanks for spambayes. We've been using our own server-side adaptation of 
spambayes and the amount of spam received in the Institute has 
drastically dropped... for free!

Recently, I've been using the sb_dbexpimp.py utility to merge databases
and I am questionning the "feature" introduced in recent versions, which 
tries different western encodings in the uunquote function:
    for encoding in ("utf-8", "cp1252", "iso-8859-1"):
My point is that when non-ASCII characters are converted in an encoding 
different from utf-8, it is irreversible!
So when exporting and re-importing a database, I don't end up with the 
initial database, as I first expected.
Is this intentional? What is the effect on the filtering efficiency of 
the new filter?

Cheers

-- 
Christophe Deprez                         christophe.deprez at bri.nrc.ca
----------------------------------------------------------------------
Institut de Recherche en Biotechnologies / Biotech. Research Institute
6100 Royalmount, Montréal (QC) H4P 2R2, Canada     Tel: (514) 496-6164 




More information about the SpamBayes mailing list