spamassassin-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From spamassassin-...@incubator.apache.org
Subject [SpamAssassin Wiki] Updated: SiteWideBayesFeedback
Date Wed, 07 Jul 2004 08:30:08 GMT
   Date: 2004-07-07T01:30:07
   Editor: DanKohn <dan@dankohn.com>
   Wiki: SpamAssassin Wiki
   Page: SiteWideBayesFeedback
   URL: http://wiki.apache.org/spamassassin/SiteWideBayesFeedback

   Improved wording, added links

Change Log:

------------------------------------------------------------------------------
@@ -2,17 +2,13 @@
 
 If you want to set up site-wide use of Bayesian classification, you should set up a way for
your users to send in misclassified mail to be "learned" from.
 
-A good way is to set up a mailbox where users can send verified spam, or verified non-spam,
for the learner to learn from.
+If you create mailboxes for false positives and false negatives, you can then run a cron
job intermittently to learn all the mails in that mailbox as spam (or non-spam).  Details
on having your users redirect from mail clients to these mailboxes without mangling the headers
are at ResendingMailWithHeaders.
 
-One issue here is that you will need all the headers of those messages for the learner to
work effectively, including the Received headers. A few mail user agents strip off those headers.
For users using Outlook 2000, get them to use
+= Using Procmail with learning =
 
-{{{
-  (double-click to open the mail in its own window)
-      -> Actions
-      -> Resend This Message
-}}}
+For one approach, see ProcmailToForwardMail.
 
-and bounce it to the mailbox address. Then run a cron job intermittently to learn all the
mails in that mailbox as spam (or non-spam).  Details on redirecting from mail clients are
at ResendingMailWithHeaders.
+= Submitting multiple messages at once =
 
 For those who want an "easier" way, and that also works with Outlook Express AND Outlook...
(This also allows users to submit many mail pieces at once.) This maintains full headers and
bodies. (Or as best as I can tell - someone tell me if I'm wrong!)
 
@@ -22,14 +18,19 @@
 
 (I also setup two "drop boxes" for mail - say qqqspam and qqqnospam - as re-iterated below,
make them difficult to "guess" as you don't want spammers filling up your spam or ham drop.)
 
+= Using procmail to removed forwarding info =
 
 For MUAs (Like Netscape/Mozilla) that do a good job with keeping original headers intact,
(almost) all you need to do is forward the email to the feedback account and strip off the
header added by the forward. See BayesFeedbackViaForwarding for details.
 
+= IMAP folders =
+
 Another option, and one that's easier for most users to use, is to set up two public IMAP
folders on your IMAP server, one for MissedSpam, one for NotSpam.
 
 Then ask your users to move messages that SpamAssassin misses into the MissedSpam folder,
and move messages that SpamAssassin marked incorrectly as spam into the NotSpam folder.
 
 You can then run sa-learn from a cron job over those folders to update the Bayesian databases.

+
+Also see http://wiki.apache.org/spamassassin/RemoteImapFolder
 
 = How to set up site wide aliases on postfix where ham and spam can be sent for learning
with Postfix =
 

Mime
View raw message