Before you can use SpamSieve, you should give it some examples of messages you consider to be spam, and ones which you do not. You do this by selecting some messages and choosing a training command from the menu. SpamSieve collects information from the messages it’s trained with into its corpus, which it uses to predict whether subsequent messages are spam. Don’t worry; it learns quickly!
Go to SpamSieve’s Filter menu and choose Show Statistics. The Corpus section at the bottom of the Statistics window shows how many good and spam messages SpamSieve has been trained with, and what percentage of them are spam. After the initial training, SpamSieve will automatically train itself, and you’ll only need to train it to correct mistakes.
To do the initial training, use both spam and good messages subject to two requirements:
After the initial training, you don’t have to worry about the number or percentage of messages in the corpus. SpamSieve will automatically learn from new messages as they arrive and keep its corpus properly balanced.
Accuracy will improve with time, but if you’ve used at least 100 or so messages in the initial training, SpamSieve should immediately start moving some of the incoming spam messages to your spam folder. If you don’t see results right away, check the setup in your mail program. After a few hundred messages of each type are in the corpus, SpamSieve should be catching most of your spam.
Now you’re done setting up SpamSieve. The Correct All Mistakes section explains how you can keep SpamSieve’s accuracy high by telling it if it puts any messages in the wrong mailbox.