Spam emails are causing major resource wastage by unnecessarily flooding the network links. Though many antispam solutions have been implemented, the Bayesian spam score approach looks quite promising. A proposal for spam detection algorithm is presented and its implementation using Java is discussed, along with its performance test results on two independent spam corpuses - Ling-spam and Enron-spam. We use the Bayesian calculation for single keyword sets and multiple keywords sets, along with its keyword contexts to improve the spam detection and thus to get good accuracy.
|Title of host publication
|2009 International Conference on Computer Engineering and Technology
|Number of pages
|Published - 2 Feb 2009