Abstract
Spam mails are unsolicited bulk mails which are meant to fulfill some malicious purpose of the sender. They may cause economical, emotional and time losses to the recipients. Hence there is a need to understand their characteristics and distinguish them from normal in box mails. Decision tree classifier has been trained with the major characteristics of spam mails and results obtained with more then 86.7437% accuracy. This classifier can be a valuable strategy for software developers who are trying to combat this ever growing problem.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Witten, I.H., Frank, E.: Data Mining – Practical machine learning tools and techniques with Java implementations. Morgan Kaufmann, San Francisco (2005)
Langley, P., Sage, S.: Elements of machine learning. Morgan Kaufmann, San Fracisco (1994)
Weka Data Mining Java Software, http://www.cs.waikato.ac.nz/~ml/weka/
Mike Spykerman – CEO Red Earth Software,Typical spam characteristics How to effectively block spam and junk mail
Langley, P., Sage, S.: Elements of Machine Learning. Morgan Kaufmann, San Fracisco (1994)
Han, J., Kamber, M.: Data Mining:Concepts and Techniques. Morgan Kaufmann, San Francisco (2001)
Heckerman, D., Geiger, D., Chickering, D.M.: Learning Bayesian network: The combination of knowledge and statistical data. Machine Learning 20(3), 197–243 (1995)
Pant, B., Pant, K., Pardasani, K.R.: Decision tree cassifier for classiification of plant and animal micro RNA’s. Communications in Computer and Information Science 51, 443–451 (2009), doi:10.1007/978-3-642-04962-0_51
Pant, B., Pant, K., Pardasani, K.R.: Machine Learning Model for Domain Based Classification of MMP’s. The Internet Journal of Genomics and Proteomics 5(2) (2010)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Pandey, H., Pant, B., Pant, K. (2011). A Model for Detection, Classification and Identification of Spam Mails Using Decision Tree Algorithm. In: Das, V.V., Thomas, G., Lumban Gaol, F. (eds) Information Technology and Mobile Communication. AIM 2011. Communications in Computer and Information Science, vol 147. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-20573-6_93
Download citation
DOI: https://doi.org/10.1007/978-3-642-20573-6_93
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-20572-9
Online ISBN: 978-3-642-20573-6
eBook Packages: Computer ScienceComputer Science (R0)