Skip to main content

A Model for Detection, Classification and Identification of Spam Mails Using Decision Tree Algorithm

  • Conference paper
Information Technology and Mobile Communication (AIM 2011)

Abstract

Spam mails are unsolicited bulk mails which are meant to fulfill some malicious purpose of the sender. They may cause economical, emotional and time losses to the recipients. Hence there is a need to understand their characteristics and distinguish them from normal in box mails. Decision tree classifier has been trained with the major characteristics of spam mails and results obtained with more then 86.7437% accuracy. This classifier can be a valuable strategy for software developers who are trying to combat this ever growing problem.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Witten, I.H., Frank, E.: Data Mining – Practical machine learning tools and techniques with Java implementations. Morgan Kaufmann, San Francisco (2005)

    MATH  Google Scholar 

  2. Langley, P., Sage, S.: Elements of machine learning. Morgan Kaufmann, San Fracisco (1994)

    Google Scholar 

  3. Weka Data Mining Java Software, http://www.cs.waikato.ac.nz/~ml/weka/

  4. Mike Spykerman – CEO Red Earth Software,Typical spam characteristics How to effectively block spam and junk mail

    Google Scholar 

  5. Langley, P., Sage, S.: Elements of Machine Learning. Morgan Kaufmann, San Fracisco (1994)

    Google Scholar 

  6. Han, J., Kamber, M.: Data Mining:Concepts and Techniques. Morgan Kaufmann, San Francisco (2001)

    MATH  Google Scholar 

  7. Heckerman, D., Geiger, D., Chickering, D.M.: Learning Bayesian network: The combination of knowledge and statistical data. Machine Learning 20(3), 197–243 (1995)

    MATH  Google Scholar 

  8. Pant, B., Pant, K., Pardasani, K.R.: Decision tree cassifier for classiification of plant and animal micro RNA’s. Communications in Computer and Information Science 51, 443–451 (2009), doi:10.1007/978-3-642-04962-0_51

    Article  Google Scholar 

  9. Pant, B., Pant, K., Pardasani, K.R.: Machine Learning Model for Domain Based Classification of MMP’s. The Internet Journal of Genomics and Proteomics 5(2) (2010)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2011 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Pandey, H., Pant, B., Pant, K. (2011). A Model for Detection, Classification and Identification of Spam Mails Using Decision Tree Algorithm. In: Das, V.V., Thomas, G., Lumban Gaol, F. (eds) Information Technology and Mobile Communication. AIM 2011. Communications in Computer and Information Science, vol 147. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-20573-6_93

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-20573-6_93

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-20572-9

  • Online ISBN: 978-3-642-20573-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics