Architecture of Adaptive Spam Filtering Based on Machine Learning Algorithms

Islam, Md Rafiqul; Zhou, Wanlei

doi:10.1007/978-3-540-72905-1_41

Md Rafiqul Islam¹ &
Wanlei Zhou¹

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 4494))

Included in the following conference series:

International Conference on Algorithms and Architectures for Parallel Processing

612 Accesses
6 Citations

Abstract

Spam is commonly defined as unsolicited email messages and the goal of spam filtering is to distinguish between spam and legitimate email messages. Much work has been done to filter spam from legitimate emails using machine learning algorithm and substantial performance has been achieved with some amount of false positive (FP) tradeoffs. In the case of spam detection FP problem is unacceptable sometimes. In this paper, an adaptive spam filtering model has been proposed based on Machine learning (ML) algorithms which will get better accuracy by reducing FP problems. This model consists of individual and combined filtering approach from existing well known ML algorithms. The proposed model considers both individual and collective output and analyzes them by an analyzer. A dynamic feature selection (DFS) technique also proposed in this paper for getting better accuracy.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Islam, R., Chowdhury, M., Zhou, W.: An Innovative Spam Filtering Model Based on Support Vector Machine. In: Proceedings of the IEEE International Conference on Intelligent Agents, Web Technologies and Internet Commerce, vol. 2, pp. 348–353 (2005)
Google Scholar
Cristianini, N., Shawe-Taylor, J.: An introduction to Support Vector Machines and other kernel-based learning methods. Cambridge University Press, Cambridge (2000)
Google Scholar
Cohen, W., Singer, Y.: Context-sensitive learning methods for text categorization. ACM Transactions on Information Systems 17(2), 141–173 (1999)
Article Google Scholar
Kaitarai, H.: Filtering Junk e-mail: A performance comparison between genetic programming and naïve bayes. Tech. Report, Department of Electrical and Computer Engineering, University of Waterloo (November 1999)
Google Scholar
Androutsopoulos, I., et al.: Learning to filter spam e-mail: A comparison of a Naive Bayesian and a memory-based approach. In: Proceedings of the Workshop on Machine Learning and Textual Information Access, 4th European Conference on Principles and Practice of Knowledge Discovery in Databases. Lyon, France, pp. 1–13 (2000)
Google Scholar
Zhang, J., et al.: A Modified logistic regression: An approximation to SVM and its applications in large-scale text categorization. In: Proceedings of the 20th International Conference on Machine Learning, pp. 888–895. AAAI Press, California (2003)
Google Scholar
Sahami, M., Dumais, S., Heckerman, D., Horvitz, E.: A bayesian approach to filtering junk e-mail. In Learning for Text Categorization. Papers from the Workshop, Madison, Wisconsin, AAAI Technical Report WS, pp. 98–105 (1998)
Google Scholar
Drucker, H., Shahrary, B., Gibbon, D.C.: Support vector machines: relevance feedback and information retrieval. Inform. Process. Manag. 38(3), 305–323 (2003)
Article Google Scholar
Huan, L., Lei, Y.: Toward Integrating Feature Selection Algorithms for Classification and Clustering. IEEE Transaction on Knowledge and Data Engg. 17(4), 491–502 (2005)
Article Google Scholar

Download references

Author information

Authors and Affiliations

School of Engineering and Information Technology, Deakin University, Melbourne, Australia
Md Rafiqul Islam & Wanlei Zhou

Authors

Md Rafiqul Islam
View author publications
You can also search for this author in PubMed Google Scholar
Wanlei Zhou
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Hai Jin Omer F. Rana Yi Pan Viktor K. Prasanna

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Islam, M.R., Zhou, W. (2007). Architecture of Adaptive Spam Filtering Based on Machine Learning Algorithms. In: Jin, H., Rana, O.F., Pan, Y., Prasanna, V.K. (eds) Algorithms and Architectures for Parallel Processing. ICA3PP 2007. Lecture Notes in Computer Science, vol 4494. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-72905-1_41

Download citation

DOI: https://doi.org/10.1007/978-3-540-72905-1_41
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-72904-4
Online ISBN: 978-3-540-72905-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics