Abstract
This paper proposes a novel solution to spam detection inspired by a model of the adaptive immune system known as the cross-regulation model. We report on the testing of a preliminary algorithm on six e-mail corpora. We also compare our results statically and dynamically with those obtained by the Naive Bayes classifier and another binary classification method we developed previously for biomedical text-mining applications. We show that the cross-regulation model is competitive against those and thus promising as a bio-inspired algorithm for spam detection in particular, and binary classification in general.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Abi-Haidar, A., Kaur, J., Maguitman, A., Radivojac, P., Retchsteiner, A., Verspoor, K., Wang, Z., Rocha, L.: Uncovering protein-protein interactions in abstracts and text using linear models and word proximity networks. Genome Biology (in press, 2008)
Androutsopoulos, I., Koutsias, J., Chandrinos, K., Spyropoulos, C.: An experimental comparison of naive Bayesian and keyword-based anti-spam filtering with personal e-mail messages. ACM Press, New York (2000b)
Bezerra, G., Barra, T.: An Immunological Filter for Spam. In: Bersini, H., Carneiro, J. (eds.) ICARIS 2006. LNCS, vol. 4163, pp. 446–458. Springer, Heidelberg (2006)
Boykin, P., Roychowdhury, V.: Leveraging social networks to fight spam. Computer 38(4), 61–68 (2005)
Carneiro, J., Leon, K., Caramalho, Í., van den Dool, C., Gardner, R., Oliveira, V., Bergman, M., Sepúlveda, N., Paixão, T., Faro, J., et al.: When three is not a crowd: a Crossregulation Model of the dynamics and repertoire selection of regulatory CD4 T cells. Immunological Reviews 216(1), 48–68 (2007)
Chirita, P., Diederich, J., Nejdl, W.: MailRank: using ranking for spam detection. In: Proceedings of the 14th ACM international conference on Information and knowledge management, pp. 373–380 (2005)
Delany, S.J., Cunningham, P., Smyth, B.: Ecue: A spam filter that uses machine leaming to track concept drift. In: Brewka, G., Coradeschi, S., Perini, A., Traverso, P. (eds.) ECAI 2006, 17th European Conference on Artificial Intelligence, PAIS 2006, Proceedings, pp. 627–631. IOS Press, Amsterdam (2006a)
Delany, S.J., Cunningham, P., Tsymbal, A., Coyle, L.: A case-based technique for tracking concept drift in spam filtering. Knowledge-Based Systems 18(4-5), 187–195 (2005)
Fdez-Riverola, F., Iglesias, E., Díaz, F., Méndez, J., Corchado, J.: SpamHunting: An instance-based reasoning system for spam labelling and filtering. Decision Support Systems 43(3), 722–736 (2007)
Feldman, R., Sanger, J.: The Text Mining Handbook: advanced approaches in analyzing unstructured data. Cambridge University Press, Cambridge (2006)
Hofmeyr, S.: An Interpretative Introduction to the Immune System. Design Principles for the Immune System and Other Distributed Autonomous Systems (2001)
Jensen, F., Jensen, F., Jensen, F.: Introduction to Bayesian Networks. Springer, New York (1996)
Kolcz, A., Alspector, J.: SVM-based filtering of e-mail spam with content-specific misclassification costs. In: Proceedings of the TextDM, pp. 1–14 (2001)
Méndez, J., Fdez-Riverola, F., Iglesias, E., Díaz, F., Corchado, J.: Tracking Concept Drift at Feature Selection Stage in SpamHunting: an Anti-Spam Instance-Based Reasoning System. In: Roth-Berghofer, T.R., Göker, M.H., Güvenir, H.A. (eds.) ECCBR 2006. LNCS (LNAI), vol. 4106, pp. 504–518. Springer, Heidelberg (2006)
Metsis, V., Androutsopoulos, I., Paliouras, G.: Spam Filtering with Naive Bayes–Which Naive Bayes? In: Third Conference on Email and Anti-Spam (CEAS), pp. 125–134 (2006)
Meyer, T.A., Whateley, B.: SpamBayes: Effective open-source, Bayesian based, email classification system. In: Proceedings of the First Conference on Email and Anti-Spam (CEAS) (2004), http://ceas.cc/papers-2004/136.pdf
Oda, T.: A Spam-Detecting Artificial Immune System. Masters thesis, Carleton University (2005)
Sahami, M., Dumais, S., Heckerman, D., Horvitz, E.: A Bayesian approach to filtering junk e-mail. In: Learning for Text Categorization: Papers from the 1998 Workshop, pp. 55–62 (1998)
Tsymbal, A.: The problem of concept drift: definitions and related work. Informe técnico: TCD-CS-2004-15, Departament of Computer Science Trinity College, Dublin, 4(15) (2004)
Yue, X., Abraham, A., Chi, Z., Hao, Y., Mo, H.: Artificial immune system inspired behavior-based anti-spam filter. Soft Computing-A Fusion of Foundations, Methodologies and Applications 11(8), 729–740 (2007)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Abi-Haidar, A., Rocha, L.M. (2008). Adaptive Spam Detection Inspired by a Cross-Regulation Model of Immune Dynamics: A Study of Concept Drift. In: Bentley, P.J., Lee, D., Jung, S. (eds) Artificial Immune Systems. ICARIS 2008. Lecture Notes in Computer Science, vol 5132. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-85072-4_4
Download citation
DOI: https://doi.org/10.1007/978-3-540-85072-4_4
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-85071-7
Online ISBN: 978-3-540-85072-4
eBook Packages: Computer ScienceComputer Science (R0)