Abstract
Artificial immune systems (AIS) use the concepts and algorithms inspired by the theory of how the human immune system works. This document presents the design and initial evaluation of a new artificial immune system for collaborative spam filtering.
Collaborative spam filtering allows for the detection of not-previously-seen spam content, by exploiting its bulkiness. Our system uses two novel and possibly advantageous techniques for collaborative spam filtering. The first novelty is local processing of the signatures created from the emails prior to deciding whether and which of the generated signatures will be exchanged with other collaborating antispam systems. This processing exploits both the email-content profiles of the users and implicit or explicit feedback from the users, and it uses customized AIS algorithms. The idea is to enable only good quality and effective information to be exchanged among collaborating antispam systems. The second novelty is the representation of the email content, based on a sampling of text strings of a predefined length and at random positions within the emails, and a use of a custom similarity hashing of these strings. Compared to the existing signature generation methods, the proposed sampling and hashing are aimed at achieving a better resistance to spam obfuscation (especially text additions) — which means better detection of spam, and a better precision in learning spam patterns and distinguishing them well from normal text — which means lowering the false detection of good emails.
Initial evaluation of the system shows that it achieves promising detection results under modest collaboration, and that it is rather resistant under the tested obfuscation. In order to confirm our understanding of why the system performed well under this initial evaluation, an additional factorial analysis should be done. Also, evaluation under more sophisticated spammer models is necessary for a more complete assessment of the system abilities.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Cotten W (2001), Preventing delivery of unwanted bulk e-mail, US patent 6,330,590
Damiani E, et al. (2004), An open digest-based technique for spam detection. In Proc. of the 2004 International Workshop on Security in Parallel and Distributed Systems, San Francisco, CA USA
DCC project web page (Jan 2007), http://www.rhyolite.com/anti-spam/dcc/
Graham P (2002), A plan for spam, http://www.paulgraham.com/spam.html
Nilsimsa project web page (Sep 2006), http://lexx.shinn.net/cmeclax/nilsimsa.html
Oda T, White T (2003), Developing an immunity to spam. In: Genetic and Evolutionary Computation Conference, Chicago(GECCO 2003), Proceedings, Part I. Volume 2723 of Lecture Notes in Computer Science, 231–241
Razor project web page (Sep 2006), http://razor.sourceforge.net/
Secker A, Freitas A, Timmis J (2003), AISEC: An Artificial Immune System for Email Classification. In Proceedings of the Congress on Evolutionary Computation, Canberra, IEEE, 131–139
SpamAssassin project web page (Sep 2006), http://spamassassin.apache.org/
SpamAssassin email corpus (Sep 2006), http://spamassassin.apache.org/publiccorpus/
Zhou F, et al. (2003), Approximate object location and spam filtering on peer-to-peer systems. In Proc ACM/IFIP/Usenix Int’l Middleware Conf., LNCS 2672, pp. 1–20
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Sarafijanovic, S., Le Boudec, JY. (2008). Artificial Immune System for Collaborative Spam Filtering. In: Krasnogor, N., Nicosia, G., Pavone, M., Pelta, D. (eds) Nature Inspired Cooperative Strategies for Optimization (NICSO 2007). Studies in Computational Intelligence, vol 129. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-78987-1_4
Download citation
DOI: https://doi.org/10.1007/978-3-540-78987-1_4
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-78986-4
Online ISBN: 978-3-540-78987-1
eBook Packages: EngineeringEngineering (R0)