Neural Recognition and Genetic Features Selection for Robust Detection of E-Mail Spam

Gavrilis, Dimitris; Tsoulos, Ioannis G.; Dermatas, Evangelos

doi:10.1007/11752912_54

Dimitris Gavrilis²²,
Ioannis G. Tsoulos²³ &
Evangelos Dermatas²²

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 3955))

Included in the following conference series:

Hellenic Conference on Artificial Intelligence

1774 Accesses
11 Citations

Abstract

In this paper a method for feature selection and classification of email spam messages is presented. The selection of features is performed in two steps: The selection is performed by measuring their entropy and a fine-tuning selection is implemented using a genetic algorithm. In the classification process, a Radial Basis Function Network is used to ensure robust classification rate even in case of complex cluster structure. The proposed method shows that, when using a two-level feature selection, a better accuracy is achieved than using one-stage selection. Also, the use of a lemmatizer or a stop-word list gives minimal classification improvement. The proposed method achieves 96-97% average accuracy when using only 20 features out of 15000.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Spam detection through feature selection using artificial neural network and sine–cosine algorithm

Article 30 April 2020

Grasshopper Optimization Algorithm Based Spam Detection System Using Multi-Objective Wrapper Feature Selection and Neural Network Classification

Email Spam Detection Using Machine Learning and Feature Optimization Method

References

Sakkis, G., Androutsopoulos, I., Paliouras, G., Karkaletsis, V., Spyropoulos, C.D., Stamatopoulos, P.: A memory-based approach to anti-spam filtering for mailing lists. Information Retrieval 6, 49–73 (2003)
Article Google Scholar
Androutsopoulos, I., Koutsias, J., Chandrinos, K.V., Paliouras, G., Spyropoulos, C.D.: An Evaluation of Naive Bayesian Anti-Spam Filtering. In: Proc. of the workshop on Machine Learning in the New Information Age (2000)
Google Scholar
Lee, D.L., Chuang, H., Seamons, K.: Document Ranking and the Vector-Space Model. IEEE Software 14, 67–75 (1997)
Article Google Scholar
Steinbach, M., Karypis, G., Kumar, V.: A Comparison of Document Clustering Techniques. In: KDD Workshop on Text Mining (2000)
Google Scholar
Michelakis, E., Androutsopoulos, I., Paliouras, G., Sakkis, G., Stamatopoulos, P.: Filtron: A Learning-Based Anti-Spam Filter. In: Proc. of the 1st Conference on Email and Anti-Spam (2004)
Google Scholar
Gavrilis, D., Tsoulos, I., Dermatas, E.: Stochastic Classification of Scientific Abstracts. In: Proceedings of the 6th Speech and Computer Conference, Patra (2005)
Google Scholar
Pierre, J.M.: On the Automated Classification of Web Sites. Linkoping Electronic Articles in Computer and Information Science 6 (2001)
Google Scholar
Deerwester, S., Dumais, S., Furnas, G., Landauer, T., Harshman, R.: Indexing by latent semantic analysis. Journal of the American Society for Information Science 46, 391–407 (1990)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Electrical & Computer Engineering, University of Patras, Greece
Dimitris Gavrilis & Evangelos Dermatas
Computer Science Department, University of Ioannina, Greece
Ioannis G. Tsoulos

Authors

Dimitris Gavrilis
View author publications
You can also search for this author in PubMed Google Scholar
Ioannis G. Tsoulos
View author publications
You can also search for this author in PubMed Google Scholar
Evangelos Dermatas
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Computer Science Department of University of Crete, Greece
Grigoris Antoniou
Institute of Computer Science, Foundation for Research & Technology – Hellas (FORTH), Vassilika Vouton, P.O. Box 1385, 71110, Heraklion, Greece
George Potamias
Institute of Informatics and Telecommunications, NCSR "Demokritos", 15310 A., Paraskevi Attikis, Greece
Costas Spyropoulos
Institute of Computer Science, FO.R.T.H., Vassilika Vouton, P.O. Box 1385, GR 71110, Heraklion, Greece
Dimitris Plexousakis

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Gavrilis, D., Tsoulos, I.G., Dermatas, E. (2006). Neural Recognition and Genetic Features Selection for Robust Detection of E-Mail Spam. In: Antoniou, G., Potamias, G., Spyropoulos, C., Plexousakis, D. (eds) Advances in Artificial Intelligence. SETN 2006. Lecture Notes in Computer Science(), vol 3955. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11752912_54

Download citation

DOI: https://doi.org/10.1007/11752912_54
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-34117-8
Online ISBN: 978-3-540-34118-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Neural Recognition and Genetic Features Selection for Robust Detection of E-Mail Spam

Abstract

Access this chapter

Preview

Similar content being viewed by others

Spam detection through feature selection using artificial neural network and sine–cosine algorithm

Grasshopper Optimization Algorithm Based Spam Detection System Using Multi-Objective Wrapper Feature Selection and Neural Network Classification

Email Spam Detection Using Machine Learning and Feature Optimization Method

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Neural Recognition and Genetic Features Selection for Robust Detection of E-Mail Spam

Abstract

Access this chapter

Preview

Similar content being viewed by others

Spam detection through feature selection using artificial neural network and sine–cosine algorithm

Grasshopper Optimization Algorithm Based Spam Detection System Using Multi-Objective Wrapper Feature Selection and Neural Network Classification

Email Spam Detection Using Machine Learning and Feature Optimization Method

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation