Skip to main content

Feature Selection Using a Machine Learning to Classify a Malware

  • Chapter
  • First Online:
Handbook of Computer Networks and Cyber Security

Abstract

Generally, malware has come to be known as one of the biggest threats, so malware is a program which operates malicious actions and steals information, to specifically identify it as software which is designed specifically to through breaking the system of a computer without consent from the owner. This chapter aimed to study feature selection and malware classification using machine learning. The identification of such features was done through the intuition that various parts of the PE files’ features can correlate with one another less than with the class files, being clean or dirty. Such features are implemented as algorithms in machine learning to help classify the malware, resulting in such classification to be properly implemented in antivirus programs to help enhance the rate of detection.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 299.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Khammas, B. M., Monemi, A., Bassi, J. S., Ismail, I., Nor, S. M., & Marsono, M. N. (2015). Feature selection and machine learning classification for malware detection. Jurnal Teknologi, 77. https://doi.org/10.11113/jt.v77.3558.

  2. Gavriluţ, D., Cimpoeşu, M., Anton, D., & Ciortuz, L. (2009). Malware detection using machine learning. In International Multiconference on Computer Science and Information Technology, 2009. IMCSIT’09 (pp. 735–741). IEEE.

    Google Scholar 

  3. Chumachenko, K. (2017). Machine learning methods for malware detection and classification. XAMK University of Applied Science.

    Google Scholar 

  4. Liu, L., Wang, B.-S., Yu, B., & Zhong, Q.-X. (2017). Automatic malware classification and new malware detection using machine learning. Frontiers of Information Technology & Electronic Engineering, 18, 1336–1347.

    Article  Google Scholar 

  5. Raman, K. (2012). Selecting features to classify malware. In InfoSec Southwest 2012.

    Google Scholar 

  6. Vinod, P., Laxmi, V., & Gaur, M. S. (2012). Reform: Relevant features for malware analysis. In 26th International Conference on Advanced Information Networking and Applications Workshops (WAINA), 2012 (pp. 738–744). IEEE.

    Google Scholar 

  7. Ismail, I., & Elektrik, F. K. (2013). Naive Bayes classification with domain knowledge for new malware variants and stateless packet level detection. Skudai, Malaysia: Universiti Teknologi Malaysia.

    Google Scholar 

  8. Moskovitch, R., Stopel, D., Feher, C., Nissim, N., Elovici, Y. (2008). Unknown malcode detection via text categorization and the imbalance problem. In IEEE International Conference on Intelligence and Security Informatics, 2008. ISI 2008 (pp. 156–161). IEEE.

    Google Scholar 

  9. Moskovitch, R., Stopel, D., Feher, C., Nissim, N., Japkowicz, N., & Elovici, Y. (2009). Unknown malcode detection and the imbalance problem. Journal in Computer Virology, 5, 295.

    Article  Google Scholar 

  10. Reddy, D. K. S., & Pujari, A. K. (2006). N-gram analysis for computer virus detection. Journal in Computer Virology, 2, 231–239.

    Article  Google Scholar 

  11. Liangboonprakong, C., & Sornil, O. (2013). Classification of malware families based on n-grams sequential pattern features. In 8th IEEE Conference on Industrial Electronics and Applications (ICIEA), 2013 (pp. 777–782). IEEE.

    Google Scholar 

  12. Lin, C.-T., Wang, N.-J., Xiao, H., & Eckert, C. (2015). Feature selection and extraction for malware classification. Journal of Information Science and Engineering, 31, 965–992.

    Google Scholar 

  13. Singhal, P., & Raul, N. (2012). Malware detection module using machine learning algorithms to assist in centralized security in enterprise networks. International Journal of Network Security & Its Applications, 4, 61.

    Article  Google Scholar 

  14. Baldangombo, U., Jambaljav, N., Horng, S.-J. (2013). A static malware detection system using data mining methods. arXiv preprint arXiv:13082831.

    Google Scholar 

  15. Alazab, M., Venkatraman, S., Watters, P., & Alazab, M. (2011). Zero-day malware detection based on supervised learning algorithms of API call signatures. In Proceedings of the Ninth Australasian Data Mining Conference (Vol. 121, pp. 171–182). Australian Computer Society, Inc..

    Google Scholar 

  16. Siddiqui, M., Wang, M. C., & Lee, J. (2008). Detecting Trojans using data mining techniques. In International Multi Topic Conference (pp. 400–411). Springer.

    Google Scholar 

  17. Schultz, M. G., Eskin, E., Zadok, F., & Stolfo, S. J. (2001). Data mining methods for detection of new malicious executables. In 2001 IEEE Symposium on Security and Privacy, 2001. S&P 2001. Proceedings (pp. 38–49). IEEE.

    Google Scholar 

  18. Shafiq, M. Z., Tabish, S. M., Mirza, F., & Farooq, M. (2009). PE-Miner: Mining structural information to detect malicious executables in realtime. In Recent advances in intrusion detection (pp. 121–141). Berlin: Springer.

    Chapter  Google Scholar 

  19. Yan, G., Brown, N., & Kong, D. (2013). Exploring discriminatory features for automated malware classification. In International Conference on Detection of Intrusions and Malware, and Vulnerability Assessment (pp. 41–61). Springer.

    Google Scholar 

  20. Alkasassbeh, M. (2017). An empirical evaluation for the intrusion detection features based on machine learning and feature selection methods. Journal of Theoretical and Applied Information Technology, 22, 95.

    Google Scholar 

  21. Almseidin, M., Alzubi, M., Kovacs, S., & Alkasassbeh, M. (2017). Evaluation of machine learning algorithms for intrusion detection system. In 2017 IEEE 15th International Symposium on Intelligent Systems and Informatics (SISY), 14-16 September 2017 (pp. 000277–000282). https://doi.org/10.1109/SISY.2017.8080566.

    Chapter  Google Scholar 

  22. Alauthaman, M., Aslam, N., Zhang, L., Alasem, R., & Hossain, M. (2018). A P2P Botnet detection scheme based on decision tree and adaptive multilayer neural networks. Neural Computing and Applications, 29, 991–1004.

    Article  Google Scholar 

  23. Altaher, A., ALmomani, A., Anbar, M., & Ramadass, S. (2012). Malware detection based on evolving clustering method for classification. Scientific Research and Essays, 7, 2031–2036.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Mohammad Alauthman .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Switzerland AG

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Al-Kasassbeh, M., Mohammed, S., Alauthman, M., Almomani, A. (2020). Feature Selection Using a Machine Learning to Classify a Malware. In: Gupta, B., Perez, G., Agrawal, D., Gupta, D. (eds) Handbook of Computer Networks and Cyber Security. Springer, Cham. https://doi.org/10.1007/978-3-030-22277-2_36

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-22277-2_36

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-22276-5

  • Online ISBN: 978-3-030-22277-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics