Abstract
Breast cancer is one of the most widespread causes of death among women. According to one estimate, approximately 40,920 women were expected to die of breast cancer in 2018 alone. This number could be reduced if the cancer were diagnosed at an early stage. Advances in technology have made such predictions easier, and machine learning in particular makes it possible to predict diseases from physical or behavioral characteristics. In this paper, we apply several machine learning algorithms: decision trees, k-nearest neighbors (KNN), logistic regression, neural networks (NNs), naïve Bayes, random forest, and support vector machines (SVM). We compare their outcomes on the basis of precision, recall, and F1 score. We then identify the least important features in the dataset, run all of the algorithms again after removing those features, and compare the outcomes of the two implementation stages in order to understand the importance of feature selection in breast cancer prediction.
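The three metrics named in the abstract can be illustrated with a minimal sketch. The function name `precision_recall_f1`, the label encoding (1 for the positive class), and the toy labels below are assumptions made for illustration; they are not taken from the paper, its dataset, or its results.

```python
def precision_recall_f1(y_true, y_pred, positive=1):
    """Return (precision, recall, F1) for the class labelled `positive`.

    Hypothetical helper for illustration; not the paper's implementation.
    """
    pairs = list(zip(y_true, y_pred))
    # Count true positives, false positives, and false negatives.
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)

    # Precision: share of predicted positives that are truly positive.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    # Recall: share of true positives that the classifier found.
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    # F1: harmonic mean of precision and recall.
    if precision + recall:
        f1 = 2 * precision * recall / (precision + recall)
    else:
        f1 = 0.0
    return precision, recall, f1


# Made-up labels (not real data): 3 true positives, 1 false positive,
# 1 false negative.
y_true = [1, 1, 1, 1, 0, 0, 0, 0]
y_pred = [1, 1, 1, 0, 1, 0, 0, 0]
precision, recall, f1 = precision_recall_f1(y_true, y_pred)
print(precision, recall, f1)
```

Computing the same three numbers for each classifier, once on the full feature set and once after the least important features are removed, gives the kind of two-stage comparison the abstract describes.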
Copyright information
© 2019 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Prateek (2019). Breast Cancer Prediction: Importance of Feature Selection. In: Bhatia, S., Tiwari, S., Mishra, K., Trivedi, M. (eds) Advances in Computer Communication and Computational Sciences. Advances in Intelligent Systems and Computing, vol 924. Springer, Singapore. https://doi.org/10.1007/978-981-13-6861-5_62
DOI: https://doi.org/10.1007/978-981-13-6861-5_62
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-6860-8
Online ISBN: 978-981-13-6861-5
eBook Packages: Intelligent Technologies and Robotics (R0)