Abstract
Breast cancer is one of the major cancers that is common to women all over the world. Though, the cancer is curable and can be prevented if it is detected in early stages. In medical science, lots of different strategies have been developed to detect and diagnose the cancer patients. Data mining techniques are no far behind and are widely used to extract information from large databases of the cancer patients to discover some patterns making decisions. Classification is one of the data mining techniques that can be used to classify the data in two stages i.e. benign or malignant. This paper presents the Naïve Bayes improved K-Nearest Neighbor method (NBKNN) for breast cancer prediction and compares the results with traditional classifiers like traditional K-nearest Neighbor and naïve Bayes. In the experiments, the standard dataset used is taken from UCI repository. Sensitivity and specificity have been used as accuracy measures for comparing the results. Experimental results show that proposed classifier is better than traditional classifiers.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Ratanachaikanont, T.: Clinical breast examination and its relevance to diagnosis of palpable breast lesion. J. Med. Assoc. Thailand 88(4), 505–507 (2005)
Nover, A.B., Jagtap, S., Anjum, W., et al.: Modern breast cancer detection: a technological review. Int. J. Biomed. Imaging, 2009, 14 p. (2009). Article ID 902326, https://doi.org/10.1155/2009/902326
Witten, I.H., Frank, E.: Data Mining: Practical Machine Learning Tools and Techniques, 2nd edn. Morgan Kaufmann, San Fransisco (2005)
Delen, D., Walker, G., Kadam, A.: Predicting breast cancer survivability: a comparison of three data mining methods. Artif. Intell. Med. 34(2), 113–127 (2005)
Gupta, S., Kumar, D., Sharma, A.: Data mining classification techniques applied for breast cancer diagnosis and prognosis. Indian J. Comput. Sci. Eng. (IJCSE) 2(2), 188–195 (2011)
Kuo, W.-J., Chang, R.-F., Chen, D.-R., Lee, C.C.: Data mining with decision trees for diagnosis of breast tumor in medical ultrasonic images. Breast Cancer Res. Treat. 66(1), 51–57 (2001)
Chaurasia, V., Pal, S.: Data mining techniques: to predict and resolve breast cancer survivability. Int. J. Comput. Sci. Mob. Comput. 3(1), 10–22 (2014)
Venkatesan, E., Velmurugan, T.: Performance analysis of decision tree algorithms for breast cancer classification. Indian J. Sci. Technol. 8(29), 1–8 (2015)
Kharya, S., Agrawal, S., Soni, S.: Naive Bayes classifiers: a probabilistic detection model for breast cancer. Int. J. Comput. Appl. 92(10), 0975–8887 (2014)
Williams, K., Idowu, P.A., Balogun, J.A., Oluwaranti, A.I.: Breast cancer risk prediction using data mining classification techniques. Trans. Netw. Commun. 3(2), 01–11 (2015)
Tomar, D., Agarwal, S.: A survey on data mining approaches for healthcare. Int. J. Bio-Sci. Bio-Technol. 5(5), 241–266 (2013)
Ramadevi, G.N., Rani, K.U., Lavanya, D.: Importance of feature extraction for classification of breast cancer datasets—a study. Int. J. Sci. Innovative Math. Res. 3(2), 368–763 (2015)
Majid, A., Ali, S., Iqbal, M., Kausar, N.: Prediction of human breast and colon cancers from imbalanced data using nearest neighbor and support vector machines. Comput. Meth. Programs Biomed. 113(3), 792–808 (2014)
Ahmad, L.G., Eshlaghy, A.T., Poorebrahimi, A., Ebrahimi, M., Razavi, A.R.: Using three machine learning techniques for predicting breast cancer recurrence. J. Health Med. Inform. 4(124), 3 (2013)
Ramadevi, G.N., Rani, K.U., Lavanya, D.: Evaluation of classifiers performance using resampling on breast cancer data. Int. J. Sci. Eng. Res. 6(2) (2015)
Kotsiantis, S.B., Zaharakis, I., Pintelas, P.: Supervised machine learning: a review of classification techniques. Emerg. Artif. Intell. Appl. Comput. Eng. 160, 3–24 (2007)
Mangasarian, O.L., Wolberg, W.H.: Cancer diagnosis via linear programming. SIAM News 23(5), 1–18 (1990)
Wolberg, W.H., Mangasarian, O.L.: Multisurface method of pattern separation for medical diagnosis applied to breast cytology. In: Proceedings of the National Academy of Sciences, U.S.A., vol. 87, pp. 9193–9196, December 1990
Islam, Md.M., Iqbal, H., Haque, Md.R., Hasan, Md.K.: Prediction of breast cancer using support vector machine and K-nearest neighbors. In: 2017 IEEE Region 10 Humanitarian Technology Conference (R10-HTC), pp. 226–229. IEEE (2017)
Obaid, O.I., Mohammed, M.A., Ghani, M.K.A., Mostafa, S.A., Taha, F.: Evaluating the performance of machine learning techniques in the classification of Wisconsin Breast Cancer. Int. J. Eng. Technol. 7(4.36), 160–166 (2018)
Yue, W., Wang, Z., Chen, H., Payne, A., Liu, X.: Machine learning with applications in breast cancer diagnosis and prognosis. Designs 2, 13 (2018). https://doi.org/10.3390/designs2020013
Gupta, A., Kaushik, B.N.: Feature selection from biological database for breast cancer prediction and detection using machine learning classifier. J. Artif. Intell. 11, 55–64 (2018)
Chawla, S., Kumar, R., Aggarwal, E., Swain, S.: Breast cancer detection using K-nearest neighbor algorithm. Int. J. Comput. Intell. IoT 2(4) (2018)
Cherif, W.: Optimization of K-NN algorithm by clustering and reliability coefficients: application to breast-cancer diagnosis. Proc. Comput. Sci. 127, 293–299 (2018)
Sun, J., et al.: Predicting medical conditions using k-nearest neighbors. University of Nevada, Las Vegas (2017)
Alarabeyyat, A., Alhanahnah, M.: Breast cancer detection using k nearest neighbor machine learning algorithm. In: 2016 9th International Conference on Developments in eSystems Engineering (DeSE), pp. 35–39. IEEE (2016)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Goyal, S., Maheshwar (2019). Naïve Bayes Model Based Improved K-Nearest Neighbor Classifier for Breast Cancer Prediction. In: Luhach, A., Jat, D., Hawari, K., Gao, XZ., Lingras, P. (eds) Advanced Informatics for Computing Research. ICAICR 2019. Communications in Computer and Information Science, vol 1075. Springer, Singapore. https://doi.org/10.1007/978-981-15-0108-1_1
Download citation
DOI: https://doi.org/10.1007/978-981-15-0108-1_1
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-0107-4
Online ISBN: 978-981-15-0108-1
eBook Packages: Computer ScienceComputer Science (R0)