Binary Particle Swarm Optimization Based Feature Selection (BPSO-FS) for Improving Breast Cancer Prediction

  • Arnab Kumar MishraEmail author
  • Pinki Roy
  • Sivaji Bandyopadhyay
Conference paper
Part of the Advances in Intelligent Systems and Computing book series (AISC, volume 1164)


Breast cancer is currently one of the leading causes of cancer-related deaths among women around the world. Although the severity of the disease is undeniable, an efficient early diagnosis of the disease can lead to a much higher chance of survival for the patients. Effective clinical decision support systems (CDSS) could potentially be of very high utility for medical practitioners, in this regard. In this paper, a binary Particle Swarm Optimization (BPSO) based feature selection approach is presented, which can be used to improve the performance of automatic breast cancer prediction CDSS. The key idea is to formulate the problem of feature selection in terms of a discrete optimization problem, with appropriate data-driven objective function. The average cost of bad feature selection is considered as the objective function to minimize, in this work. Multiple evaluation metrics like average prediction accuracy, sensitivity, specificity and area under the curve (AUC) of the receiver operating characteristics (ROC) curve have been considered in this work, to evaluate the performance of the prediction system. Average prediction accuracies of 80.83% and 98.24% have been observed respectively for the Breast Cancer Coimbra Dataset (BCCD) and Breast Cancer Wisconsin Diagnostic Dataset (BCWDD) after feature selection is performed. When pre-feature selection and post feature selection performances are compared, overall improvements of 4–8% and 1–4% have been observed across all the evaluation metrics for BCCD and BCWDD respectively, suggesting the potential applicability of the proposed approach in real diagnostic settings.


Breast cancer Feature selection Machine learning Clinical decision support system Binary particle swarm optimization 



The authors would like to acknowledge TEQIP-III, NIT Silchar for their support.


  1. 1.
    WHO breast cancer statistics, World Health Organization breast cancer diagnosis and screening page (2019). Last accessed 21 Oct 2019
  2. 2.
    U.T.O.E.D., of group, BC, First results on mortality reduction in the UK trial of early detection of breast cancer. Lancet 332(8608), 411–416 (1988)Google Scholar
  3. 3.
    K. Kourou, T.P. Exarchos, K.P. Exarchos, M.V. Karamouzis, D.I. Fotiadis, Machine learning applications in cancer prognosis and prediction. Comput. Struct. Biotechnol. J. 13, 8–17 (2015). Scholar
  4. 4.
    D. Delen, G. Walker, A. Kadam, Predicting breast cancer survivability: a comparison of three data mining methods. Artif. Intell. Med. 34(2), 113–127 (2005). Scholar
  5. 5.
    P.J. Lisboa, A.F. Taktak, The use of artificial neural networks in decision support in cancer: a systematic review. Neural Netw. 19(4), 408–415 (2006). Scholar
  6. 6.
    D. West, P. Mangiameli, R. Rampal, V. West, Ensemble strategies for a medical diagnostic decision support system: a breast cancer diagnosis application. Eur. J. Oper. Res. 162(2), 532–551 (2005). Scholar
  7. 7.
    A.M. Abdel-Zaher, A.M. Eldeib, Breast cancer classification using deep belief networks. Expert Syst. Appl. 46, 139–144 (2016). Scholar
  8. 8.
    G. Litjens, C.I. Snchez, N. Timofeeva, M. Hermsen, I. Nagtegaal, I. Kovacs, J. Van Der Laak, Deep learning as a tool for increased accuracy and efficiency of histopathological diagnosis. Sci. Rep. 6, 26286 (2016). Scholar
  9. 9.
    J. Wang, X. Yang, H. Cai, W. Tan, C. Jin, L. Li, Discrimination of breast cancer with microcalcifications on mammography by deep learning. Sci. Rep. 6, 27327 (2016).
  10. 10.
    J. Arevalo, F.A. Gonzlez, R. Ramos-Polln, J.L. Oliveira, M.A.G. Lopez, Representation learning for mammography mass lesion classification with convolutional neural networks. Comput. Methods Programs Biomed. 127, 248–257 (2016). Scholar
  11. 11.
    A.E. Hassanien, T.H. Kim, Breast cancer MRI diagnosis approach using support vector machine and pulse coupled neural networks. J. Appl. Logic 10(4), 277–284 (2012). Scholar
  12. 12.
    U.R. Acharya, E.Y.K. Ng, J.H. Tan, S.V. Sree, Thermography based breast cancer detection using texture features and support vector machine. J. Med. Syst. 36(3), 1503–1510 (2012). Scholar
  13. 13.
    H.L. Chen, B. Yang, G. Wang, S.J. Wang, J. Liu, D.Y. Liu, Support vector machine based diagnostic system for breast cancer using swarm intelligence. J. Med. Syst. 36(4), 2505–2519 (2012). Scholar
  14. 14.
    R. Ramos-Polln, M.A. Guevara-Lpez, C. Surez-Ortega, G. Daz-Herrero, J.M. Franco-Valiente, M. Rubio-Del-Solar, I. Ramos, Discovering mammography-based machine learning classifiers for breast cancer diagnosis. J. Med. Syst. 36(4), 2259–2269 (2012). Scholar
  15. 15.
    P. Jiang, J. Peng, G. Zhang, E. Cheng, V. Megalooikonomou, H. Ling, Learning-based automatic breast tumor detection and segmentation in ultrasound images, in ISBI, pp. 1587–1590 (2012, May).
  16. 16.
    C. Nguyen, Y. Wang, H.N. Nguyen, Random forest classifier combined with feature selection for breast cancer diagnosis and prognostic. J. Biomed. Sci. Eng. 6(05), 551 (2013). Scholar
  17. 17.
    J. Dheeba, N.A. Singh, S.T. Selvi, Computer-aided detection of breast cancer on mammograms: a swarm intelligence optimized wavelet neural network approach. J. Biomed. Inform. 49, 45–52 (2014). Scholar
  18. 18.
    A.T. Azar, S.A. El-Said, Performance analysis of support vector machines classifiers in breast cancer mammography recognition. Neural Comput. Appl. 24(5), 1163–1177 (2014). Scholar
  19. 19.
    A. Addeh, H. Demirel, P. Zarbakhsh, Early detection of breast cancer using optimized ANFIS and features selection, in 2017 9th International Conference on Computational Intelligence and Communication Networks (CICN), IEEE, pp. 39–42 (2017, Sept).
  20. 20.
    C. Baneriee, S. Paul, M. Ghoshal, A comparative study of different ensemble learning techniques using wisconsin breast cancer data set, in 2017 International Conference on Computer, Electrical Communication Engineering (ICCECE), IEEE, pp. 1–6 (2017, Dec).
  21. 21.
    E. Alikovi, A. Subasi, Breast cancer diagnosis using GA feature selection and rotation forest. Neural Comput. Appl. 28(4), 753–763 (2017). Scholar
  22. 22.
    M. Nilashi, O. Ibrahim, H. Ahmadi, L. Shahmoradi, A knowledge-based system for breast cancer classification using fuzzy logic method. Telematics Inform. 34(4), 133–144 (2017). Scholar
  23. 23.
    M. Patrcio, J. Pereira, J. Crisstomo, P. Matafome, M. Gomes, R. Seia, F. Caramelo, Using Resistin, glucose, age and BMI to predict the presence of breast cancer. BMC Cancer 18(1), 29 (2018). Scholar
  24. 24.
    Y. Li, Z. Chen, Performance evaluation of machine learning methods for breast cancer prediction. Appl. Comput. Math. 7(4), 212–216 (2018)CrossRefGoogle Scholar
  25. 25.
    J. Kennedy, R.C. Eberhart, A discrete binary version of the particle swarm algorithm, in 1997 IEEE International Conference on Systems, Man, and Cybernetics. Computational Cybernetics and Simulation, vol. 5 (IEEE, pp. 4104–4108) (1997, Oct)Google Scholar

Copyright information

© The Editor(s) (if applicable) and The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2021

Authors and Affiliations

  • Arnab Kumar Mishra
    • 1
    Email author
  • Pinki Roy
    • 1
  • Sivaji Bandyopadhyay
    • 1
  1. 1.CSE DepartmentNational Institute of TechnologySilcharIndia

Personalised recommendations