Classifying Breast Cancer Based on Machine Learning

  • Conference paper

Part of the book series: Advances in Intelligent Systems and Computing (AISC, volume 1164)

Abstract

Breast cancer is the most prevalent cancer among Indian women and a leading cause of cancer death. Early detection, accurate diagnosis and staging are therefore crucial in managing the disease. This work presents a comparative study of machine learning classifiers for distinguishing benign from malignant breast cancer. It investigates the performance of supervised classification techniques such as logistic regression, support vector machine (SVM), k-nearest neighbour and decision tree. The algorithms are coded in R and executed in RStudio. For performance analysis, specificity, sensitivity and accuracy have been calculated and compared. The SVM classifier gives an accuracy of 99.82%, indicating its suitability over the other classification techniques.
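
The three evaluation measures named in the abstract can all be derived from a binary confusion matrix. The following is a minimal sketch, not the authors' code: the paper's experiments were run in R, whereas this illustration uses Python, treats "malignant" as the positive class (an assumption), and uses made-up labels rather than any data from the study. The names `ConfusionMatrix` and `confusion_from_labels` are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ConfusionMatrix:
    """Counts for a binary classifier; 'malignant' is assumed to be positive."""
    tp: int  # malignant cases predicted malignant
    fn: int  # malignant cases predicted benign
    tn: int  # benign cases predicted benign
    fp: int  # benign cases predicted malignant

    @property
    def sensitivity(self) -> float:
        # Fraction of malignant cases that were detected (true positive rate).
        return self.tp / (self.tp + self.fn)

    @property
    def specificity(self) -> float:
        # Fraction of benign cases correctly left unflagged (true negative rate).
        return self.tn / (self.tn + self.fp)

    @property
    def accuracy(self) -> float:
        # Fraction of all cases classified correctly.
        return (self.tp + self.tn) / (self.tp + self.fn + self.tn + self.fp)


def confusion_from_labels(y_true, y_pred, positive="malignant"):
    """Tally the four confusion-matrix cells from paired true/predicted labels."""
    tp = fn = tn = fp = 0
    for truth, pred in zip(y_true, y_pred):
        if truth == positive:
            if pred == positive:
                tp += 1
            else:
                fn += 1
        else:
            if pred == positive:
                fp += 1
            else:
                tn += 1
    return ConfusionMatrix(tp=tp, fn=fn, tn=tn, fp=fp)


# Illustrative labels only (not data from the paper).
y_true = ["malignant"] * 4 + ["benign"] * 6
y_pred = ["malignant"] * 3 + ["benign"] * 6 + ["malignant"]

cm = confusion_from_labels(y_true, y_pred)
print(f"sensitivity={cm.sensitivity:.3f}")
print(f"specificity={cm.specificity:.3f}")
print(f"accuracy={cm.accuracy:.3f}")
```

Reporting sensitivity and specificity alongside accuracy matters for a task like this, because a classifier can reach high accuracy on an imbalanced data set while still missing malignant cases.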

Author information

Corresponding author

Correspondence to Archana Balyan.

Copyright information

© 2021 The Editor(s) (if applicable) and The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Cite this paper

Balyan, A., Singh, Y., Shashank (2021). Classifying Breast Cancer Based on Machine Learning. In: Bansal, P., Tushir, M., Balas, V., Srivastava, R. (eds) Proceedings of International Conference on Artificial Intelligence and Applications. Advances in Intelligent Systems and Computing, vol 1164. Springer, Singapore. https://doi.org/10.1007/978-981-15-4992-2_4
