Using K-means Clustering Algorithm with Python Programming for Predicting Breast Cancer

  • Conference paper
Smart Technologies in Data Science and Communication

Abstract

In this paper, we identify the mutated signal transduction pathways in a breast cancer cell and develop a simulated model of these pathways. The simulated pathways include PKB (protein kinase B), MAPK (mitogen-activated protein kinase), mTOR (mammalian target of rapamycin), Fas ligand (a type II transmembrane protein), Notch (a single-pass transmembrane receptor), SHH (Sonic Hedgehog), TNF (tumor necrosis factor) and Wnt (wingless/integrated). The pathways are modeled computationally in SBML (Systems Biology Markup Language) and executed in CellDesigner. The resulting simulated models are XML files. We extracted the information in these XML files into tables and applied information processing techniques to it: cleaning, integration, transformation, reduction and discretization. The K-means clustering algorithm, implemented in Python, was then applied to the extracted data set. Running the code on the data set produced two clusters, one representing benign tumors and the other representing malignant tumors.
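The clustering step described in the abstract can be illustrated with a minimal sketch of K-means (Lloyd's algorithm) for two clusters. This is not the authors' code, which is not shown here. The function name `kmeans`, its parameters, and the rows in `toy_rows` are hypothetical and made up for illustration; they are not data extracted from the paper's XML files.

```python
import math
import random


def kmeans(points, k=2, max_iter=100, seed=0):
    """Plain Lloyd's-algorithm K-means on a list of equal-length numeric tuples."""
    rng = random.Random(seed)
    # Start from k distinct rows chosen at random (a simple initialisation).
    centroids = rng.sample(points, k)
    labels = [0] * len(points)
    for _ in range(max_iter):
        # Assignment step: each point joins its nearest centroid.
        labels = [
            min(range(k), key=lambda c: math.dist(p, centroids[c]))
            for p in points
        ]
        # Update step: each centroid moves to the mean of its members.
        new_centroids = []
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                new_centroids.append(
                    tuple(sum(col) / len(members) for col in zip(*members))
                )
            else:
                # Keep the old centroid if a cluster ends up empty.
                new_centroids.append(centroids[c])
        if new_centroids == centroids:
            break  # assignments are stable, so the algorithm has converged
        centroids = new_centroids
    return labels, centroids


# Hypothetical, made-up feature rows (NOT data from the paper).
toy_rows = [
    (0.10, 0.20), (0.15, 0.22), (0.12, 0.18),
    (0.90, 0.85), (0.88, 0.92), (0.95, 0.80),
]
labels, centroids = kmeans(toy_rows, k=2)
print(labels)
print(centroids)
```

K-means is unsupervised, so the cluster indices it returns are arbitrary. Deciding which cluster corresponds to benign and which to malignant tumors requires a separate interpretation step that this sketch does not attempt.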

Author information

Corresponding author

Correspondence to Prasanna Priya Golagani.

Copyright information

© 2020 Springer Nature Singapore Pte Ltd.

About this paper

Cite this paper

Golagani, P.P., Beebi, S.K., Mahalakshmi, T.S. (2020). Using K-means Clustering Algorithm with Python Programming for Predicting Breast Cancer. In: Fiaidhi, J., Bhattacharyya, D., Rao, N. (eds) Smart Technologies in Data Science and Communication. Lecture Notes in Networks and Systems, vol 105. Springer, Singapore. https://doi.org/10.1007/978-981-15-2407-3_21

  • DOI: https://doi.org/10.1007/978-981-15-2407-3_21

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-15-2406-6

  • Online ISBN: 978-981-15-2407-3

  • eBook Packages: Engineering, Engineering (R0)
