Skip to main content

An Improved Self-Labeled Algorithm for Cancer Prediction

  • Conference paper
  • First Online:
GeNeDis 2018

Part of the book series: Advances in Experimental Medicine and Biology ((AEMB,volume 1194))

Abstract

Nowadays, cancer constitutes the second leading cause of death globally. The application of an efficient classification model is considered essential in modern diagnostic medicine in order to assist experts and physicians to make more accurate and early predictions and reduce the rate of mortality. Machine learning techniques are being broadly utilized for the development of intelligent computational systems, exploiting the recent advances in digital technologies and the significant storage capabilities of electronic media. Ensemble learning algorithms and semi-supervised algorithms have been independently developed to build efficient and robust classification models from different perspectives. The former attempts to achieve strong generalization by using multiple learners, while the latter attempts to achieve strong generalization by exploiting unlabeled data. In this work, we propose an improved semi-supervised self-labeled algorithm for cancer prediction, based on ensemble methodologies. Our preliminary numerical experiments illustrate the efficacy and efficiency of the proposed algorithm, proving that reliable and robust prediction models could be developed by the adaptation of ensemble techniques in the semi-supervised learning framework.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  • Abdel-Zaher AM, Eldeib AM (2016) Breast cancer classification using deep belief networks. Expert Syst Appl 46:139–144

    Article  Google Scholar 

  • Aha D (1997) Lazy learning. Kluwer academic publishers, Dordrecht

    Book  Google Scholar 

  • Albertina B, Watson M, Holback C, Jarosz R, Kirk S, Lee Y, Lemmerman J (2016) Radiology data from the cancer genome atlas lung adenocarcinoma [TCGA-LUAD] collection. The Cancer Imaging Archive

    Google Scholar 

  • Blum A, Mitchell T (1998) Combining labeled and unlabeled data with co-training. In: 11th annual conference on computational learning theory. ACM, USA, pp 92–100

    Google Scholar 

  • Clark K, Vendt B, Smith K, Freymann J, Kirby J, Koppel P, Moore S, Phillips S, Ma tt D, Pringle M (2013) The Cancer Imaging Archive (TCIA): maintaining and operating a public information repository. J Digit Imaging 26(6):1045–1057

    Article  Google Scholar 

  • DeSantis C, Ma J, Bryan L, Jemal A (2014) Breast cancer statistics, 2013. CA Cancer J Clin 64(1):52–62

    Article  Google Scholar 

  • Dollinger M, Rosenbaum E (2002) Everyone’s guide to cancer therapy: how cancer is diagnosed, treated, and managed day to day. Andrews McMeel Publishing, Kansas City

    Google Scholar 

  • Domingos P, Pazzani M (1997) On the optimality of the simple Bayesian classier under zero-one loss. Mach Learn 29:103–130

    Article  Google Scholar 

  • Edwards A, Elwyn G (2009) Shared decision-making in health care: achieving evidence-based patient choice. Oxford University Press

    Google Scholar 

  • Engelen A, Vanderhaegen J, Van Poppel H, Van Audenhove C (2016) `patients views on using decision support tools: a systematic review. Eur J Pers Cent Healthc 4(1):61–186

    Google Scholar 

  • Esteva A, Kuprel B, Novoa RA, Ko J, Swetter SM, Blau HM, Thrun S (2017) Dermatologist-level classification of skin cancer with deep neural networks. Nature 542(7639):115

    Article  CAS  Google Scholar 

  • Fatima M, Pasha M (2017) Survey of machine learning algorithms for disease diagnostic. J Intell Learn Syst Appl 9(01):1

    Google Scholar 

  • Hady M, Schwenker F (2010) Combining committee-based semi-supervised learning and active learning. J Comput Sci Technol 25(4):681–698

    Article  Google Scholar 

  • Hall M, Frank E, Holmes G, Pfahringer B, Reutemann P, Witten I (2009) The WEKA data mining software: an update. SIGKDD Explorations Newsletters 11:10–18

    Article  Google Scholar 

  • Hussain L, Ahmed A, Saeed S, Rathore S, Awan IA, Shah SA, Majid A, Idris A, Awan A (2018) Prostate cancer detection using machine learning techniques by employing combination of features extracting strategies. J Cancer Biomark 21(2):393–413

    Google Scholar 

  • Li M, Zhou Z (2007) Improve computer-aided diagnosis with machine learning techniques using undiagnosed samples. IEEE Trans Syst Man Cybern Part A Syst Hum 37(6):1088–1098

    Article  CAS  Google Scholar 

  • Livieris I (2019) A new ensemble self-labeled semi-supervised algorithm. Informatica 43(2):1–14, (to be appear)

    Google Scholar 

  • Livieris I, Drakopoulou K, Tampakas V, Mikropoulos T, Pintelas P (2018a) An ensemble-based semi-supervised approach for predicting students’ performance. In: Research on e-learning and ICT in education. Elsevier, Cham

    Google Scholar 

  • Livieris I, Kanavos A, Tampakas V, Pintelas P (2018b) An ensemble SSL algorithm for efficient chest X-ray image classification. J Imaging 4(7)

    Google Scholar 

  • Morvan C, Jenkins WJ (2017) Judgment under uncertainty: heuristics and biases. Macat Library

    Google Scholar 

  • Patrício M, Pereira J, Crisostomo J, Matafome P, Gomes M, Seica R, Caramelo F (2018) Using resistin, glucose, age and BMI to predict the presence of breast cancer. BMC Cancer 18(1):29

    Article  Google Scholar 

  • Platt J (1998) Advances in Kernel methods – support vector learning. MIT Press, Cambridge

    Google Scholar 

  • Revesz D, Engelhardt E, Tamminga J, Schramel FM, Onwuteaka-Philipsen BD, van de Garde E, Steyerberg EW, Jansma E, De Vet HC, Coupe VM (2017) `decision support systems for incurable non-small cell lung cancer: a systematic review. BMC Med Inform Decis Mak 17(1):144

    Article  CAS  Google Scholar 

  • Rokach L (2010) Pattern classification using ensemble methods. World Scientific Publishing Company

    Google Scholar 

  • Sasieni PD, Parkin DM (2018) Global perspectives surrounding cancer prevention and screening. In: Cancer prevention and screening: concepts, principles and controversies, pp 1–18

    Google Scholar 

  • Turgut S, Dagtekin M, Ensari T (2018) Microarray breast cancer data classification using machine learning methods. In: 2018 Electric electronics, computer science, biomedical engineerings’ meeting (EBBT). IEEE, Turkey, pp 1–3

    Google Scholar 

  • Vidic I, Egnell L, Jerome NP, Teruel JR, Sj bakk TE, stlie A, Fj sne HE, Bathen TF, Goa PE (2018) Support vector machine for breast cancer classification using diffusion-weighted mri histogram features: preliminary study. J Magn Reson Imaging 47(5):1205–1216

    Article  Google Scholar 

  • Wu X, Kumar V, Quinlan J, Ghosh J, Yang Q, Motoda H, McLachlan G, Ng A, Liu B, Yu P, Zhou Z, Steinbach M, Hand D, Steinberg D (2008) Top 10 algorithms in data mining. Knowl Inf Syst 14(1):1–37

    Article  Google Scholar 

  • Yarowsky D (1995) Unsupervised word sense disambiguation rivaling supervised methods. In: Proceedings of the 33rd annual meeting of the association for computational linguistics, pp 189–196

    Chapter  Google Scholar 

  • Zhou Y, Goldman S (2004) Democratic co-learning. In: 16th IEEE International Conference on Tools with Artificial Intelligence (ICTAI). IEEE, Boca Raton, FL, USA pp 594–602

    Google Scholar 

  • Zhou Z (2011) When semi-supervised learning meets ensemble learning, vol 6. Springer, pp 6–16

    Google Scholar 

  • Zhou Z, Li M (2005) Tri-training: exploiting unlabeled data using three classifiers. IEEE Trans Knowl Data Eng 17(11):1529–1541

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to Ioannis Livieris or Andreas Kanavos .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Livieris, I., Pintelas, E., Kanavos, A., Pintelas, P. (2020). An Improved Self-Labeled Algorithm for Cancer Prediction. In: Vlamos, P. (eds) GeNeDis 2018. Advances in Experimental Medicine and Biology, vol 1194. Springer, Cham. https://doi.org/10.1007/978-3-030-32622-7_31

Download citation

Publish with us

Policies and ethics