Abstract
Nowadays, cancer constitutes the second leading cause of death globally. The application of an efficient classification model is considered essential in modern diagnostic medicine in order to assist experts and physicians to make more accurate and early predictions and reduce the rate of mortality. Machine learning techniques are being broadly utilized for the development of intelligent computational systems, exploiting the recent advances in digital technologies and the significant storage capabilities of electronic media. Ensemble learning algorithms and semi-supervised algorithms have been independently developed to build efficient and robust classification models from different perspectives. The former attempts to achieve strong generalization by using multiple learners, while the latter attempts to achieve strong generalization by exploiting unlabeled data. In this work, we propose an improved semi-supervised self-labeled algorithm for cancer prediction, based on ensemble methodologies. Our preliminary numerical experiments illustrate the efficacy and efficiency of the proposed algorithm, proving that reliable and robust prediction models could be developed by the adaptation of ensemble techniques in the semi-supervised learning framework.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Abdel-Zaher AM, Eldeib AM (2016) Breast cancer classification using deep belief networks. Expert Syst Appl 46:139–144
Aha D (1997) Lazy learning. Kluwer academic publishers, Dordrecht
Albertina B, Watson M, Holback C, Jarosz R, Kirk S, Lee Y, Lemmerman J (2016) Radiology data from the cancer genome atlas lung adenocarcinoma [TCGA-LUAD] collection. The Cancer Imaging Archive
Blum A, Mitchell T (1998) Combining labeled and unlabeled data with co-training. In: 11th annual conference on computational learning theory. ACM, USA, pp 92–100
Clark K, Vendt B, Smith K, Freymann J, Kirby J, Koppel P, Moore S, Phillips S, Ma tt D, Pringle M (2013) The Cancer Imaging Archive (TCIA): maintaining and operating a public information repository. J Digit Imaging 26(6):1045–1057
DeSantis C, Ma J, Bryan L, Jemal A (2014) Breast cancer statistics, 2013. CA Cancer J Clin 64(1):52–62
Dollinger M, Rosenbaum E (2002) Everyone’s guide to cancer therapy: how cancer is diagnosed, treated, and managed day to day. Andrews McMeel Publishing, Kansas City
Domingos P, Pazzani M (1997) On the optimality of the simple Bayesian classier under zero-one loss. Mach Learn 29:103–130
Edwards A, Elwyn G (2009) Shared decision-making in health care: achieving evidence-based patient choice. Oxford University Press
Engelen A, Vanderhaegen J, Van Poppel H, Van Audenhove C (2016) `patients views on using decision support tools: a systematic review. Eur J Pers Cent Healthc 4(1):61–186
Esteva A, Kuprel B, Novoa RA, Ko J, Swetter SM, Blau HM, Thrun S (2017) Dermatologist-level classification of skin cancer with deep neural networks. Nature 542(7639):115
Fatima M, Pasha M (2017) Survey of machine learning algorithms for disease diagnostic. J Intell Learn Syst Appl 9(01):1
Hady M, Schwenker F (2010) Combining committee-based semi-supervised learning and active learning. J Comput Sci Technol 25(4):681–698
Hall M, Frank E, Holmes G, Pfahringer B, Reutemann P, Witten I (2009) The WEKA data mining software: an update. SIGKDD Explorations Newsletters 11:10–18
Hussain L, Ahmed A, Saeed S, Rathore S, Awan IA, Shah SA, Majid A, Idris A, Awan A (2018) Prostate cancer detection using machine learning techniques by employing combination of features extracting strategies. J Cancer Biomark 21(2):393–413
Li M, Zhou Z (2007) Improve computer-aided diagnosis with machine learning techniques using undiagnosed samples. IEEE Trans Syst Man Cybern Part A Syst Hum 37(6):1088–1098
Livieris I (2019) A new ensemble self-labeled semi-supervised algorithm. Informatica 43(2):1–14, (to be appear)
Livieris I, Drakopoulou K, Tampakas V, Mikropoulos T, Pintelas P (2018a) An ensemble-based semi-supervised approach for predicting students’ performance. In: Research on e-learning and ICT in education. Elsevier, Cham
Livieris I, Kanavos A, Tampakas V, Pintelas P (2018b) An ensemble SSL algorithm for efficient chest X-ray image classification. J Imaging 4(7)
Morvan C, Jenkins WJ (2017) Judgment under uncertainty: heuristics and biases. Macat Library
PatrÃcio M, Pereira J, Crisostomo J, Matafome P, Gomes M, Seica R, Caramelo F (2018) Using resistin, glucose, age and BMI to predict the presence of breast cancer. BMC Cancer 18(1):29
Platt J (1998) Advances in Kernel methods – support vector learning. MIT Press, Cambridge
Revesz D, Engelhardt E, Tamminga J, Schramel FM, Onwuteaka-Philipsen BD, van de Garde E, Steyerberg EW, Jansma E, De Vet HC, Coupe VM (2017) `decision support systems for incurable non-small cell lung cancer: a systematic review. BMC Med Inform Decis Mak 17(1):144
Rokach L (2010) Pattern classification using ensemble methods. World Scientific Publishing Company
Sasieni PD, Parkin DM (2018) Global perspectives surrounding cancer prevention and screening. In: Cancer prevention and screening: concepts, principles and controversies, pp 1–18
Turgut S, Dagtekin M, Ensari T (2018) Microarray breast cancer data classification using machine learning methods. In: 2018 Electric electronics, computer science, biomedical engineerings’ meeting (EBBT). IEEE, Turkey, pp 1–3
Vidic I, Egnell L, Jerome NP, Teruel JR, Sj bakk TE, stlie A, Fj sne HE, Bathen TF, Goa PE (2018) Support vector machine for breast cancer classification using diffusion-weighted mri histogram features: preliminary study. J Magn Reson Imaging 47(5):1205–1216
Wu X, Kumar V, Quinlan J, Ghosh J, Yang Q, Motoda H, McLachlan G, Ng A, Liu B, Yu P, Zhou Z, Steinbach M, Hand D, Steinberg D (2008) Top 10 algorithms in data mining. Knowl Inf Syst 14(1):1–37
Yarowsky D (1995) Unsupervised word sense disambiguation rivaling supervised methods. In: Proceedings of the 33rd annual meeting of the association for computational linguistics, pp 189–196
Zhou Y, Goldman S (2004) Democratic co-learning. In: 16th IEEE International Conference on Tools with Artificial Intelligence (ICTAI). IEEE, Boca Raton, FL, USA pp 594–602
Zhou Z (2011) When semi-supervised learning meets ensemble learning, vol 6. Springer, pp 6–16
Zhou Z, Li M (2005) Tri-training: exploiting unlabeled data using three classifiers. IEEE Trans Knowl Data Eng 17(11):1529–1541
Author information
Authors and Affiliations
Corresponding authors
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Livieris, I., Pintelas, E., Kanavos, A., Pintelas, P. (2020). An Improved Self-Labeled Algorithm for Cancer Prediction. In: Vlamos, P. (eds) GeNeDis 2018. Advances in Experimental Medicine and Biology, vol 1194. Springer, Cham. https://doi.org/10.1007/978-3-030-32622-7_31
Download citation
DOI: https://doi.org/10.1007/978-3-030-32622-7_31
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-32621-0
Online ISBN: 978-3-030-32622-7
eBook Packages: Biomedical and Life SciencesBiomedical and Life Sciences (R0)