Predicting Postoperative Complications for Gastric Cancer Patients Using Data Mining
Gastric cancer refers to the development of malign cells that can grow in any part of the stomach. With the vast amount of data being collected daily in healthcare environments, it is possible to develop new algorithms which can support the decision-making processes in gastric cancer patients treatment. This paper aims to predict, using the CRISP-DM methodology, the outcome from the hospitalization of gastric cancer patients who have undergone surgery, as well as the occurrence of postoperative complications during surgery. The study showed that, on one hand, the RF and NB algorithms are the best in the detection of an outcome of hospitalization, taking into account patients’ clinical data. On the other hand, the algorithms J48, RF, and NB offer better results in predicting postoperative complications.
KeywordsData Mining Clinical Decision Support Systems CRISP-DM Gastric cancer WEKA
This work has been supported by Compete: POCI-01-0145-FEDER-007043 and FCT within the Project Scope UID/CEC/00319/2013.
- 1.Biglarian, A., Hajizadeh, E., Kazemnejad, A., Zali, M.R.: Application of artificial neural network in predicting the survival rate of gastric cancer patients. Iran. J. Public Health 40(2), 80–86 (2011)Google Scholar
- 7.Koh, H.C., Tan, G.: Data mining applications in healthcare. J. Healthc. Inf. Manag. 19(2), 64–72 (2011)Google Scholar
- 11.Bâra, A., Lungu, I.: Improving decision support systems with data mining techniques. In: Advances in Data Mining Knowledge Discovery and Applications. INTECH Open Access Publisher, pp. 397–418 (2012)Google Scholar
- 13.Beeler, P., Bates, D., Hug, B.: Clinical decision support systems. Swiss Med. Wkly 144, w14073 (2014)Google Scholar
- 14.Trowbridge, R., Weingarten, S.: Clinical decision support systems [Internet], Chap. 53. United States Department of Health & Human Services Agency for Healthcare Research and Quality (2001). https://archive.ahrq.gov/clinic/ptsafety/chap53.htm. Accessed 6 May 2018
- 17.Chen, C., Liaw, A., Breiman, L.: Using random forest to learn imbalanced data (2004)Google Scholar
- 19.Khoshgoftaar, T., Golawala, M., Hulse, J.: An empirical study of learning from imbalanced data using random forest. In: 19th IEEE International Conference on Tools with Artificial Intelligence (ICTAI 2007) (2007)Google Scholar
- 21.Platt, J.: Sequential minimal optimization: a fast algorithm for training support vector machines (1998)Google Scholar
- 24.Rajput, A., Aharwal, R., Dubey, M., Saxena, S., Raghuvanshi, M.: J48 and JRIP rules for e-governance data. Int. J. Comput. Sci. Secur. (IJCSS) 5(2), 201–207 (2011)Google Scholar
- 25.Mohamed, W., Salleh, M., Omar, A.: A comparative study of reduced error pruning method in decision tree algorithms. In: 2012 IEEE International Conference on Control System, Computing and Engineering, pp. 392–397 (2012)Google Scholar