Skip to main content

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 476))

Included in the following conference series:

Abstract

In this work a 5-year survival prediction model was developed for colon cancer using machine learning methods. The model was based on the SEER dataset which, after preprocessing, consisted of 38,592 records of colon cancer patients. Survival prediction models for colon cancer are not widely and easily available. Results showed that the performance of the model using fewer features is close to that of the model using a larger set of features recommended by an expert physician, which indicates that the first may be a good compromise between usability and performance. The purpose of such a model is to be used in Ambient Assisted Living applications, providing decision support to health care professionals.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Al-Bahrani, R., Agrawal, A., Choudhary, A.: Colon cancer survival prediction using ensemble data mining on seer data. In: 2013 IEEE International Conference on Big Data, pp. 9–16 (2013)

    Google Scholar 

  2. Bradley, A.P.: The use of the area under the ROC curve in the evaluation of machine learning algorithms. Pattern Recognition 30(7), 1145–1159 (1997)

    Article  Google Scholar 

  3. Breiman, L.: Bagging predictors. Machine Learning 24(2), 123–140 (1996)

    MathSciNet  MATH  Google Scholar 

  4. Bush, D.M., Michaelson, J.S.: Derivation: Nodes + PrognosticFactors Equation for Colon Cancer. Tech. rep. (2009)

    Google Scholar 

  5. Chang, G.J., Hu, C.Y., et al.: Practical application of a calculator for conditional survival in colon cancer. Journal of Clinical Oncology 27(35), 5938–5943 (2009)

    Article  Google Scholar 

  6. Džeroski, S., Ženko, B.: Is combining classifiers with stacking better than selecting the best one? Machine Learning 54(3), 255–273 (2004)

    Article  MATH  Google Scholar 

  7. Ferlay, J., Soerjomataram, I., Ervik, M., et al.: Globocan 2012: Estimated cancer incidence, mortality and prevalence worldwide in 2012 (2012). http://globocan.iarc.fr (last visited on December 27, 2015)

  8. Freund, Y., Schapire, R.E.: A decision-theoretic generalization of on-line learning and an application to boosting. J. Comput. Syst. Sci. 55(1), 119–139 (1997)

    Article  MathSciNet  MATH  Google Scholar 

  9. Ganganwar, V.: An overview of classification algorithms for imbalanced datasets. Int. J. Emerg. Technol. Adv. Eng 2(4), 42–47 (2012)

    Google Scholar 

  10. Kittler, J.: Combining classifiers: A theoretical framework. Pattern Analysis and Applications 1(1), 18–27 (1998)

    Article  MathSciNet  Google Scholar 

  11. Klepac, G., Klepac, G., et al.: Developing Churn Models Using Data Mining Techniques and Social Network Analysis, 1st edn. IGI Global, Hershey (2014)

    Google Scholar 

  12. National Cancer Institute: Surveillance, epidemiology and end results program (2015). http://seer.cancer.gov/data/ (last visited on October 01, 2015)

  13. Oliveira, T., Leão, P., Novais, P., Neves, J.: Webifying the computerized execution of clinical practice guidelines. In: Bajo Perez, J., Corchado, J.M., et al. (eds.) The PAAMS Collection SE - 18, Advances in Intelligent Systems and Computing, vol. 293, pp. 149–156. Springer International Publishing (2014)

    Google Scholar 

  14. RapidMiner: Rapidminer documentation: Operator reference guide (2016). http://docs.rapidminer.com/studio/operators/ (last visited on March 01, 2016)

  15. Refaeilzadeh, P., Tang, L., Liu, H.: Cross-validation. In: Liu, L., Özsu, M. (eds.) Encyclopedia of Database Systems, pp. 532–538. Springer US (2009)

    Google Scholar 

  16. Yamauchi, M., Lochhead, P., Morikawa, T., et al.: Colorectal cancer: a tale of two sides or a continuum? Gut 61(6), 794–797 (2012)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Tiago Oliveira .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing Switzerland

About this paper

Cite this paper

Silva, A., Oliveira, T., Novais, P., Neves, J., Leão, P. (2016). Developing an Individualized Survival Prediction Model for Colon Cancer. In: Lindgren, H., et al. Ambient Intelligence- Software and Applications – 7th International Symposium on Ambient Intelligence (ISAmI 2016). ISAmI 2016. Advances in Intelligent Systems and Computing, vol 476. Springer, Cham. https://doi.org/10.1007/978-3-319-40114-0_10

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-40114-0_10

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-40113-3

  • Online ISBN: 978-3-319-40114-0

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics