Prediction of Two Year Survival Among Patients of Non-small Cell Lung Cancer

  • Yash DagliEmail author
  • Saumya Choksi
  • Sudipta Roy
Conference paper
Part of the Lecture Notes in Computational Vision and Biomechanics book series (LNCVB, volume 31)


Lung cancer ranks as second most prevalent type of cancer. Still predictions for survival of lung cancer patients are not accurate. In this research, we try to create a prediction model, with the help of machine learning to accurately predict the survival of non-small cell lung cancer patients (NSCLC). Clinical data of 559 patients was taken for training and testing of models. We have developed multilevel perceptron model for survival prediction. Other models developed during this study were compared to measure performance of our model. Attributes that are found to be useful as biomarkers for prediction of survival analysis of NSCLC have also been computed and ranked accordingly for increase in accuracy of prediction model by implementing feature selection method. The final model included T stage, N stage, Modality, World Health Organization Performance status, Cumulative Total Tumor dose, tumor load, Overall treatment time as the variables. Two year survival was chosen as the prediction outcome. Neural Network was found as the best prediction model with area under Curve (AUC) of 0.75. By far to our knowledge Multilevel Neural Network is found to be the best model for predicting two-year survival among patients of non-small cell lung cancer.


Multilevel neural network Non-small cell lung cancer Machine learning Feature selection ReliefF Survival prediction 


  1. 1.
    Siegel RL, Miller KD, Jemal A (2018) Cancer statistics. CA Cancer J Clin 68(1):7–30Google Scholar
  2. 2.
    Clément-Duchêne C, Carnin C, Guillemin F, Martinet Y (2010) How accurate are physicians in the prediction of patient survival in advanced lung cancer? Oncol Express 15(7):782–789CrossRefGoogle Scholar
  3. 3.
    Jochems Arthur et al (2017) Developing and Validating a Survival Prediction Model for NSCLC Patients Through Distributed Learning Across 3 Countries. Int J Radiat Oncol *Biol* Phys 99(2):344–352CrossRefGoogle Scholar
  4. 4.
    Mei X (2017) Predicting five-year overall survival in patients with non-small cell lung cancer by reliefF algorithm and random forests. In: 2017 IEEE 2nd advanced information technology, electronic and automation control conference (IAEAC)Google Scholar
  5. 5.
  6. 6.
    Devi S et al (2015) Study of data cleaning and comparision of data cleaning tools. Int J Comput Sci Mob Comput 4(3):360–370Google Scholar
  7. 7.
    Li X, Shi Y, Li J, Zhang P (2007) Data mining consulting improve data quality. Data Sci J 6Google Scholar
  8. 8.
    Wang Z et al (2016) Application of ReliefF algorithm to selecting feature sets for classification of high resolution remote sensing image. In: 2016 IEEE international geoscience and remote sensing symposium (IGARSS)Google Scholar
  9. 9.
    Ladha L (2011) Feature selection methods and algorithms. Int J Comput Sci Eng 3Google Scholar
  10. 10.
    Bolón-Canedo V, Sánchez-Maroño N, Alonso-Betanzos A (2013) A review of feature selection methods on synthetic data. Knowl Inf Syst 34CrossRefGoogle Scholar
  11. 11.
    Lin G et al (2012) A support vector machine-recursive feature elimination feature selection method based on artificial contrast variables and mutual information. J Chromatogr B Anal. Technol Biomed Life SciGoogle Scholar
  12. 12.
    Yuan LCJ, GX, HCH (2012) Recent advances of large-scale linear classification. In: Proceedings of the IEEEGoogle Scholar
  13. 13.
    Dehing-Oberije C et al (2009) Development and external validation of prognostic model for 2-year survival of non-small-cell lung cancer patients treated with chemotherapy. Int J Radiat Oncol Biol Phys 74:355–362CrossRefGoogle Scholar
  14. 14.
    Jayasurya K et al (2010) Comparison of Bayesian network and support vector machine models for two-year survival prediction in lung cancer patients treated with radiotherapy. Med Phys 37:1401–1407CrossRefGoogle Scholar

Copyright information

© Springer Nature Switzerland AG 2019

Authors and Affiliations

  1. 1.Department of Computer Science and EngineeringU.V. Patel College of Engineering, Ganpat UniversityKherva, MehsanaIndia
  2. 2.Department of Computer Science and EngineeringInstitute of Computer Technology, Ganpat UniversityKherva, MehsanaIndia
  3. 3.Department of Computer Science and EngineeringCalcutta University Technology CampusSalt Lake City, KolkataIndia

Personalised recommendations