Feature-maximum-dependency-based fusion diagnosis method for COPD

  • Youli Fang
  • Hong WangEmail author
  • Lutong Wang
  • Ruitong Di
  • Yongqiang Song


Chronic Obstructive Pulmonary Disease (COPD) is a chronic lung disease that causes a progressive decline in respiratory function. COPD has become the fourth most lethal disease in the world, and worldwide deaths continue to become more common as a result of COPD. Therefore, it is important to help doctors diagnose COPD more accurately using big data analytics and effective algorithms. In the past, COPD was mainly studied as follows: applying data to determine the impact of a single feature on the disease, such as the effect of FEV1/FVC (forced expiratory volume in the first second/forced vital capacity), and analyzing a case with simple models, such as logistic regression or a support vector machine. Therefore, there are obviously deficiencies in previous studies. First, the impacts of multi-dimensional features on COPD have not been considered comprehensively. Second, there is no fusion of multiple study methods on the diagnosis and prognosis of COPD. Thus, this paper presents a feature-maximum-dependency-based fusion diagnosis method for COPD. First, the MDF-RS (feature maximum dependency-rough set) algorithm is proposed to extract the optimal combination of multi-dimensional features. Second, the integrated model DSA-SVM (direct search simulated annealing-support vector machine) is presented to classify the disease. Finally, the proposed method is experimentally tested. The results show that the algorithms outperform other classic methods.


COPD Feature-maximum-dependency Multi-dimensional feature Integrated learning 



The work is partially supported by the National Natural Science Foundation of China (Nos. 61672329, 61373149, 61472233, 61572300, 81273704), Shandong Province Science and Technology Plan Supported Project (No. 2014GGX101026) and Taishan Scholar Fund of Shandong Province (No. TSHW201502038, 20110819). We also gratefully acknowledge the support of NVIDIA Corporation with the donation of the TITAN X GPU used for this research.


  1. 1.
    Ali MM, Törn A, Viitanen S (2002) A direct search variant of the simulated annealing algorithm for optimization involving continuous variables. Comput Oper Res 29(1):87–102MathSciNetCrossRefGoogle Scholar
  2. 2.
    Bleeker SE, Moll HA, Steyerberg EW (2013) External validation is necessary in prediction research: a clinical example. J Clin Epidemiol 56(9):826–832CrossRefGoogle Scholar
  3. 3.
    Celik MU, Sharma G, Tekalp AM (2006) Lossless watermarking for image authentication: a new framework and an implementation. IEEE Transactions on Image Processing 15(4):1042–1049CrossRefGoogle Scholar
  4. 4.
    Chen Y, Miao D, Wang R (2017) A rough set approach to feature selection based on ant colony optimization. Pattern Recogn Lett 31(3):226–233CrossRefGoogle Scholar
  5. 5.
    Cheung N (2017) Machine learning techniques for medical analysis. N Engl J Med 26(4):126–132Google Scholar
  6. 6.
    Cui JS, Liu Y, Xu YD (2013) Tracking generic human motion via fusion of low- and high-dimensional approaches. IEEE Trans Syst Man Cybern Syst Hum 43(4):996–1002CrossRefGoogle Scholar
  7. 7.
    Du ZL, Li XM, Xi LP (2015) Multi-class probabilistic extreme learning machine and its application in remaining useful life prediction. J Syst Eng Electron 37(12):2777–2784zbMATHGoogle Scholar
  8. 8.
    Gao C, Liu J, Feng Q (2016) People-flow counting in complex environments by combining depth and color information. Multimed Tools Appl 75(15):9315–9331CrossRefGoogle Scholar
  9. 9.
    Guo HM, Du J, Huang LF (2017) Application of ARIMA model based on R language in predicting incidence of patients with acute exacerbation of chronic obstructive pulmonary disease. Chinese Journal of Health Statistics 34(2):288–289Google Scholar
  10. 10.
    Himes BE, Dai Y, Kohane IS, Weiss ST, Ramoni MF (2009) Prediction of chronic obstructive pulmonary disease (copd) in asthma patients using electronic medical records. J Am Med Inform Assoc 16(3):371–379CrossRefGoogle Scholar
  11. 11.
    Hoogendoorn M, Feenstra TL, Boland M, Briggs AH, Borg S, Jansson SA (2017) Prediction models for exacerbations in different copd patient populations: comparing results of five large data sources. Int J Chron Obstruct Pulmon Dis 12(12):3183–3194CrossRefGoogle Scholar
  12. 12.
    Kaneiwa K (2011) A rough set approach to multiple dataset analysis. 11(3):204–215. Elsevier Science PublishersGoogle Scholar
  13. 13.
    Kaya Y, Uyar M (2013) A hybrid decision support system based on rough set and extreme learning machine for diagnosis of hepatitis disease. Appl Soft Comput 13(8):3429–3438CrossRefGoogle Scholar
  14. 14.
    Li CQ, Tao YX (2017) Application of support vector machine with simulated annealing algorithm in mbr membrane pollution prediction. In: IEEE International Conference on Software Engineering Research, Management and Applications 34(5):211–217Google Scholar
  15. 15.
    Liu Y, Liang Y, Liu S (2016) Predicting urban water quality with ubiquitous data. J Clin Epidemiol 13(2):112–124MathSciNetGoogle Scholar
  16. 16.
    Liu Y, Zheng Y, Liang S (2016) Urban water quality pre diction based on multi-task multi-view learning. Proceedings of the 25th International Joint Conference on Artificial Intelligence 3(4):251–265Google Scholar
  17. 17.
    Liu Y, Nie L, Rosenblum DS (2016) From action to activity: sensor based activity recognition. Neurocomputing 181(2):108–115CrossRefGoogle Scholar
  18. 18.
    Lópezcampos JL, Tan W, Soriano JB (2016) Global burden of COPD. Respirology 21(1):14–23CrossRefGoogle Scholar
  19. 19.
    Mega JL, Braunwald E, Wiviott SD et al (2012) Rivaroxaban in patients with a recent acute coronary syndrome. N Engl J Med 336(1):19–31Google Scholar
  20. 20.
    Moons K, Royston P, Vergouwe Y (2009) Prognosis and prognostic research: what, why, and how. BMJ 338(7):375–382CrossRefGoogle Scholar
  21. 21.
    Moons K, Kengne A, Grobbee D (2013) Risk prediction models: II. External validation, model updating, and impact assessment. Heart 98(9):691–702CrossRefGoogle Scholar
  22. 22.
    Rui L, Hong W, Xiaomei Y (2018) Shared-nearest-neighbor-based clustering by fast search and find of density peaks. Inf Sci 450(6):200–226MathSciNetGoogle Scholar
  23. 23.
    Sartakhti JS, Zangooei MH, Mozafari K (2012) Hepatitis disease diagnosis using a novel hybrid method based on support vector machine and simulated annealing (SVM-SA). Comput Methods Prog Biomed 108(2):570–582CrossRefGoogle Scholar
  24. 24.
    Steyerberg EW (2011) Clinical prediction models. Acta Anaesthesiol Scand 39(105):134–135Google Scholar
  25. 25.
    Steyerberg EW, Moons K, Windt D (2013) Prognosis research strategy (PROGRESS): prognostic model research. PLoS Med 10(2):1001–1010CrossRefGoogle Scholar
  26. 26.
    Thangavel K, Pethalakshmi A (2009) Dimensionality reduction based on rough set theory: a review. Appl Soft Comput 9(1):1–12CrossRefGoogle Scholar
  27. 27.
    Zhang YP, Gao WC, Zhang B (2017) SA algorithm based integrated fault detection method for insulated track circuits. Railw J 39(4):68–72Google Scholar

Copyright information

© Springer Science+Business Media, LLC, part of Springer Nature 2019

Authors and Affiliations

  1. 1.School of Information Science and EngineeringShandong Normal UniversityJinanChina
  2. 2.Shandong Provincial Key Laboratory for Distributed Computer Software Novel TechnologyJinanChina

Personalised recommendations