Skip to main content

Learning the Representation of Medical Features for Clinical Pathway Analysis

  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 10828))

Abstract

Clinical Pathway (CP) represents the best practice of treatment process management for inpatients with specific diagnosis, and a treatment process can be divided into several stages, usually in units of days. With the explosion of medical data, CP analysis is receiving increasing attention, which provides important support for CP design and optimization. However, these data-driven researches often suffer from the high complexity of medical data, so that a proper representation of medical features is necessary. Most of existing representation learning methods in healthcare domain focus on outpatient data, which get weak performance and interpretability when adopted for CP analysis. In this paper, we propose a new representation, RoMCP, which can capture both diagnosis information and temporal relations between days. The learned diagnosis embedding grasps the key factors of the disease, and each day embedding is determined by the diagnosis together with the preorder days. We evaluate RoMCP on real-world dataset with 538K inpatient visits for several typical CP analysis tasks. Our method demonstrates significant improvement on performance and interpretation.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

  1. 1.

    A patient who stays in a hospital while receiving medical care or treatment. In general, it takes up more resources and cost compared to outpatients.

  2. 2.

    Doctors usually make a prescription with multiple events together. There is no strict temporal relations between these events.

  3. 3.

    Outpatient contains several visits. It is common that the events between sequential visits are quite different, due to the different diagnosis.

  4. 4.

    Some visits may have more than one diagnosis. While for CP, we only concern the first diagnosis, which largely determines the treatment strategy.

  5. 5.

    It refers to the 10th revision of the International Statistical Classification of Diseases and Related Health Problems that listed by the World Health Organization. In our dataset, an Chinese version is used for NRCMS.

  6. 6.

     https://www.tensorflow.org/.

  7. 7.

     http://scikit-learn.org/stable/.

References

  1. Andrews, N.O., Fox, E.A.: Recent Developments in Document Clustering (2007)

    Google Scholar 

  2. Bengio, Y., Lamblin, P., Popovici, D., Larochelle, H.: Greedy layer-wise training of deep networks. In: Proceedings of NIPS, pp. 153–160 (2007)

    Google Scholar 

  3. Binder, M., et al.: On analyzing process compliance in skin cancer treatment: an experience report from the evidence-based medical compliance cluster (EBMC2). In: Ralyté, J., Franch, X., Brinkkemper, S., Wrycza, S. (eds.) CAiSE 2012. LNCS, vol. 7328, pp. 398–413. Springer, Heidelberg (2012). https://doi.org/10.1007/978-3-642-31095-9_26

    Chapter  Google Scholar 

  4. Bouarfa, L., Dankelman, J.: Workflow mining and outlier detection from clinical activity logs. J. Biomed. Inform. 45(6), 1185–1190 (2012)

    Article  Google Scholar 

  5. Caron, F., Vanthienen, J., Vanhaecht, K., Van Limbergen, E., De Weerdt, J., Baesens, B.: Monitoring care processes in the gynecologic oncology department. Comput. Biol. Med. 44, 88–96 (2014)

    Article  Google Scholar 

  6. Choi, E., Bahadori, M.T., Schuetz, A., Stewart, W.F., Sun, J.: Doctor AI: predicting clinical events via recurrent neural networks. In: Proceedings of MLHC, pp. 301–318 (2016)

    Google Scholar 

  7. Choi, E., Bahadori, M.T., Searles, E., Coffey, C., Thompson, M., Bost, J., Tejedor-Sojo, J., Sun, J.: Multi-layer representation learning for medical concepts. In: Proceedings of KDD, pp. 1495–1504. ACM (2016)

    Google Scholar 

  8. Choi, E., Bahadori, M.T., Song, L., Stewart, W.F., Sun, J.: GRAM: graph-based attention model for healthcare representation learning. In: Proceedings of KDD, pp. 787–795. ACM (2017)

    Google Scholar 

  9. Choi, E., Bahadori, M.T., Sun, J., Kulas, J., Schuetz, A., Stewart, W.: Retain: An interpretable predictive model for healthcare using reverse time attention mechanism. In: Proceedings of NIPS, pp. 3504–3512 (2016)

    Google Scholar 

  10. Choi, E., Schuetz, A., Stewart, W.F., Sun, J.: Using recurrent neural network models for early detection of heart failure onset. J. Am. Med. Inform. Assoc. 24(2), 361–370 (2016)

    Google Scholar 

  11. Choi, Y., Chiu, C.Y.I., Sontag, D.: Learning low-dimensional representations of medical concepts. In: AMIA Summits on Translational Science Proceedings 2016, p. 41 (2016)

    Google Scholar 

  12. De Vine, L., Zuccon, G., Koopman, B., Sitbon, L., Bruza, P.: Medical semantic similarity with a neural language model. In: Proceedings of CIKM, pp. 1819–1822. ACM (2014)

    Google Scholar 

  13. Harutyunyan, H., Khachatrian, H., Kale, D.C., Galstyan, A.: Multitask learning and benchmarking with clinical time series data. arXiv preprint arXiv:1703.07771 (2017)

  14. Huang, Z., Dong, W., Ji, L., Gan, C., Lu, X., Duan, H.: Discovery of clinical pathway patterns from event logs using probabilistic topic models. J. Biomed. Inform. 47, 39–57 (2014)

    Article  Google Scholar 

  15. Huang, Z., Dong, W., Ji, L., He, C., Duan, H.: Incorporating comorbidities into latent treatment pattern mining for clinical pathways. J. Biomed. Inform. 59, 227–239 (2016)

    Article  Google Scholar 

  16. Huang, Z., Lu, X., Duan, H.: Latent treatment pattern discovery for clinical processes. J. Med. Syst. 37(2), 1–10 (2013)

    Article  Google Scholar 

  17. Le, Q., Mikolov, T.: Distributed representations of sentences and documents. In: Proceedings of ICML, pp. 1188–1196 (2014)

    Google Scholar 

  18. Li, C., Hou, Y., Sun, M., Lu, J., Wang, Y., Li, X., Chang, F., Hao, M.: An evaluation of China’s new rural cooperative medical system: achievements and inadequacies from policy goals. BMC Public Health 15(1), 1079 (2015)

    Article  Google Scholar 

  19. Lipton, Z.C., Kale, D.C., Elkan, C., Wetzell, R.: Learning to diagnose with LSTM recurrent neural networks. arXiv preprint arXiv:1511.03677 (2015)

  20. Ma, F., Chitta, R., Zhou, J., You, Q., Sun, T., Gao, J.: Dipole: diagnosis prediction in healthcare via attention-based bidirectional recurrent neural networks. In: Proceedings of KDD, pp. 1903–1911. ACM (2017)

    Google Scholar 

  21. Mans, R.S., Schonenberg, M.H., Song, M., van der Aalst, W.M.P., Bakker, P.J.M.: Application of process mining in healthcare – a case study in a Dutch hospital. In: Fred, A., Filipe, J., Gamboa, H. (eds.) BIOSTEC 2008. CCIS, vol. 25, pp. 425–438. Springer, Heidelberg (2008). https://doi.org/10.1007/978-3-540-92219-3_32

    Chapter  Google Scholar 

  22. Mikolov, T., Chen, K., Corrado, G., Dean, J.: Efficient estimation of word representations in vector space. arXiv preprint arXiv:1301.3781 (2013)

  23. Mikolov, T., Sutskever, I., Chen, K., Corrado, G.S., Dean, J.: Distributed representations of words and phrases and their compositionality. In: Proceedings of NIPS, pp. 3111–3119 (2013)

    Google Scholar 

  24. Nguyen, P., Tran, T., Wickramasinghe, N., Venkatesh, S.: Deepr: a convolutional net for medical records. J. Biomed. Health Inf. 21(1), 22–30 (2017)

    Article  Google Scholar 

  25. Pham, T., Tran, T., Phung, D., Venkatesh, S.: DeepCare: a deep dynamic memory model for predictive medicine. In: Bailey, J., Khan, L., Washio, T., Dobbie, G., Huang, J.Z., Wang, R. (eds.) PAKDD 2016. LNCS (LNAI), vol. 9652, pp. 30–41. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-31750-2_3

    Chapter  Google Scholar 

  26. Poelmans, J., Dedene, G., Verheyden, G., Van der Mussele, H., Viaene, S., Peters, E.: Combining business process and data discovery techniques for analyzing and improving integrated care pathways. In: Perner, P. (ed.) ICDM 2010. LNCS (LNAI), vol. 6171, pp. 505–517. Springer, Heidelberg (2010). https://doi.org/10.1007/978-3-642-14400-4_39

    Chapter  Google Scholar 

  27. Prodel, M., Augusto, V., Xie, X., Jouaneton, B., Lamarsalle, L.: Discovery of patient pathways from a national hospital database using process mining and integer linear programming. In: T-ASE, pp. 1409–1414. IEEE (2015)

    Google Scholar 

  28. Rojas, E., Munoz-Gama, J., Sepúlveda, M., Capurro, D.: Process mining in healthcare: a literature review. J. Biomed. Inform. 61, 224–236 (2016)

    Article  Google Scholar 

  29. Rovani, M., Maggi, F.M., de Leoni, M., van der Aalst, W.M.: Declarative process mining in healthcare. Expert Syst. Appl. 42(23), 9236–9251 (2015)

    Article  Google Scholar 

  30. Xu, X., Jin, T., Wang, J.: Summarizing patient daily activities for clinical pathway mining. In: Proceedings of Healthcom, pp. 1–6. IEEE (2016)

    Google Scholar 

  31. Xu, X., Jin, T., Wei, Z., Lv, C., Wang, J.: TCPM: topic-based clinical pathway mining. In: Proceedings of CHASE, pp. 292–301. IEEE (2016)

    Google Scholar 

  32. Xu, X., Jin, T., Wei, Z., Wang, J.: Incorporating domain knowledge into clinical goal discovering for clinical pathway mining. In: Proceedings of BHI, pp. 261–264. IEEE (2017)

    Google Scholar 

  33. Zeiler, M.D.: ADADELTA: an adaptive learning rate method. arXiv preprint arXiv:1212.5701 (2012)

  34. Zhu, Z., Yin, C., Qian, B., Cheng, Y., Wei, J., Wang, F.: Measuring patient similarities via a deep architecture with medical concept embedding. In: Proceedings of ICDM, pp. 749–758. IEEE (2016)

    Google Scholar 

Download references

Acknowledgments

This work was supported by The National Key Technology R&D Program (No. 2015BAH14F02), and Project 61325008 (Mining and Management of Large Scale Process Data) supported by NSFC.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Tao Jin .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer International Publishing AG, part of Springer Nature

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Xu, X., Wang, Y., Jin, T., Wang, J. (2018). Learning the Representation of Medical Features for Clinical Pathway Analysis. In: Pei, J., Manolopoulos, Y., Sadiq, S., Li, J. (eds) Database Systems for Advanced Applications. DASFAA 2018. Lecture Notes in Computer Science(), vol 10828. Springer, Cham. https://doi.org/10.1007/978-3-319-91458-9_3

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-91458-9_3

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-91457-2

  • Online ISBN: 978-3-319-91458-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics