Learning the Representation of Medical Features for Clinical Pathway Analysis

Xu, Xiao; Wang, Ying; Jin, Tao; Wang, Jianmin

doi:10.1007/978-3-319-91458-9_3

Learning the Representation of Medical Features for Clinical Pathway Analysis

Xiao Xu²⁴,
Ying Wang²⁴,
Tao Jin²⁴ &
…
Jianmin Wang²⁴

Conference paper
First Online: 12 May 2018

3774 Accesses
3 Citations

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 10828))

Abstract

Clinical Pathway (CP) represents the best practice of treatment process management for inpatients with specific diagnosis, and a treatment process can be divided into several stages, usually in units of days. With the explosion of medical data, CP analysis is receiving increasing attention, which provides important support for CP design and optimization. However, these data-driven researches often suffer from the high complexity of medical data, so that a proper representation of medical features is necessary. Most of existing representation learning methods in healthcare domain focus on outpatient data, which get weak performance and interpretability when adopted for CP analysis. In this paper, we propose a new representation, RoMCP, which can capture both diagnosis information and temporal relations between days. The learned diagnosis embedding grasps the key factors of the disease, and each day embedding is determined by the diagnosis together with the preorder days. We evaluate RoMCP on real-world dataset with 538K inpatient visits for several typical CP analysis tasks. Our method demonstrates significant improvement on performance and interpretation.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

1.
A patient who stays in a hospital while receiving medical care or treatment. In general, it takes up more resources and cost compared to outpatients.
2.
Doctors usually make a prescription with multiple events together. There is no strict temporal relations between these events.
3.
Outpatient contains several visits. It is common that the events between sequential visits are quite different, due to the different diagnosis.
4.
Some visits may have more than one diagnosis. While for CP, we only concern the first diagnosis, which largely determines the treatment strategy.
5.
It refers to the 10th revision of the International Statistical Classification of Diseases and Related Health Problems that listed by the World Health Organization. In our dataset, an Chinese version is used for NRCMS.
6.
https://www.tensorflow.org/.
7.
http://scikit-learn.org/stable/.

References

Andrews, N.O., Fox, E.A.: Recent Developments in Document Clustering (2007)
Google Scholar
Bengio, Y., Lamblin, P., Popovici, D., Larochelle, H.: Greedy layer-wise training of deep networks. In: Proceedings of NIPS, pp. 153–160 (2007)
Google Scholar
Binder, M., et al.: On analyzing process compliance in skin cancer treatment: an experience report from the evidence-based medical compliance cluster (EBMC²). In: Ralyté, J., Franch, X., Brinkkemper, S., Wrycza, S. (eds.) CAiSE 2012. LNCS, vol. 7328, pp. 398–413. Springer, Heidelberg (2012). https://doi.org/10.1007/978-3-642-31095-9_26
Chapter Google Scholar
Bouarfa, L., Dankelman, J.: Workflow mining and outlier detection from clinical activity logs. J. Biomed. Inform. 45(6), 1185–1190 (2012)
Article Google Scholar
Caron, F., Vanthienen, J., Vanhaecht, K., Van Limbergen, E., De Weerdt, J., Baesens, B.: Monitoring care processes in the gynecologic oncology department. Comput. Biol. Med. 44, 88–96 (2014)
Article Google Scholar
Choi, E., Bahadori, M.T., Schuetz, A., Stewart, W.F., Sun, J.: Doctor AI: predicting clinical events via recurrent neural networks. In: Proceedings of MLHC, pp. 301–318 (2016)
Google Scholar
Choi, E., Bahadori, M.T., Searles, E., Coffey, C., Thompson, M., Bost, J., Tejedor-Sojo, J., Sun, J.: Multi-layer representation learning for medical concepts. In: Proceedings of KDD, pp. 1495–1504. ACM (2016)
Google Scholar
Choi, E., Bahadori, M.T., Song, L., Stewart, W.F., Sun, J.: GRAM: graph-based attention model for healthcare representation learning. In: Proceedings of KDD, pp. 787–795. ACM (2017)
Google Scholar
Choi, E., Bahadori, M.T., Sun, J., Kulas, J., Schuetz, A., Stewart, W.: Retain: An interpretable predictive model for healthcare using reverse time attention mechanism. In: Proceedings of NIPS, pp. 3504–3512 (2016)
Google Scholar
Choi, E., Schuetz, A., Stewart, W.F., Sun, J.: Using recurrent neural network models for early detection of heart failure onset. J. Am. Med. Inform. Assoc. 24(2), 361–370 (2016)
Google Scholar
Choi, Y., Chiu, C.Y.I., Sontag, D.: Learning low-dimensional representations of medical concepts. In: AMIA Summits on Translational Science Proceedings 2016, p. 41 (2016)
Google Scholar
De Vine, L., Zuccon, G., Koopman, B., Sitbon, L., Bruza, P.: Medical semantic similarity with a neural language model. In: Proceedings of CIKM, pp. 1819–1822. ACM (2014)
Google Scholar
Harutyunyan, H., Khachatrian, H., Kale, D.C., Galstyan, A.: Multitask learning and benchmarking with clinical time series data. arXiv preprint arXiv:1703.07771 (2017)
Huang, Z., Dong, W., Ji, L., Gan, C., Lu, X., Duan, H.: Discovery of clinical pathway patterns from event logs using probabilistic topic models. J. Biomed. Inform. 47, 39–57 (2014)
Article Google Scholar
Huang, Z., Dong, W., Ji, L., He, C., Duan, H.: Incorporating comorbidities into latent treatment pattern mining for clinical pathways. J. Biomed. Inform. 59, 227–239 (2016)
Article Google Scholar
Huang, Z., Lu, X., Duan, H.: Latent treatment pattern discovery for clinical processes. J. Med. Syst. 37(2), 1–10 (2013)
Article Google Scholar
Le, Q., Mikolov, T.: Distributed representations of sentences and documents. In: Proceedings of ICML, pp. 1188–1196 (2014)
Google Scholar
Li, C., Hou, Y., Sun, M., Lu, J., Wang, Y., Li, X., Chang, F., Hao, M.: An evaluation of China’s new rural cooperative medical system: achievements and inadequacies from policy goals. BMC Public Health 15(1), 1079 (2015)
Article Google Scholar
Lipton, Z.C., Kale, D.C., Elkan, C., Wetzell, R.: Learning to diagnose with LSTM recurrent neural networks. arXiv preprint arXiv:1511.03677 (2015)
Ma, F., Chitta, R., Zhou, J., You, Q., Sun, T., Gao, J.: Dipole: diagnosis prediction in healthcare via attention-based bidirectional recurrent neural networks. In: Proceedings of KDD, pp. 1903–1911. ACM (2017)
Google Scholar
Mans, R.S., Schonenberg, M.H., Song, M., van der Aalst, W.M.P., Bakker, P.J.M.: Application of process mining in healthcare – a case study in a Dutch hospital. In: Fred, A., Filipe, J., Gamboa, H. (eds.) BIOSTEC 2008. CCIS, vol. 25, pp. 425–438. Springer, Heidelberg (2008). https://doi.org/10.1007/978-3-540-92219-3_32
Chapter Google Scholar
Mikolov, T., Chen, K., Corrado, G., Dean, J.: Efficient estimation of word representations in vector space. arXiv preprint arXiv:1301.3781 (2013)
Mikolov, T., Sutskever, I., Chen, K., Corrado, G.S., Dean, J.: Distributed representations of words and phrases and their compositionality. In: Proceedings of NIPS, pp. 3111–3119 (2013)
Google Scholar
Nguyen, P., Tran, T., Wickramasinghe, N., Venkatesh, S.: Deepr: a convolutional net for medical records. J. Biomed. Health Inf. 21(1), 22–30 (2017)
Article Google Scholar
Pham, T., Tran, T., Phung, D., Venkatesh, S.: DeepCare: a deep dynamic memory model for predictive medicine. In: Bailey, J., Khan, L., Washio, T., Dobbie, G., Huang, J.Z., Wang, R. (eds.) PAKDD 2016. LNCS (LNAI), vol. 9652, pp. 30–41. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-31750-2_3
Chapter Google Scholar
Poelmans, J., Dedene, G., Verheyden, G., Van der Mussele, H., Viaene, S., Peters, E.: Combining business process and data discovery techniques for analyzing and improving integrated care pathways. In: Perner, P. (ed.) ICDM 2010. LNCS (LNAI), vol. 6171, pp. 505–517. Springer, Heidelberg (2010). https://doi.org/10.1007/978-3-642-14400-4_39
Chapter Google Scholar
Prodel, M., Augusto, V., Xie, X., Jouaneton, B., Lamarsalle, L.: Discovery of patient pathways from a national hospital database using process mining and integer linear programming. In: T-ASE, pp. 1409–1414. IEEE (2015)
Google Scholar
Rojas, E., Munoz-Gama, J., Sepúlveda, M., Capurro, D.: Process mining in healthcare: a literature review. J. Biomed. Inform. 61, 224–236 (2016)
Article Google Scholar
Rovani, M., Maggi, F.M., de Leoni, M., van der Aalst, W.M.: Declarative process mining in healthcare. Expert Syst. Appl. 42(23), 9236–9251 (2015)
Article Google Scholar
Xu, X., Jin, T., Wang, J.: Summarizing patient daily activities for clinical pathway mining. In: Proceedings of Healthcom, pp. 1–6. IEEE (2016)
Google Scholar
Xu, X., Jin, T., Wei, Z., Lv, C., Wang, J.: TCPM: topic-based clinical pathway mining. In: Proceedings of CHASE, pp. 292–301. IEEE (2016)
Google Scholar
Xu, X., Jin, T., Wei, Z., Wang, J.: Incorporating domain knowledge into clinical goal discovering for clinical pathway mining. In: Proceedings of BHI, pp. 261–264. IEEE (2017)
Google Scholar
Zeiler, M.D.: ADADELTA: an adaptive learning rate method. arXiv preprint arXiv:1212.5701 (2012)
Zhu, Z., Yin, C., Qian, B., Cheng, Y., Wei, J., Wang, F.: Measuring patient similarities via a deep architecture with medical concept embedding. In: Proceedings of ICDM, pp. 749–758. IEEE (2016)
Google Scholar

Download references

Acknowledgments

This work was supported by The National Key Technology R&D Program (No. 2015BAH14F02), and Project 61325008 (Mining and Management of Large Scale Process Data) supported by NSFC.

Author information

Authors and Affiliations

School of Software, Tsinghua University, Beijing, 100084, China
Xiao Xu, Ying Wang, Tao Jin & Jianmin Wang

Authors

Xiao Xu
View author publications
You can also search for this author in PubMed Google Scholar
Ying Wang
View author publications
You can also search for this author in PubMed Google Scholar
Tao Jin
View author publications
You can also search for this author in PubMed Google Scholar
Jianmin Wang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Tao Jin .

Editor information

Editors and Affiliations

Simon Fraser University, Burnaby, BC, Canada
Jian Pei
Aristotle University of Thessaloniki, Thessaloniki, Greece
Yannis Manolopoulos
University of Queensland, Brisbane, QLD, Australia
Shazia Sadiq
University of Western Australia, Crawley, WA, Australia
Jianxin Li

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Xu, X., Wang, Y., Jin, T., Wang, J. (2018). Learning the Representation of Medical Features for Clinical Pathway Analysis. In: Pei, J., Manolopoulos, Y., Sadiq, S., Li, J. (eds) Database Systems for Advanced Applications. DASFAA 2018. Lecture Notes in Computer Science(), vol 10828. Springer, Cham. https://doi.org/10.1007/978-3-319-91458-9_3

Download citation

DOI: https://doi.org/10.1007/978-3-319-91458-9_3
Published: 12 May 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-91457-2
Online ISBN: 978-3-319-91458-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics