Collaborative Prediction Model of Disease Risk by Mining Electronic Health Records
Patient Electronic Health Records (EHR) is one of the major carriers for conducting preventative medicine research. However, the heterogeneous and longitudinal properties make EHRs analysis an inherently challenge. To address this issue, this paper proposes CAPM, a Collaborative Assessment Prediction Model based on patient temporal graph representation, which relies only on a patient EHRs using ICD-10 codes to predict future disease risks. Firstly, we develop a temporal graph for each patient EHRs. Secondly, CAPM uses hybrid collaborative filtering approach to predict each patient’s greatest disease risks based on their own medical history and that of similar patients. Moreover, we also calculate the onset risk with the corresponding diseases in order to take action at the earliest signs. Finally, we present experimental results on a real world EHR dataset, demonstrating that CAPM performs well at capturing future disease and its onset risks.
KeywordsElectronic Health Records Temporal graph Collaborative prediction Disease risk profile
This work is partially supported by NSFC No. 61303005, 61572295; the Innovation Method Fund of China No. 2015IM010200; SDNSFC No. ZR2014FM031; the Science and Technology Development Plan Project of Shandong Province No. 2014GGX101019, 2015GGX101007, 2015GGX 101015; the Shandong Province Independent Innovation Major Special Project No. 2015ZDJQ010 02, 2015ZDXX0201B03; the Fundamental Research Funds of Shandong University No. 2014JC025, 2015JC031.
- 1.Laura, B.M.: Data-Driven Healthcare: How Analytics and BI are Transforming the Industry. Wiley (2014)Google Scholar
- 3.Davis, D.A., Chawla, N.V.: Predicting individual disease risk based on medical history. In: Information and Knowledge Management, pp. 769–778 (2008)Google Scholar
- 4.Dentino, B., Davis, D., Chawla, N.V.: HealthCareND: leveraging EHR and ARE for prospective healthcare. In: Health Informatics Symposium, pp. 841–844 (2010)Google Scholar
- 5.Liu, C., Zhang, K., Xiong, H., Jiang, G., Yang, Q.: Temporal skeletonization on sequential data: patterns, categorization, and visualization. In: KDD, pp. 211–223 (2014)Google Scholar
- 6.Ji, X., Chun, S.A., Geller, Z., Oria, V.: Collaborative and trajectory prediction models of medical conditions by mining patients’ Social Data. In: BIBM, pp. 695–700 (2015)Google Scholar
- 7.Zhou, J.Y., Wang, F., Hu, J.Y., Ye, J.P.: From micro to macro: Data driven phenotyping by densification of longitudinal electronic medical records. In: SIGKDD, pp. 135–144 (2014)Google Scholar
- 8.Ooi, B.C., Tan, K.-L., Tran, Q. T., Yip, J.W.L., Chen, G., Ling,Z.J., Nguyen, T., Tung, A.K.H., Zhang, M.: Contextual crowd intelligence. In: SIGKDD, pp. 39–46 (2014)Google Scholar
- 11.Xia, P., Liu, B., Sun, Y., Chen, C.: Reciprocal recommendation system for online dating. Soc. Netw. Anal. Mining. 9, 234–241 (2015)Google Scholar
- 13.Sun, J., Wang, F., Hu, J., Edabollahi, S.: Supervised patient similarity measure of heterogeneous patient records. In: SIGKDD, pp. 16–24 (2012)Google Scholar