Similarity Measure for Patients via A Siamese CNN Network

  • Fangyuan Zhao
  • Jianliang XuEmail author
  • Yong Lin
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 11335)


In the medical health field, assessing the similarities between patients is a basic task. A suitable patient similarity measurement has a very wide range of applications. For example, patient group identification, comparative study of treatment methods, etc. The electronic health records (EHRs) contain rich personal information of patient, which is hierarchical, longitudinal, and sparse. Although there have been some studies aimed at learning the similarities of patients from EHRs to solve real medical problems, there still exist some problems. Many works lack of effective patient representation. In addition, most of the research works are limited to one or more specific diseases. However, in fact, many diseases accompany with other diseases. In this case, the similarity of patients with multiple diseases are ignored. In this paper, we designed a siamese CNN network structure to learn patient expression while effectively measure the similarity between patient pairs. The experimental results show the effectiveness of this method.


  1. 1.
    Wang, L., Zhang, Y., Feng, J.: On the Euclidean Distance of Images. IEEE Computer Society, Washington (2005)Google Scholar
  2. 2.
    Chan, L., Chan, T., Cheng, L., Mak, W.: Machine learning of patient similarity: a case study on predicting survival in cancer patient after locoregional chemotherapy. In: 2010 IEEE International Conference on Bioinformatics and Biomedicine Workshops (BIBMW), pp. 467–470. IEEE (2010)Google Scholar
  3. 3.
    Wang, F., Hu, J., Sun, J.: Medical prognosis based on patient similarity and expert feedback. In: 2012 21st International Conference on Pattern Recognition (ICPR), pp. 1799–1802. IEEE (2012)Google Scholar
  4. 4.
    Henao, R., Murray, J., Ginsburg, G., Carin, L., Lucas, J.E.: Supplementary Material to Patient Clustering With Uncoded Text in Electronic Medical Records (2012)Google Scholar
  5. 5.
    Sewitch, M.J., Leffondre, K., Dobkin, P.L.: Clustering patients according to health perceptions: relationships to psychosocial characteristics and medication nonadherence. J. Psychosom. Res. 56(3), 323–332 (2004)CrossRefGoogle Scholar
  6. 6.
    Suo, Q., et al.: Personalized disease prediction using a CNN-based similarity learning method. In: 2017 IEEE International Conference on Bioinformatics and Biomedicine (BIBM) (2017)Google Scholar
  7. 7.
    Zhu, Z.: Measuring patient similarities via a deep architecture with medical concept embedding. In: 2016 IEEE 16th International Conference on Data Mining (ICDM), pp. 749–758. IEEE (2016)Google Scholar
  8. 8.
    Mikolov, T., Sutskever, I., Chen, K., Corrado, G.S., Dean, J.: Distributed representations of words and phrases and their compositionality. In: Advances in Neural Information Processing Systems, pp. 3111–3119 (2013)Google Scholar
  9. 9.
    Funk, S.: RMSprop Loses to SMORMS3-Beware The Epsilon! (2015)Google Scholar
  10. 10.
    Khan, M.E., Liu, Z., Tangkaratt, V., Gal, Y.: Vprop: Variational inference using RMSprop, arXiv preprint arXiv:1712.01038 (2017)
  11. 11.
    China conference on knowledge graph and semantic computing.
  12. 12.
    Matthews correlation coefficient accuracy.
  13. 13.
    Baldi, P., Brunak, S., Chauvin, Y., Andersen, C.A., Nielsen, H.: Assessing the accuracy of prediction algorithms for classification: an overview. Bioinformatics 16(5), 412–424 (2000)CrossRefGoogle Scholar

Copyright information

© Springer Nature Switzerland AG 2018

Authors and Affiliations

  1. 1.College of Information Science and Engineering Ocean University of ChinaQingdaoChina
  2. 2.Weifang Public Security BureauWeifangChina
  3. 3.Weifang Power Supply CompanyWeifangChina

Personalised recommendations