Discovering Commonly Shared Semantic Concepts of Eligibility Criteria for Learning Clinical Trial Design
Discovering commonly shared semantic concepts of eligibility criteria can facilitate hospitals in recruiting appropriate target population and empower patients with more effective result ranking of concept-based search, as well as assist researchers in understanding clinical trial design. This study aims to identify commonly shared semantic concepts of eligibility criteria through the identification of eligibility criteria concepts for each disease. An automated approach for extracting semantic concepts from eligibility criteria texts is proposed. For each disease, commonly shared semantic concepts are determined for reviewing the commonly shared concepts of clinical trials. Our experiment dataset are 145,745 clinical trials associated with 5,488 different types of diseases on ClinicalTrials.gov. 5,508,491 semantic concepts are extracted with 459,936 being unique. We further analyze its application on assisting researchers in learning disease-specific clinical trial design.
KeywordsSemantic concepts Eligibility criteria Learning
This work was supported by National Natural Science Foundation of China (grant No. 61403088).
- 1.Ross, J., Tu, S., Carini, S., Sim, I.: Analysis of eligibility criteria complexity in clinical trials. In: Proceedings of AMIA Summits on Translational Science 2010, pp. 46-50 (2010)Google Scholar
- 3.Campbell, M.K., Snowdon, C., Francis, D., et al.: Recruitment to randomised trials: strategies for trial enrollment and participation study. The STEPS study. Health Technol. Assess. 11(48), iii, ix–105 (2007)Google Scholar
- 6.Milian, K., Bucur, A., Teije A.T.: Formalization of clinical trial eligibility criteria: Evaluation of a pattern-based approach. In: IEEE International Conference on Bioinformatics and Biomedicine, pp. 1–4 (2012)Google Scholar
- 9.National institute of health - NINDS Common Data Elements. http://www.ninds.nih.gov/research/clinical_research/toolkit/common_data_elements.htm. Accessed 2015
- 10.NINDS. http://www.commondataelements.ninds.nih.gov/projreview.aspx. Accessed 2015
- 12.NerveCenter: NINDS common data element project: a long-awaited breakthrough in streamlining trials. Ann. Neurol. 68(1), A11–A13 (2010)Google Scholar
- 17.UMLS - Unified Medical Language System. http://www.nlm.nih.gov/research/umls/. Accessed 2015
- 22.Extracting key phrases with NLTK in Python, GitHub Gist. https://gist.github.com/alexbowe/879414. Accessed 2015
- 23.The Stanford Parser: A statistical parser. http://nlp.stanford.edu/software/lex-parser.shtml. Accessed 2015