Abstract
In the clinical setting, the continuum of care depends on integrated information services to ensure a smooth progression of patient-centered care, and these services must take account of past events and personal circumstances to make care relevant. Clinicians face a growing problem: the volume of information produced in disparate electronic clinical notes is increasing beyond what humans can process. They need a function within information services that can reduce free-text data to a message that is useful at the time of care. Information extraction (IE) is a subfield of natural language processing whose goal is data reduction of unstructured free text. Central to IE is an annotated corpus that frames how IE methods should create the logical expressions necessary for processing the meaning of text. This study explores and reports on the requirements for using the predicate-argument structure (PAS) as that framework. A convenience sample from a prior study, ten synsets of 100 unique sentences from radiology reports deemed by domain experts to mean the same thing, provides the text from which PAS structures are formed. Through content analysis based on pattern recognition, the findings show that PAS is a feasible framework for structuring sentences for semantic similarity measurement.
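To make the idea of a PAS as a pre-processing framework concrete, the following is a minimal sketch rather than the method used in the study. It assumes PropBank-style role labels (ARG0, ARG1), uses two hand-written, hypothetical radiology-style sentences that are not taken from the study corpus, and substitutes a simple Jaccard overlap of (role, filler) pairs as a stand-in similarity measure. The names `PAS`, `pas_items`, and `pas_similarity` are illustrative only.

```python
from dataclasses import dataclass
from typing import Set, Tuple


@dataclass(frozen=True)
class PAS:
    """A predicate with its labeled arguments (hypothetical representation)."""
    predicate: str
    arguments: Tuple[Tuple[str, str], ...]  # (role, filler) pairs


def pas_items(pas: PAS) -> Set[Tuple[str, str]]:
    """Flatten a PAS into a set of (role, filler) pairs, predicate included."""
    items = {("PRED", pas.predicate)}
    items.update(pas.arguments)
    return items


def pas_similarity(a: PAS, b: PAS) -> float:
    """Jaccard overlap of (role, filler) pairs; an assumed stand-in measure."""
    items_a, items_b = pas_items(a), pas_items(b)
    union = items_a | items_b
    return len(items_a & items_b) / len(union) if union else 1.0


# Hypothetical sentences, hand-annotated for illustration only:
# "The chest radiograph shows no acute disease."
s1 = PAS("show", (("ARG0", "chest radiograph"), ("ARG1", "no acute disease")))
# "The chest radiograph shows no acute abnormality."
s2 = PAS("show", (("ARG0", "chest radiograph"), ("ARG1", "no acute abnormality")))

score = pas_similarity(s1, s2)  # 2 shared pairs out of 4 distinct pairs
```

Exact string matching on fillers is deliberately crude here: the two ARG1 fillers differ only in one word yet count as a complete mismatch, which is why a real pipeline would likely compare fillers with a lexical or ontology-based measure instead.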
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Newsom, E., Jones, J.F. (2013). Data Reduction for Continuum of Care: An Exploratory Study Using the Predicate-Argument Structure to Pre-process Radiology Sentences for Measurement of Semantic Similarity. In: Stephanidis, C., Antona, M. (eds) Universal Access in Human-Computer Interaction. Applications and Services for Quality of Life. UAHCI 2013. Lecture Notes in Computer Science, vol 8011. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-39194-1_60
DOI: https://doi.org/10.1007/978-3-642-39194-1_60
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-39193-4
Online ISBN: 978-3-642-39194-1
eBook Packages: Computer Science, Computer Science (R0)