Abstract
In this paper, we are concerned with the problem of automatic template creation for Information Extraction (IE) and we present a methodology for the creation of IE templates. Our approach proposes the semi-automatic construction of a semantic representation of textual information based on recognition of multi-word and nested terms and Named Entities (NEs) and subsequent exploitation of term and NE context for the induction of Information Extraction template rules.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Bagga, A., J. Y. Chai and A. Biermann: The Role of WordNet in the Creation of a Trainable Message Understanding System. In Proceedings of the Fourteenth Conference on Artificial Intelligence (AAAI/IAAI-97), (1997) 941–948
Boguraev, B. and C. Kennedy:Technical Terminology for Domain Specification and Content Characterisation. In Information Extraction: A multi-disciplinary approach to an emerging information technology. International Summer School, SCIE-97, Frascati, Italy, July 14–18.1997, M.T. Pazienza (ed.) Springer, (1997) 27–96
Boguraev, B. and C. Kennedy: Salience-Based Content Characterisation of Text Documents. In Proceedings of ACL/EACL’7 Workshop on Intelligent Scalable Text Summarisation, Madrid, Spain, (1997) 2–9
Bourigault, D.: LEXTER, a Terminology Extraction Software for Knowledge Acquisition from Texts. In Proceedings of the Ninth Knowledge Acquisition for Knowledge Based System Workshop (KAW’95), Banff, Canada, (1995)
Califf, M. E. and R. J. Mooney: Relational Learning of Pattern-Match Rules for Information Extraction. In Working Papers of ACL-97 Workshop on Natural Language Learning, (1997) 9–15
Chinchor, N. A.: MUC-7 Named Entity Task Definition. Version 3.4, 13 July 1997.
Chinchor, N. A.: Overview of MUC-7/MET-2. In Science Applications International Corporation (SAIC), (1998 )http://www.muc.saic.com/proceedings/muc_7_proceedings/overview.html
Frantzi, K. T. and S. Ananiadou: The C-Value/NC-Value Domain Independent Method for Multi-Word Term Extraction. In Journal of Natural Language Processing, 6(3) (1999) 145–179
Justeson, J. S. and S. M. Katz: Technical Terminology: Some Linguistic Properties and an Algorithm for Identification in Text. In Natural Language Engineering, 1(1) (1995) 9–27
McNaught, J., W. J. Black, F. Rinaldi, E. Bertino, A. Brasher, D. Deavin, B. Catania, D. Silvestri, B. Armani, P. Leo, A. Persidis, G. Semeraro, F. Esposito, G. P. Zarri and L. Gilardoni: Integrated Document and Knowledge Management for the Knowledge-based Enterprise. In Proceedings of Practical Application of Knowledge Management 2000 (PAKeM 2000) (forthcoming), Manchester, (April 2000) 10–14
Mikheev, A., M. Moens and C. Grover: Named Entity Recognition without Gazetteers. In Proceedings of EACL’99, (1999) 1–8
Miller, G.A., R. Beckwith, C. Fellbaum, D. Gross and K. Miller: Introduction to WordNet: An Online Lexical Database. In Five Papers on WordNet, (1993) 1–9 ftp://ftp.cogsci.princeton.edu/pub/wordnet/5papers.ps
Riloff, E.: Automatically Constructing a Dictionary for Information Extraction Tasks. In Proceedings of the Eleventh National Conference on Artificial Intelligence (AAAI-93), (1993) 811–816
Riloff, E.: Automatically Generating Extraction Patterns from Untagged Text. In Proceedings of the Thirteenth National Conference on Artificial Intelligence (AAAI-96), (1996) 1044–1049
Riloff, E.: An Empirical Study of Automated Dictionary Construction for Information Extraction in Three Domains. AI Journal, 85 (August 1996)
Riloff, E.and R. Jones: Learning Dictionaries for Information Extraction by MultiLevel Bootstrapping. In Proceedings of the Sixteenth National Conference on Artificial Intelligence (AAAI-99), (1999)
Sager, J.C., D. Dungworth and P. F. McDonald: English Special Languages: principles and practice in science and technology. Oscar Brandstetter Verlag KG, Wiesbaden, (1980)
Soderland, S.: Learning Information Extraction Rules for Semi-structured and Free Text. In Machine Learning, C. Cardie and R. Mooney (eds.) Kluwer Academic Publishers, Boston (1999) 1–44
Soderland, S., D. Fisher, J. Aseltine and W. Lehnert: CRYSTAL: Inducing a Conceptual Dictionary. In Proceedings of the Fourteenth International Joint Conference on Artificial Intelligence (IJCAI’ 95), (1995) 1314–1319
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2000 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Zervanou, K., McNaught, J. (2000). A Term-Based Methodology for Template Creation in Information Extraction. In: Christodoulakis, D.N. (eds) Natural Language Processing — NLP 2000. NLP 2000. Lecture Notes in Computer Science(), vol 1835. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45154-4_38
Download citation
DOI: https://doi.org/10.1007/3-540-45154-4_38
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-67605-8
Online ISBN: 978-3-540-45154-9
eBook Packages: Springer Book Archive