Knowledge representation issues in information extraction

Angela, Wee Li Kwang; Cheong, Tong Loong; Lim, Tan Chew

doi:10.1007/BFb0095291

Wee Li Kwang Angela¹,
Tong Loong Cheong¹ &
Tan Chew Lim²

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 1531))

Included in the following conference series:

Pacific Rim International Conference on Artificial Intelligence

94 Accesses

Abstract

The advent of computing has exacerbated the problem of overwhelming information. Advanced information management strategies such as Information Extraction, Information Filtering, Information Retrieval, and Text Categorization are becoming important to manage the deluge of information. Information Extraction (IE) systems can be used to automatically extract relevant information from free-form text for update to databases or for report generation. This paper describes the major challenge of knowledge representation issues in an information extraction task-representing the meaning of the input text, the knowledge of the field of application (or domain application) and the knowledge about the target information to be extracted. In this research, we have chosen a directed graph structure to represent the input text meaning, a domain ontology to represent the domain application and a frame representation to capture the target information to be extracted. We discuss in this paper how these knowledge structures interplay to perform the task of information extraction.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Appelt D E, J Bear, J R Hobbs, D Israel and M Tyson (1992). “SRI International FASTUS System” Proc. MUC-4, Morgan Kaufmann: 143–147.
Google Scholar
Bobrow G. Daniel and Winograd Terry (1977). “An Overview of KRL, a Knowledge Representation Language”. Cognitive Science 1(1), 1977, 3–46.
Article Google Scholar
Bobrow G. Daniel, R M Kaplan, M Kay, D A Norman, H Thompson and T Winograd (1977). “GUS, A Frame-Driven Dialog System” Artificial Intelligence, North-Holland Publishing Company 1977:155–173.
Google Scholar
Brachman R.J. and Schmolze J.G. (1985). “An Overview of the KL-ONE Knowledge Representation System”. Cognitive Science 9: 171–216.
Article Google Scholar
Charniak Eugene (1978). “On the use of framed knowledge in language comprehension”. Artificial Intelligence 11: 225–265
Article Google Scholar
DARPA (1991). Proc. of Third Message Understanding Conference (MUC-3). Morgan Kaufmann Publishers Inc.
Google Scholar
DARPA (1992). Proc. of Fourth Message Understanding Conference (MUC-4). Morgan Kaufmann Publishers Inc.
Google Scholar
DARPA (1993). Proc. of Fifth Message Understanding Conference (MUC-5). Morgan Kaufmann Publishers Inc.
Google Scholar
DARPA (1995). Proc. of Sixth Message Understanding Conference (MUC-6). Morgan Kaufmann Publishers Inc.
Google Scholar
Jensen Karen, Heidorn E. George, Richardson D. Stephen 1993 Natural Language Processing: The PLNLP Approach. Kluwer Academic Publishers, Boston/Dordrecht/London. Chapter 16: 203–214. Chapter 21: 273–283.
Google Scholar
Krupka G, P Jacobs, L Rau, and L Iwanska (1991). “The GE NLToolset System” Proc. MUC-3, Morgan Kaufmann.
Google Scholar
Marco Costantino, Richard G. Morgan, Russell J. Collingham, Roberto Garigliano (1997). “Natural Language Processing and Information Extraction: Qualitative Analysis of Financial News Articles” Proc. of Conference on Computational Intelligence for Financial Engineering (CIFEr’97), New York City, March 23–25, 1997.
Google Scholar
Nyberg H. Eric (1988). “The FrameKit User’s Guide Version 2.0”, Carnegie Mellon University, 1988.
Google Scholar
Tan Sian Lip, Tong Loong Cheong (1993). “A statistical approach to automatic text extraction.” Asian Libraries, Vol. 3 No 1, Mar 1993: 46–54.
Google Scholar
Tan Sian Lip, Aw Ai Ti (1993). “Domain specific information Extraction—a NLP-Enable application.” Proc. of the First Symposium on Intelligent Systems Applications (SISA ’93), Singapore, Nov 1993.
Google Scholar
Tong Loong Cheong, Wee Li Kwang, Goh Ann Loo, Lee Chee Qwun, and Teo Pit Koon (dy1992). “A Telex Destination Identification System.” Proc. First Singapore Int. Conf. on Intelligent Systems (SPICIS 92), Sep 1992: 281–287.
Google Scholar
Allen James (1987). “Natural Language Understanding”. University of Rochester. Menlo Park: The Benjamin/Cummings Publishing Company, Inc.
Google Scholar
Wan Kwee Ngim, Tong Loong Cheong, Lynda Ang Seok Lay (1993). “Automatic Categorisation of Cargo Descriptions.” Proc. of the First Symposium on Intelligent Systems Applications (SISA ’93), Singapore, Nov 1993.
Google Scholar
Tong Loong Cheong, Angela Wee Li Kwang, Augustina Gunawan, Goh Ann Loo, Lee Chee Qwun, and Shu Huey Leng (1994). “A Pragmatic Information Extraction Architecture for the Message Formatting Expert (MFE) System”. Proc. Second Singapore Int. Conf. on Intelligent Systems (SPICIS 94), Nov 1994: B371–377.
Google Scholar
Wee Li Kwang Angela, Tong Loong Cheong, Chng Tiak Jung (1997). “DeNews—A Personalized News System.” Journal of Expert Systems with Applications, Vol. 13, 1997, Elsevier Science Ltd., UK 0957-4174/97.
Google Scholar
Dolan C P, S R Goldman, T V Cuda and A M Nakamura (1991). “Hughes Trainable Text Skimmer” Proc. MUC-3, Morgan Kaufmann: 155–162.
Google Scholar
Tong Loong Cheong, Low Poh Lian (1991). “Automatic Text Abstraction-Prospects and a Proposed R&D Plan” Information Technology. Journal of SCS, Vol 4 No 2, Sep 1991: 85–94.
Google Scholar
Julian Kupiec, Jan O. Pedersen, Francine Chen (1995). “A Trainable Document Summarizer”. Proc. of the 18^th ACM/SIGIR Conference, 1995: 68–73.
Google Scholar

Download references

Author information

Authors and Affiliations

Kent Ridge Digital Labs, 21 Heng Mui Keng Terrace, 119613, Singapore
Wee Li Kwang Angela & Tong Loong Cheong
School of Computing, National University of Singapore, Kent Ridge, 119260, Singapore
Tan Chew Lim

Authors

Wee Li Kwang Angela
View author publications
You can also search for this author in PubMed Google Scholar
Tong Loong Cheong
View author publications
You can also search for this author in PubMed Google Scholar
Tan Chew Lim
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Hing-Yan Lee Hiroshi Motoda

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Angela, W.L.K., Cheong, T.L., Lim, T.C. (1998). Knowledge representation issues in information extraction. In: Lee, HY., Motoda, H. (eds) PRICAI’98: Topics in Artificial Intelligence. PRICAI 1998. Lecture Notes in Computer Science, vol 1531. Springer, Berlin, Heidelberg . https://doi.org/10.1007/BFb0095291

Download citation

DOI: https://doi.org/10.1007/BFb0095291
Published: 20 October 2006
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-65271-7
Online ISBN: 978-3-540-49461-4
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics