Skip to main content

Knowledge representation issues in information extraction

  • Knowledge Structure
  • Conference paper
  • First Online:
PRICAI’98: Topics in Artificial Intelligence (PRICAI 1998)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 1531))

Included in the following conference series:

  • 94 Accesses

Abstract

The advent of computing has exacerbated the problem of overwhelming information. Advanced information management strategies such as Information Extraction, Information Filtering, Information Retrieval, and Text Categorization are becoming important to manage the deluge of information. Information Extraction (IE) systems can be used to automatically extract relevant information from free-form text for update to databases or for report generation. This paper describes the major challenge of knowledge representation issues in an information extraction task-representing the meaning of the input text, the knowledge of the field of application (or domain application) and the knowledge about the target information to be extracted. In this research, we have chosen a directed graph structure to represent the input text meaning, a domain ontology to represent the domain application and a frame representation to capture the target information to be extracted. We discuss in this paper how these knowledge structures interplay to perform the task of information extraction.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Appelt D E, J Bear, J R Hobbs, D Israel and M Tyson (1992). “SRI International FASTUS System” Proc. MUC-4, Morgan Kaufmann: 143–147.

    Google Scholar 

  2. Bobrow G. Daniel and Winograd Terry (1977). “An Overview of KRL, a Knowledge Representation Language”. Cognitive Science 1(1), 1977, 3–46.

    Article  Google Scholar 

  3. Bobrow G. Daniel, R M Kaplan, M Kay, D A Norman, H Thompson and T Winograd (1977). “GUS, A Frame-Driven Dialog System” Artificial Intelligence, North-Holland Publishing Company 1977:155–173.

    Google Scholar 

  4. Brachman R.J. and Schmolze J.G. (1985). “An Overview of the KL-ONE Knowledge Representation System”. Cognitive Science 9: 171–216.

    Article  Google Scholar 

  5. Charniak Eugene (1978). “On the use of framed knowledge in language comprehension”. Artificial Intelligence 11: 225–265

    Article  Google Scholar 

  6. DARPA (1991). Proc. of Third Message Understanding Conference (MUC-3). Morgan Kaufmann Publishers Inc.

    Google Scholar 

  7. DARPA (1992). Proc. of Fourth Message Understanding Conference (MUC-4). Morgan Kaufmann Publishers Inc.

    Google Scholar 

  8. DARPA (1993). Proc. of Fifth Message Understanding Conference (MUC-5). Morgan Kaufmann Publishers Inc.

    Google Scholar 

  9. DARPA (1995). Proc. of Sixth Message Understanding Conference (MUC-6). Morgan Kaufmann Publishers Inc.

    Google Scholar 

  10. Jensen Karen, Heidorn E. George, Richardson D. Stephen 1993 Natural Language Processing: The PLNLP Approach. Kluwer Academic Publishers, Boston/Dordrecht/London. Chapter 16: 203–214. Chapter 21: 273–283.

    Google Scholar 

  11. Krupka G, P Jacobs, L Rau, and L Iwanska (1991). “The GE NLToolset System” Proc. MUC-3, Morgan Kaufmann.

    Google Scholar 

  12. Marco Costantino, Richard G. Morgan, Russell J. Collingham, Roberto Garigliano (1997). “Natural Language Processing and Information Extraction: Qualitative Analysis of Financial News Articles” Proc. of Conference on Computational Intelligence for Financial Engineering (CIFEr’97), New York City, March 23–25, 1997.

    Google Scholar 

  13. Nyberg H. Eric (1988). “The FrameKit User’s Guide Version 2.0”, Carnegie Mellon University, 1988.

    Google Scholar 

  14. Tan Sian Lip, Tong Loong Cheong (1993). “A statistical approach to automatic text extraction.” Asian Libraries, Vol. 3 No 1, Mar 1993: 46–54.

    Google Scholar 

  15. Tan Sian Lip, Aw Ai Ti (1993). “Domain specific information Extraction—a NLP-Enable application.” Proc. of the First Symposium on Intelligent Systems Applications (SISA ’93), Singapore, Nov 1993.

    Google Scholar 

  16. Tong Loong Cheong, Wee Li Kwang, Goh Ann Loo, Lee Chee Qwun, and Teo Pit Koon (dy1992). “A Telex Destination Identification System.” Proc. First Singapore Int. Conf. on Intelligent Systems (SPICIS 92), Sep 1992: 281–287.

    Google Scholar 

  17. Allen James (1987). “Natural Language Understanding”. University of Rochester. Menlo Park: The Benjamin/Cummings Publishing Company, Inc.

    Google Scholar 

  18. Wan Kwee Ngim, Tong Loong Cheong, Lynda Ang Seok Lay (1993). “Automatic Categorisation of Cargo Descriptions.” Proc. of the First Symposium on Intelligent Systems Applications (SISA ’93), Singapore, Nov 1993.

    Google Scholar 

  19. Tong Loong Cheong, Angela Wee Li Kwang, Augustina Gunawan, Goh Ann Loo, Lee Chee Qwun, and Shu Huey Leng (1994). “A Pragmatic Information Extraction Architecture for the Message Formatting Expert (MFE) System”. Proc. Second Singapore Int. Conf. on Intelligent Systems (SPICIS 94), Nov 1994: B371–377.

    Google Scholar 

  20. Wee Li Kwang Angela, Tong Loong Cheong, Chng Tiak Jung (1997). “DeNews—A Personalized News System.” Journal of Expert Systems with Applications, Vol. 13, 1997, Elsevier Science Ltd., UK 0957-4174/97.

    Google Scholar 

  21. Dolan C P, S R Goldman, T V Cuda and A M Nakamura (1991). “Hughes Trainable Text Skimmer” Proc. MUC-3, Morgan Kaufmann: 155–162.

    Google Scholar 

  22. Tong Loong Cheong, Low Poh Lian (1991). “Automatic Text Abstraction-Prospects and a Proposed R&D Plan” Information Technology. Journal of SCS, Vol 4 No 2, Sep 1991: 85–94.

    Google Scholar 

  23. Julian Kupiec, Jan O. Pedersen, Francine Chen (1995). “A Trainable Document Summarizer”. Proc. of the 18th ACM/SIGIR Conference, 1995: 68–73.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Hing-Yan Lee Hiroshi Motoda

Rights and permissions

Reprints and permissions

Copyright information

© 1998 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Angela, W.L.K., Cheong, T.L., Lim, T.C. (1998). Knowledge representation issues in information extraction. In: Lee, HY., Motoda, H. (eds) PRICAI’98: Topics in Artificial Intelligence. PRICAI 1998. Lecture Notes in Computer Science, vol 1531. Springer, Berlin, Heidelberg . https://doi.org/10.1007/BFb0095291

Download citation

  • DOI: https://doi.org/10.1007/BFb0095291

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-65271-7

  • Online ISBN: 978-3-540-49461-4

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics