Advertisement

Extracting Information from Short Messages

  • Richard Cooper
  • Sajjad Ali
  • Chenlan Bi
Part of the Lecture Notes in Computer Science book series (LNCS, volume 3513)

Abstract

Much currently transmitted information takes the form of e-mails or SMS text messages and so extracting information from such short messages is increasingly important. The words in a message can be partitioned into the syntactic structure, terms from the domain of discourse and the data being transmitted. This paper describes a light-weight Information Extraction component which uses pattern matching to separate the three aspects: the structure is supplied as a template; domain terms are the metadata of a data source (or their synonyms), and data is extracted as those words matching placeholders in the templates.

Keywords

Pattern Match Entity Type Sentence Structure Short Message Natural Language Semantic 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Gaizauskas, R., Wilks, Y.: Information Extraction: Beyond Document Retrieval. Journal of Documentation 54(1), 70–105 (1998)CrossRefGoogle Scholar
  2. 2.
    Fisher, D., Soderland, S., McCarthy, J., Feng, F., Lehnert, W.: Umass System, MUC-6 (1995)Google Scholar
  3. 3.
    Cardie, C.: Empirical Methods in Information Extraction. AI Magazine 18(4), 65–79 (1997)Google Scholar
  4. 4.
  5. 5.
    Kang, I.-S., Na, S.-H., Lee, J.-H., Yang, G.: Lightweight Natural Language Database Interfaces. In: Meziane, F., Métais, E. (eds.) NLDB 2004. LNCS, vol. 3136, pp. 76–88. Springer, Heidelberg (2004)CrossRefGoogle Scholar
  6. 6.
    Stratica, N., Desai, B.C.: Schema-Based Natural Language Semantic Mapping. In: Meziane, F., Métais, E. (eds.) NLDB 2004. LNCS, vol. 3136, pp. 103–113. Springer, Heidelberg (2004)CrossRefGoogle Scholar
  7. 7.
    Cooper, R.L., Ali, S.: Extracting Database Information from E-mail Messages. In: James, A., Younas, M., Lings, B. (eds.) BNCOD 2003. LNCS, vol. 2712, pp. 271–279. Springer, Heidelberg (2003)CrossRefGoogle Scholar
  8. 8.
    Cooper, R.L., Ali, S., Bi, C.L.: A System for Extracting Information from Short Messages, Technical Report, University of Glasgow (in press)Google Scholar
  9. 9.
    Agichtein, E., Gravano, L.: Snowball: Extracting Relations from Large Plain-Text Collections. In: Proc. 5th ACM International Conference on Digital Libraries, DL (2000)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2005

Authors and Affiliations

  • Richard Cooper
    • 1
  • Sajjad Ali
    • 1
  • Chenlan Bi
    • 1
  1. 1.Computing ScienceUniversity of GlasgowGlasgow

Personalised recommendations