Information Extraction from Email Announcements

  • Viktor Pekar
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 3513)


Public email announcements present a number of unique challenges for an Information Extraction (IE) system, such as the presence of both free and semi-structured text, inconsistent document layout and widely varying formats of template fillers. In this paper we describe a study of parametrisation of an IE method to determine settings that best suit the specifics of the task at hand.


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Kushmerick, N., Weld, D., Doorenbos, R.: Wrapper Induction for Information Extraction. In: Proc. of IJCAI 1997, pp. 729–737 (1997)Google Scholar
  2. 2.
    De Sitter, A., Daelemans, W.: Information Extraction via Double Classification. In: Proc. of the ECML/PKDD 2003 Workshop on Adaptive Text Extraction and Mining, Cavtat- Dubrovnik, Croatia (2003)Google Scholar
  3. 3.
    Soderland, S.: Learning Information Extraction Rules for Semi-structured and Free Text. Machine Learning 34, 233–272 (1999)zbMATHCrossRefGoogle Scholar
  4. 4.
    Witten, I., Frank, E.: Data Mining – Practical Machine Learning Tools and Techniques with Java Implementations. Morgan-Kaufmann, San Francisco (2000)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2005

Authors and Affiliations

  • Viktor Pekar
    • 1
  1. 1.Research Group for Computational LinguisticsUniversity of WolverhamptonWolverhamptonUK

Personalised recommendations