Advertisement

Classification of Email Queries by Topic: Approach Based on Hierarchically Structured Subject Domain

  • Anna V. Zhdanova
  • Denis V. Shishkin
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 2412)

Abstract

We describe a Classifier of email queries, which executes text categorization by topic. The specifics of our Classifier is that it allows accurate categorization of short messages containing only a few words. This advantage is achieved by executing morphological and semantic analyses of an incoming text. Specifically, the Classifier provides an efficient information extraction and takes the meaning of words into consideration. By using the hierarchically structured subject domain and classification rules, the Classifier’s engine assigns an email query to the most relevant category or categories.

Keywords

Regular Expression Information Extraction Tree Node Subject Domain Document Cluster 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Žižka, J., Bourek, A., Frey, L.: TEA: A Text Analysis Tool for the Intelligent Text Document Filtering. In: Proceedings of the 3-rd International Workshop on Text, Speech and Dialogue (2000).Google Scholar
  2. 2.
    Manning, C.D., Schutze, H.: Foundations of Statistical Natural Language Processing. The MIT Press (2001).Google Scholar
  3. 3.
    Makagonov, P., Sboychakov, K.: Software for Creating Domain-Oriented Dictionaries and Document Clustering in Full-Text Databases. In: Proceedings of the Second International Conference on Intelligent Text Processing and Computational Linguistics (2001).Google Scholar
  4. 4.
    Cohen, W.W.: Fast Effective Rule Induction. In: Proceedings of the Second International Conference on Machine Learning (1995).Google Scholar
  5. 5.
    Tiun, S., Abdullah, R., Kong, T.E.: Automatic Topic Identification Using Ontology Hierarchy. In: Proceedings of the Second International Conference on Intelligent Text Processing and Computational Linguistics (2001).Google Scholar
  6. 6.
    Kosseim, L., Beauregard, S., Lapalme, G.: Using Information Extraction and Natural Language Generation to Answer E-mail. In: Proceedings of the 5-th International Conference on Applications of Natural Language to Information Systems (2000).Google Scholar
  7. 7.
    Bergholtz, M., Johannesson, P.: Validating Conceptual Models-Utilizing Analysis Patterns as an Instrument for Explanation Generation. In: Proceedings of the 5-th International Conference on Applications of Natural Language to Information Systems (2000).Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2002

Authors and Affiliations

  • Anna V. Zhdanova
    • 1
    • 2
  • Denis V. Shishkin
    • 1
    • 2
  1. 1.Novosibirsk State UniversityNovosibirskRussia
  2. 2.A.P. Ershov Institute of Informatics SystemsNovosibirskRussia

Personalised recommendations