Using Transduction and Multi-view Learning to Answer Emails

Kockelkorn, Michael; Lüneburg, Andreas; Scheffer, Tobias

doi:10.1007/978-3-540-39804-2_25

Michael Kockelkorn¹⁰,
Andreas Lüneburg¹⁰ &
Tobias Scheffer¹¹

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 2838))

Included in the following conference series:

European Conference on Principles of Data Mining and Knowledge Discovery

2207 Accesses
11 Citations

Abstract

Many organizations and companies have to answer large amounts of emails. Often, most of these emails contain variations of relatively few frequently asked questions. We address the problem of predicting which of several frequently used answers a user will choose to respond to an email. Our approach effectively utilizes the data that is typically available in this setting: inbound and outbound emails stored on a server. We take into account that there are no explicit links between inbound and corresponding outbound mails on the server. We map the problem to a semi-supervised classification problem that can be addressed by algorithms such as the transductive support vector machine and multi-view learning. We evaluate our approach using emails sent to a corporate customer service department.

Download to read the full chapter text

Chapter PDF

Multi-class E-mail Classification with a Semi-Supervised Approach Based on Automatic Feature Selection and Information Retrieval

Semi-supervised Learning with Transfer Learning

A Preliminary Study on Transductive Extreme Learning Machines

Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

References

Androutsopoulos, I., Koutsias, J., Chandrinos, K., Spyropoulos, C.: An experimental comparison of naive bayesian and keaword based anti-spam filtering with personal email messsages. In: Proceedings of the International ACM SIGIR Conference (2000)
Google Scholar
Bennett, P.: Assessing the calibration of naive bayes posterior estimates. Technical report, CMU (2000)
Google Scholar
Blum, A., Mitchell, T.: Combining labeled and unlabeled data with co-training. In: Proceedings of the Workshop on Computational Learning Theory (1998)
Google Scholar
Boone, T.: Concept features in Re:Agent, an intelligent email agent. Autonomous Agents (1998)
Google Scholar
Bradley, A.: The use of the area under the ROC curve in the evaluation of machine learning algorithms. Pattern Recognition 30(7), 1145–1159 (1997)
Article Google Scholar
Cohen, W.: Learnig rules that classify email. In: Proceedings of the IEEE Spring Symposium on Machine learning for Information Access (1996)
Google Scholar
Crawford, E., Kay, J., McCreath, E.: IEMS - the intelligent email sorter. In: Proceedings of the International Conference on Machine Learning (2002)
Google Scholar
Green, C., Edwards, P.: Using machine learning to enhance software tools for internet information management. In: Proceedings of the AAAI Workshop on Internet Information Management (1996)
Google Scholar
Joachims, T.: Making large-scale svm learning practical. In: Schölkopf, B., Burges, C., Smola, A. (eds.) Advances in Kernel Methods - Support Vector Learning (1999)
Google Scholar
Joachims, T.: Transductive inference for text classification using support vector machines. In: Proceedings of the International Conference on Machine Learning (1999)
Google Scholar
Kiritchenko, S., Matwin, S.: Email classification with co-training. Technical report, University of Ottawa (2002)
Google Scholar
Kolcz, A., Alspector, J.: Svm-based filtering of e-mail spam with content-specific misclassification costs. In: Proceedings of the ICDM Workshop on Text Mining (2001)
Google Scholar
Lewis, D.: The trec-5 filtering track. In: Proceedings of the Fifth Text Retrieval Conference (1997)
Google Scholar
Muslea, I., Kloblock, C., Minton, S.: Active + semi-supervised learning = robust multi-view learning. In: Proceedings of the International Conference on Machine Learning (2002)
Google Scholar
Nigam, K., Ghani, R.: Analyzing the effectiveness and applicability of cotraining. In: Proceedings of Information and Knowledge Management (2000)
Google Scholar
Pantel, P., Lin, D.: Spamcop: a spam classification and organization program. In: Proceedings of the AAAI Workshop on Learning for Text Categorization (1998)
Google Scholar
Provost, F., Fawcett, T., Kohavi, R.: The case against accuracy estimation in comparing classifiers. In: Proceedings of the International Conference on Machine Learning (1998)
Google Scholar
Sahami, M., Dumais, S., Heckerman, D., Horvitz, E.: A bayesian approach to filtering junk email. In: Proceedings of the AAAI Workshop on Learning for Text Categorization (1998)
Google Scholar
Segal, R., Kephart, J.: Mailcat: An intelligent assistant for organizing mail. Autonomous Agents (1999)
Google Scholar
Vorhees, E.: The trec-8 question answering track report. In: Proceedings of TREC-8 (1999)
Google Scholar

Download references

Author information

Authors and Affiliations

Tonxx, Berlin, Germany
Michael Kockelkorn & Andreas Lüneburg
School of Computer Science, Humboldt Univerersity, Unter den Linden 6, 10099, Berlin, Germany
Tobias Scheffer

Authors

Michael Kockelkorn
View author publications
You can also search for this author in PubMed Google Scholar
Andreas Lüneburg
View author publications
You can also search for this author in PubMed Google Scholar
Tobias Scheffer
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

University of Nova Gorica, Nova Gorica, Slovenia
Nada Lavrač
Rudjer Bošković Institute, Bijenička 54, 10000, Zagreb, Croatia
Dragan Gamberger
Jozef Stefan Institute, Jamova 39, 1000, Ljubljana, Slovenia
Ljupčo Todorovski
Leiden Institute of Advanced Computer Science, Leiden University,
Hendrik Blockeel

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Kockelkorn, M., Lüneburg, A., Scheffer, T. (2003). Using Transduction and Multi-view Learning to Answer Emails. In: Lavrač, N., Gamberger, D., Todorovski, L., Blockeel, H. (eds) Knowledge Discovery in Databases: PKDD 2003. PKDD 2003. Lecture Notes in Computer Science(), vol 2838. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-39804-2_25

Download citation

DOI: https://doi.org/10.1007/978-3-540-39804-2_25
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-20085-7
Online ISBN: 978-3-540-39804-2
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics

Using Transduction and Multi-view Learning to Answer Emails

Abstract

Chapter PDF

Similar content being viewed by others

Multi-class E-mail Classification with a Semi-Supervised Approach Based on Automatic Feature Selection and Information Retrieval

Semi-supervised Learning with Transfer Learning

A Preliminary Study on Transductive Extreme Learning Machines

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Using Transduction and Multi-view Learning to Answer Emails

Abstract

Chapter PDF

Similar content being viewed by others

Multi-class E-mail Classification with a Semi-Supervised Approach Based on Automatic Feature Selection and Information Retrieval

Semi-supervised Learning with Transfer Learning

A Preliminary Study on Transductive Extreme Learning Machines

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation