Abstract
Many organizations and companies have to answer large amounts of emails. Often, most of these emails contain variations of relatively few frequently asked questions. We address the problem of predicting which of several frequently used answers a user will choose to respond to an email. Our approach effectively utilizes the data that is typically available in this setting: inbound and outbound emails stored on a server. We take into account that there are no explicit links between inbound and corresponding outbound mails on the server. We map the problem to a semi-supervised classification problem that can be addressed by algorithms such as the transductive support vector machine and multi-view learning. We evaluate our approach using emails sent to a corporate customer service department.
Chapter PDF
Similar content being viewed by others
Keywords
- Receiver Operating Characteristic Curve
- Decision Function
- Unlabeled Data
- Standard Answer
- Server Breakdown
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
Androutsopoulos, I., Koutsias, J., Chandrinos, K., Spyropoulos, C.: An experimental comparison of naive bayesian and keaword based anti-spam filtering with personal email messsages. In: Proceedings of the International ACM SIGIR Conference (2000)
Bennett, P.: Assessing the calibration of naive bayes posterior estimates. Technical report, CMU (2000)
Blum, A., Mitchell, T.: Combining labeled and unlabeled data with co-training. In: Proceedings of the Workshop on Computational Learning Theory (1998)
Boone, T.: Concept features in Re:Agent, an intelligent email agent. Autonomous Agents (1998)
Bradley, A.: The use of the area under the ROC curve in the evaluation of machine learning algorithms. Pattern Recognition 30(7), 1145–1159 (1997)
Cohen, W.: Learnig rules that classify email. In: Proceedings of the IEEE Spring Symposium on Machine learning for Information Access (1996)
Crawford, E., Kay, J., McCreath, E.: IEMS - the intelligent email sorter. In: Proceedings of the International Conference on Machine Learning (2002)
Green, C., Edwards, P.: Using machine learning to enhance software tools for internet information management. In: Proceedings of the AAAI Workshop on Internet Information Management (1996)
Joachims, T.: Making large-scale svm learning practical. In: Schölkopf, B., Burges, C., Smola, A. (eds.) Advances in Kernel Methods - Support Vector Learning (1999)
Joachims, T.: Transductive inference for text classification using support vector machines. In: Proceedings of the International Conference on Machine Learning (1999)
Kiritchenko, S., Matwin, S.: Email classification with co-training. Technical report, University of Ottawa (2002)
Kolcz, A., Alspector, J.: Svm-based filtering of e-mail spam with content-specific misclassification costs. In: Proceedings of the ICDM Workshop on Text Mining (2001)
Lewis, D.: The trec-5 filtering track. In: Proceedings of the Fifth Text Retrieval Conference (1997)
Muslea, I., Kloblock, C., Minton, S.: Active + semi-supervised learning = robust multi-view learning. In: Proceedings of the International Conference on Machine Learning (2002)
Nigam, K., Ghani, R.: Analyzing the effectiveness and applicability of cotraining. In: Proceedings of Information and Knowledge Management (2000)
Pantel, P., Lin, D.: Spamcop: a spam classification and organization program. In: Proceedings of the AAAI Workshop on Learning for Text Categorization (1998)
Provost, F., Fawcett, T., Kohavi, R.: The case against accuracy estimation in comparing classifiers. In: Proceedings of the International Conference on Machine Learning (1998)
Sahami, M., Dumais, S., Heckerman, D., Horvitz, E.: A bayesian approach to filtering junk email. In: Proceedings of the AAAI Workshop on Learning for Text Categorization (1998)
Segal, R., Kephart, J.: Mailcat: An intelligent assistant for organizing mail. Autonomous Agents (1999)
Vorhees, E.: The trec-8 question answering track report. In: Proceedings of TREC-8 (1999)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Kockelkorn, M., Lüneburg, A., Scheffer, T. (2003). Using Transduction and Multi-view Learning to Answer Emails. In: Lavrač, N., Gamberger, D., Todorovski, L., Blockeel, H. (eds) Knowledge Discovery in Databases: PKDD 2003. PKDD 2003. Lecture Notes in Computer Science(), vol 2838. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-39804-2_25
Download citation
DOI: https://doi.org/10.1007/978-3-540-39804-2_25
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-20085-7
Online ISBN: 978-3-540-39804-2
eBook Packages: Springer Book Archive