Skip to main content

Conversation Detection in Email Systems

  • Conference paper
Advances in Information Retrieval (ECIR 2008)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 4956))

Included in the following conference series:

Abstract

This work explores a novel approach for conversation detection in email mailboxes. This approach clusters messages into coherent conversations by using a similarity function among messages that takes into consideration all relevant email attributes, such as message subject, participants, date of submission, and message content. The detection algorithm is evaluated against a manual partition of two email mailboxes into conversations. Experimental results demonstrate the superiority of our detection algorithm over several other alternative approaches.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Aaron, H., Jen-Yuan, Y.: Email thread reassembly using similarity matching. In: Proceedings of the Third Conference on Email and Anti-Spam (CEAS) (2006)

    Google Scholar 

  2. Gabor, C., Keno, A., Roger, W.: BuzzTrack: Topic Detection and Tracking in Email. In: Proceedings of the 12th international conference on Intelligent user interfaces IUI 2007, ACM Press, New York (2007)

    Google Scholar 

  3. Kalman, Y.M., Rafaeli, S.: Email Chronemics: Unobtrusive Profiling of Response Times. In: Proceedings of the 38th Annual Hawaii International Conference on System Sciences (HICSS 2005), vol. 04, pp. 108.2 (2005)

    Google Scholar 

  4. Kerr, B.: THREAD ARCS: An Email Thread Visualization. In: Proceedings of IEEE InfoVis, Seattle, WA, pp. 211–218 (2003)

    Google Scholar 

  5. Klimt, B., Yang, Y.: Introducing the Enron Corpus. In: Proceedings of the First Conference on Email and Anti-Spam (CEAS), Mountain View, CA (2004)

    Google Scholar 

  6. Lam, D., Rohall, S.L., Schmandt, C., Stern, M.K.: Exploiting e-mail structure to improve summarization. In: ACM 2002 Conference on Computer Supported Cooperative Work (CSCW2002), New Orlenes, LA (2002)

    Google Scholar 

  7. Lewis, D.D., Gale, A.W.: A Sequential Algorithm for Training Text Classifiers. In: Proceedings of the 17th annual international ACM SIGIR conference on Research and development in information retrieval, Dublin, Ireland, pp. 3–12 (1994)

    Google Scholar 

  8. Lewis, D.D., Knowels, K.A.: Threading Electronic Mail: a preliminary study. In Information Processing and Management 33(2), 209–217 (1997)

    Article  Google Scholar 

  9. Rudy, I.A.: A Critical Review of Research on Electronic Mail. European Journal of Information Systems 4, 198–213 (1996)

    Article  Google Scholar 

  10. The Internet Society. RFC 2822 – Internet Message Format (2001), http://www.faqs.org/rfcs/rfc2822.html

Download references

Author information

Authors and Affiliations

Authors

Editor information

Craig Macdonald Iadh Ounis Vassilis Plachouras Ian Ruthven Ryen W. White

Rights and permissions

Reprints and permissions

Copyright information

© 2008 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Erera, S., Carmel, D. (2008). Conversation Detection in Email Systems. In: Macdonald, C., Ounis, I., Plachouras, V., Ruthven, I., White, R.W. (eds) Advances in Information Retrieval. ECIR 2008. Lecture Notes in Computer Science, vol 4956. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-78646-7_48

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-78646-7_48

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-78645-0

  • Online ISBN: 978-3-540-78646-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics