Skip to main content

Spam Filtering and Email-Mediated Applications

  • Conference paper
Web Intelligence Meets Brain Informatics (WImBI 2006)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 4845))

Included in the following conference series:

Abstract

This chapter reviews and examines two important research topics related to intelligent email processing, namely, email filtering and email-mediated applications. We present a framework to show a full process of email filtering. Within the framework, we suggest a new method of combining multiple filters and propose a novel filtering model based on ensemble learning. For email-mediated applications, we introduce the concept of operable email (OE). It is argued that operable email will play a fundamental role in future email systems, in order to meet the need of the World Wide Wisdom Web (W4). We demonstrate the use of OE in implementing an email assistant and other intelligent applications on the World Social Email Network (WSEN).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Androutsopoulos, I., Koutsias, J., Chandrinos, K.V., Spyropoulos, C.D.: An experimental comparison of naive Bayesian and keyword-based anti-spam filtering with encrypted personal e-mail messages. In: Proc. of the 23rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2000), pp. 160–167 (2000)

    Google Scholar 

  2. Androutsopoulos, I., Georgios, P., Michelakis, E.: Learning to filter unsolicited commercial e-mail. Technical Report 2004/2, NCSR Demokritos00 (2004)

    Google Scholar 

  3. Bergman, R., Griss, M., Staelin, C.: A personal email assistant. Technical Report HPL-2002-236, HP Labs Palo Alto (2002), http://citeseer.ist.psu.edu/bergman02personal.html

  4. Berners-Lee, T., Hendler, J., Lassila, O.: The Semantic Web: a new form of Web content that is meaningful to computers will unleash a revolution of new possibilities. Scientific American 284(5), 34–43 (2001)

    Article  Google Scholar 

  5. Boykin, P.O., Roychowdhury, V.: Personal email networks: an effective anti-spam tool. IEEE Computer 38(4), 61–68 (2005)

    MathSciNet  Google Scholar 

  6. Chris, D., Robert, C.H.: Cost curves: an improved method for visualizing classifier performance. Machine Learning 65(1), 95–130 (2006)

    Article  Google Scholar 

  7. Dempster, A.P., Laird, N.M., Rubin, D.B.: Maximum likelihood from incomplete data via the EM algorithm. Journal of the Royal Statistical Society 39(1), 1–38 (1977)

    MATH  MathSciNet  Google Scholar 

  8. Deng, Y.H., Tsai, T.H., Hsu, J.: P@rty: a personal email agent. In: Proc. of Agent Technology Workshop, pp. 61–64 (1999)

    Google Scholar 

  9. Drucker, H., Wu, D., Vapnik, V.N.: Support vector machines for spam categorization. IEEE Transactions on Neural Networks 20(5), 1048–1054 (1999)

    Article  Google Scholar 

  10. Duda, R.O., Hart, P.E.: Pattern Classification and Scene Analysis (1973)

    Google Scholar 

  11. Fawcett, T.: In vivo spam filtering: a challenge problem for data mining. KDD Explorations 5(2), 140–148 (2003)

    Article  Google Scholar 

  12. Ho, V., Wobcke, W., Compton, P.: EMMA: an email management assistant. In: Proc. of 2003 IEEE/WIC International Conference on Intelligent Agent Technology (IAT 2003), pp. 67–74 (2003)

    Google Scholar 

  13. Hardle, W., Simar, L.: Applied Multivariate Statistical Analysis, 341–357 (2003)

    Google Scholar 

  14. Hendler, J.: Agents and the Semantic Web. IEEE Intelligent Systems 16(2), 30–37 (2001)

    Article  Google Scholar 

  15. Jason, D.M., Rennie, J.: ifile: an application of machine learning to e-mail filtering. In: Proc. of the KDD-2000 Text Mining Workshop, pp. 95–98 (2000)

    Google Scholar 

  16. Li, W.B., Zhong, N., Liu, J.M., Yao, Y.Y., Liu, C.N.: A perspective on global email networks. In: Proc. of 2006 IEEE/WIC/ACM International Conference on Web Intelligence (WI 2006), pp. 117–120 (2006)

    Google Scholar 

  17. Li, W.B., Zhong, N., Yao, Y.Y., Liu, J.M., Liu, C.N.: Developing intelligent applications in social e-mail networks. In: Greco, S., Hata, Y., Hirano, S., Inuiguchi, M., Miyamoto, S., Nguyen, H.S., Słowiński, R. (eds.) RSCTC 2006. LNCS (LNAI), vol. 4259, pp. 776–785. Springer, Heidelberg (2006)

    Chapter  Google Scholar 

  18. Li, W.B., Liu, C.N., Chen, Y.Y.: Combining multiple email filters of naive Bayes based on GMM. Acta Electronica Sinica 34(2), 247–251 (2006)

    Google Scholar 

  19. Li, W.B., Zhong, N., Liu, C.N.: Design and implementation of an email classifier. In: Proc. of International Conference on Active Media Technology (AMT 2003), pp. 423–430 (2003)

    Google Scholar 

  20. Li, W.B., Zhong, N., Liu, C.N.: Combining multiple email filters based on multivariate statistical analysis. In: Esposito, F., Raś, Z.W., Malerba, D., Semeraro, G. (eds.) ISMIS 2006. LNCS (LNAI), vol. 4203, pp. 729–738. Springer, Heidelberg (2006)

    Chapter  Google Scholar 

  21. Li, W.B., Zhong, N., Liu, C.N.: ECPIA: An email-centric personal intelligent assistant. In: Wang, G.-Y., Peters, J.F., Skowron, A., Yao, Y. (eds.) RSKT 2006. LNCS (LNAI), vol. 4062, pp. 502–509. Springer, Heidelberg (2006)

    Chapter  Google Scholar 

  22. Liu, J.M.: Web intelligence (WI): What makes Wisdom Web? In: Proc. of the 18th International Joint Conference on Artificial Intelligence (IJCAI’03), pp. 1596–1601 (2003)

    Google Scholar 

  23. McDowell, L., Etzioni, O., Halevy, A., Henry, L.: Semantic email. In: Proc. of the Thirteenth Int. WWW Conference (WWW 2004) (2004)

    Google Scholar 

  24. Sahami, M., Dumais, S., Heckerman, D., Horvitz, E.: A Bayesian approach to filtering junk e-mail. In: Proc. of the AAAI-98 Workshop on Learning for Text Categorization, pp. 55–62 (1998)

    Google Scholar 

  25. Salton, G.: Automatic text processing: the transformation, analysis, and retrieval of information by computer (1989)

    Google Scholar 

  26. Sebastiani, F.: Machine learning in automated text categorization. ACM Computing Surveys 34(1), 1–47 (2002)

    Article  Google Scholar 

  27. Sun, D., Tran, Q.A., Duan, H., Zhang, G.: A novel method for Chinese spam detection based on one-class support vector machine. Journal of Information and Computational Science 2(1), 109–114 (2005)

    Google Scholar 

  28. Yang, Y., Pedersen, J.O.: A comparative study on feature selection in text categorization. In: Proc. of 14th International Conference on Machine Learning (ICML 1997), pp. 412–420 (1997)

    Google Scholar 

  29. Zhong, N.: Developing intelligent portals by using WI technologies. In: Proc. of Wavelet Analysis and Its Applications, and Active Media Technology (AMT 2004), pp. 555–567 (2004)

    Google Scholar 

  30. Zhong, N., Liu, J.M.: The alchemy of intelligent IT (iIT): blueprint for future of information technology. In: Intelligent Technologies for Information Analysis, Springer Monograph, pp. 1–16 (2004)

    Google Scholar 

  31. Zhong, N., Ohara, H., Iwasaki, T., Yao, Y.Y.: Using WI technology to develop intelligent enterprise portals. In: Proc. of International Workshop on Applications, Products and Services of Web-based Support Systems, pp. 83–90 (2003)

    Google Scholar 

  32. Zhong, N., Liu, J.M., Yao, Y.Y.: In search of the Wisdom Web. In: IEEE Computer, pp. 27–31 (2002)

    Google Scholar 

  33. Zhong, N., Liu, J.M., Yao, Y.Y.: Envisioning intelligent Information Technologies (iIT) from the stand-point of Web Intelligence (WI). Communications of the ACM 50(3), 89–94 (2007)

    Article  Google Scholar 

  34. Zhou, Z.H., Liu, X.Y.: Training cost-sensitive neural networks with methods addressing the class imbalance problem. IEEE Transaction on Knowledge and Data Engineering 18(1), 63–77 (2005)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Ning Zhong Jiming Liu Yiyu Yao Jinglong Wu Shengfu Lu Kuncheng Li

Rights and permissions

Reprints and permissions

Copyright information

© 2007 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Li, W., Zhong, N., Yao, Y.Y., Liu, J., Liu, C. (2007). Spam Filtering and Email-Mediated Applications. In: Zhong, N., Liu, J., Yao, Y., Wu, J., Lu, S., Li, K. (eds) Web Intelligence Meets Brain Informatics. WImBI 2006. Lecture Notes in Computer Science(), vol 4845. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-77028-2_23

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-77028-2_23

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-77027-5

  • Online ISBN: 978-3-540-77028-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics