Content-Based Recommender Systems



The collaborative systems discussed in the previous chapters use the correlations in the ratings patterns across users to make recommendations. On the other hand, these methods do not use item attributes for computing predictions. This would seem rather wasteful; after all, if John likes the futuristic science fiction movie Terminator, then there is a very good chance that he might like a movie from a similar genre, such as Aliens. In such cases, the ratings of other users may not be required to make meaningful recommendations.


Recommender System User Profile Gini Index Recommendation Process Implicit Feedback 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


  1. [5]
    G. Adomavicius, and A. Tuzhilin. Toward the next generation of recommender systems: A survey of the state-of-the-art and possible extensions. IEEE Transactions on Knowledge and Data Engineering, 17(6), pp. 734–749, 2005.CrossRefGoogle Scholar
  2. [18]
    C. Aggarwal. Data classification: algorithms and applications. CRC Press, 2014.Google Scholar
  3. [21]
    C. Aggarwal and C. Zhai. A survey of text classification algorithms. Mining Text Data, Springer, 2012.Google Scholar
  4. [22]
    C. Aggarwal. Data mining: the textbook. Springer, New York, 2015.Google Scholar
  5. [41]
    J. Ahn, P. Brusilovsky, J. Grady, D. He, and S. Syn. Open user profiles for adaptive news systems: help or harm? World Wide Web Conference, pp. 11–20, 2007.Google Scholar
  6. [50]
    A. Ansari, S. Essegaier, and R. Kohli. Internet recommendation systems. Journal of Marketing Research, 37(3), pp. 363–375, 2000.CrossRefGoogle Scholar
  7. [55]
    F. Asnicar and C. Tasso. IfWeb: a prototype of user model-based intelligent agent for document filtering and navigation in the world wide web. International Conference on User Modeling, pp. 3–12, 1997.Google Scholar
  8. [60]
    M. Balabanovic, and Y. Shoham. Fab: content-based, collaborative recommendation. Communications of the ACM, 40(3), pp. 66–72, 1997.CrossRefGoogle Scholar
  9. [82]
    D. Billsus and M. Pazzani. Learning collaborative information filters. ICML Conference, pp. 46–54, 1998.Google Scholar
  10. [83]
    D. Billsus and M. Pazzani. Learning probabilistic user models. International Conference on User Modeling, Workshop on Machine Learning for User Modeling, 1997.Google Scholar
  11. [84]
    D. Billsus and M. Pazzani. A hybrid user model for news story classification. International Conference on User Modeling, 1999.Google Scholar
  12. [85]
    D. Billsus and M. Pazzani. User modeling for adaptive news access. User Modeling and User-Adapted Interaction, 10(2–3), pp. 147–180, 2000.CrossRefGoogle Scholar
  13. [86]
    C. M. Bishop. Pattern recognition and machine learning. Springer, 2007.Google Scholar
  14. [87]
    C. M. Bishop. Neural networks for pattern recognition. Oxford University Press, 1995.Google Scholar
  15. [91]
    K. Bollacker, S. Lawrence, and C. L. Giles. CiteSeer: An autonomous web agent for automatic retrieval and identification of interesting publications. International Conference on Autonomous Agents, pp. 116–123, 1998.Google Scholar
  16. [114]
    C. Burges. A tutorial on support vector machines for pattern recognition. Data mining and knowledge discovery, 2(2), pp. 121–167, 1998.CrossRefGoogle Scholar
  17. [117]
    R. Burke. Hybrid recommender systems: Survey and experiments. User Modeling and User-adapted Interaction, 12(4), pp. 331–370, 2002.CrossRefzbMATHGoogle Scholar
  18. [126]
    D. Cai, S. Yu, J. Wen, and W. Y. Ma. Extracting content structure for web pages based on visual representation. Web Technologies and Applications, pp. 406–417, 2003.Google Scholar
  19. [138]
    O. Celma, M. Ramirez, and P. Herrera. Foafing the music: A music recommendation system based on RSS feeds and user preferences. International Conference on Music Information Retrieval, pp. 464–467, 2005.Google Scholar
  20. [139]
    O. Celma, and X. Serra. FOAFing the music: Bridging the semantic gap in music recommendation. Web Semantics: Science, Services and Agents on the World Wide Web, 6(4), pp. 250–256, 2008.CrossRefGoogle Scholar
  21. [144]
    S. Chakrabarti. Mining the Web: Discovering knowledge from hypertext data. Morgan Kaufmann, 2003.Google Scholar
  22. [150]
    L. Chen, and K. Sycara. WebMate: a personal agent for browsing and searching. International conference on Autonomous agents, pp. 9–13, 1998.Google Scholar
  23. [163]
    W. Cohen, R. Schapire and Y. Singer. Learning to order things. Advances in Neural Information Processing Systems, pp. 451–457, 2007.Google Scholar
  24. [164]
    W. Cohen. Learning rules that classify e-mail. AAAI symposium on machine learning in information access. pp. 18–25, 1996.Google Scholar
  25. [165]
    W. Cohen. Fast effective rule induction. ICML Conference, pp. 115–123, 1995.Google Scholar
  26. [174]
    A. Csomai and R. Mihalcea. Linking documents to encyclopedic knowledge. IEEE Intelligent Systems, 23(5), pp. 34–41, 2008.CrossRefGoogle Scholar
  27. [178]
    M. De Gemmis, P. Lops, and G. Semeraro. A content-collaborative recommender that exploits WordNet-based user profiles for neighborhood formation. User Modeling and User-Adapted Interaction, 17(3), pp. 217–255, 2007.CrossRefGoogle Scholar
  28. [179]
    M. De Gemmis, P. Lops, G. Semeraro and P. Basile. Integrating tags in a semantic content-based recommender. Proceedings of the ACM Conference on Recommender Systems, pp. 163–170, 2008.Google Scholar
  29. [210]
    E. Gabrilovich and S. Markovitch. Computing semantic relatedness using wikipedia-based explicit semantic analysis. IJCAI Conference, pp. 1606–1611, 2007.Google Scholar
  30. [211]
    E. Gabrilovich, and S. Markovitch. Overcoming the brittleness bottleneck using Wikipedia: Enhancing text categorization with encyclopedic knowledge. AAAI Conference, pp. 1301–1306, 2006.Google Scholar
  31. [242]
    T. Hastie, R. Tibshirani, and J. Friedman. The elements of statistical learning. Springer, 2009.Google Scholar
  32. [283]
    T. Joachims. Training linear SVMs in linear time. ACM KDD Conference, pp. 217–226, 2006.Google Scholar
  33. [341]
    J. Lees-Miller, F. Anderson, B. Hoehn, and R. Greiner. Does Wikipedia information help Netflix predictions?. Machine Learning and Applications, pp. 337–343, 2008.Google Scholar
  34. [356]
    H. Lieberman. Letizia: An agent that assists Web browsing, IJCAI, pp. 924–929, 1995.Google Scholar
  35. [364]
    B. Liu. Web data mining: exploring hyperlinks, contents, and usage data. Springer, New York, 2007.Google Scholar
  36. [376]
    P. Lops, M. de Gemmis, and G. Semeraro. Content-based recommender systems: state of the art and trends. Recommender Systems Handbook, Springer, pp. 73–105, 2011.Google Scholar
  37. [391]
    H. Mak, I. Koprinska, and J. Poon. Intimate: A web-based movie recommender using text categorization. International Conference on Web Intelligence, pp. 602–605, 2003.Google Scholar
  38. [392]
    B. Magnini, and C. Strapparava. Improving user modelling with content-based techniques. International Conference on User Modeling, pp. 74–83, 2001.Google Scholar
  39. [400]
    C. Manning, P. Raghavan, and H. Schutze. Introduction to information retrieval. Cambridge University Press, Cambridge, 2008.Google Scholar
  40. [418]
    S. McNee, J. Riedl, and J. Konstan. Being accurate is not enough: how accuracy metrics have hurt recommender systems. SIGCHI Conference, pp. 1097–1101, 2006.Google Scholar
  41. [436]
    T. M. Mitchell. Machine learning. McGraw Hill International Edition, 1997.Google Scholar
  42. [438]
    D. Mladenic. Machine learning used by Personal WebWatcher. Proceedings of the ACAI-99 Workshop on Machine Learning and Intelligent Agents, 1999.Google Scholar
  43. [439]
    D. Mladenic. Text learning and related intelligent agents: A survey. IEEE Intelligent Systems, 14(4), pp. 44–54, 1999.CrossRefGoogle Scholar
  44. [448]
    R. J. Mooney and L. Roy. Content-based book recommending using learning for text categorization. ACM Conference on Digital libraries, pp. 195–204, 2000.Google Scholar
  45. [476]
    M. Pazzani and D. Billsus. Learning and revising user profiles: The identification of interesting Web sites. Machine learning, 27(3), pp. 313–331, 1997.CrossRefGoogle Scholar
  46. [477]
    M. Pazzani and D. Billsus. Content-based recommendation systems. Lecture Notes in Computer Science, Springer, 4321, pp. 325–341, 2007.Google Scholar
  47. [478]
    M. Pazzani, J. Muramatsu, and D. Billsus. Syskill and Webert: Identifying interesting Web sites. AAAI Conference, pp. 54–61, 1996.Google Scholar
  48. [511]
    J. Rocchio. Relevance feedback information retrieval. The SMART retrieval system – experiments in automated document processing, pp. 313–323, Prentice-Hall, Englewood Cliffs, NJ, 1971.Google Scholar
  49. [520]
    J. Salter, and N. Antonopoulos. CinemaScreen recommender agent: combining collaborative and content-based filtering. Intelligent Systems, 21(1), pp. 35–41, 2006.CrossRefGoogle Scholar
  50. [543]
    B. Sheth and P. Maes. Evolving agents for personalized information filtering. Ninth Conference on Artificial Intelligence for Applications, pp. 345–352, 1993.Google Scholar
  51. [561]
    H. Sorensen and M. McElligott. PSUN: a profiling system for Usenet news. CIKM Intelligent Information Agents Workshop, 1995.Google Scholar
  52. [599]
    E. G. Toms. Serendipitous information retrieval. DELOS Workshop: Information Seeking, Searching and Querying in Digital Libraries, 2000.Google Scholar
  53. [636]
    K. L. Wu, C. C. Aggarwal, and P. S. Yu. Personalization with dynamic profiler. International Workshop on Advanced Issues of E-Commerce and Web-Based Information Systems, pp. 12–20, 2001. Also available online as IBM Research Report, RC22004, 2001. Search interface at
  54. [662]
    Y. Zhai, and B. Liu. Web data extraction based on partial tree alignment. World Wide Web Conference, pp. 76–85, 2005.Google Scholar
  55. [693]
  56. [699]
  57. [703]
  58. [710]
  59. [712]

Copyright information

© Springer International Publishing Switzerland 2016

Authors and Affiliations

  1. 1.IBM T.J. Watson Research CenterYorktown HeightsUSA

Personalised recommendations