Leveraging clustering to improve collaborative filtering
- 319 Downloads
Extensive work on matrix factorization (MF) techniques have been done recently as they provide accurate rating prediction models in recommendation systems. Additional extensions, such as neighbour-aware models, have been shown to improve rating prediction further. However, these models often suffer from a long computation time. In this paper, we propose a novel method that applies clustering algorithms to the latent vectors of users and items. Our method can capture the common interests between the cluster of users and the cluster of items in a latent space. A matrix factorization technique is then applied to this cluster-level rating matrix to predict the future cluster-level interests. We then aggregate the traditional user-item rating predictions with our cluster-level rating predictions to improve the rating prediction accuracy. Our method is a general “wrapper” that can be applied to all collaborative filtering methods. In our experiments, we show that our new approach, when applied to a variety of existing matrix factorization techniques, improves their rating predictions and also results in better rating predictions for cold-start users. Above all, in this paper we show that better quality and more quantity of these clusters achieve a better rating prediction accuracy.
KeywordsCollaborative filtering Recommendation system Matrix factorization
This work was made possible by the facilities of the Shared Hierarchical Academic Research Computing Network (SHARCNET: www.sharcnet.ca) and Compute/Calcul Canada. The authors would like to thank the reviewers of the 2013 ACM Recommender System conference (RecSys’13) for their valuable comments.
- Rapidminer (2016). http://www.rapidminer.com. Accessed.
- Weka 3: Data mining software in java (2016). http://www.cs.waikato.ac.nz/ml/weka/. Accessed.
- Balijepally, V., Mangalaraj G., Iyengar K. (2011) Are we wielding this hammer correctly? A reflective review of the application of cluster analysis in information systems research. Journal AIS 12 (5) [http://aisel.aisnet.org/jais/vol12/iss5/1].
- Beutel, A., Murray K., Faloutsos C., Smola A.J. 2014. Cobafi: Collaborative bayesian filtering. ACM, NY, USA. doi: 10.1145/2566486.2568040.
- Bishop, C.M. (2006) Pattern recognition and machine learning (information science and statistics). Springer-Verlag New York, Inc., NJ, USA.Google Scholar
- Connor, M., Herlocker J. (1999) Clustering items for collaborative filtering. Proceedings of the ACM SIGIR Workshop on Recommender Systems, Berkeley, CA.Google Scholar
- Desrosiers, C., Karypis G. (2011) A comprehensive survey of neighborhood-based recommendation methods. Recommender Systems Handbook. In: Ricci F., Rokach L., Shapira B., Kantor P.B. (eds), 107–144.. Springer, US. doi: 10.1007/978-0-387-85820-3_4.
- George, T., Merugu S. (2005) A scalable collaborative filtering framework based on co-clustering. Proceedings of the Fifth IEEE International Conference on Data Mining, ICDM ’05, 625–628.. IEEE Computer Society, DC, USA. doi: 10.1109/ICDM.2005.14.
- Gueye, M., Abdessalem T., Naacke H. (2011) A cluster-based matrix-factorization for online integration of new ratings. Journées de Bases de Données Avancées (BDA) , 1–18.Google Scholar
- Herlocker, J.L., Konstan J.A., Borchers A., Riedl J. (1999) An algorithmic framework for performing collaborative filtering. Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval, SIGIR ’99, 230–237.. ACM, NY, USA. doi: 10.1145/312624.312682 10.1145/312624.312682.
- Jamali, M., Huang T., Ester M. (2011) A generalized stochastic block model for recommendation in social rating networks. Proceedings of the Fifth ACM Conference on Recommender Systems, RecSys ’11, 53–60.. ACM, NY, USA. doi: 10.1145/2043932.2043946.
- Koren, Y. (2008) Factorization meets the neighborhood: a multifaceted collaborative filtering model. Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, KDD ’08, 426–434.. ACM, NY, USA. doi: 10.1145/1401890.1401944.
- Koren, Y., Bell R. (2011) Advances in collaborative filtering. Recommender Systems Handbook. In: Ricci F., Rokach L., Shapira B., Kantor P.B. (eds), 145–186.. Springer, US. doi: 10.1007/978-0-387-85820-3_5.
- Mirbakhsh, N., Ling C.X. (2013) Clustering-based factorized collaborative filtering. Proceedings of the 7th ACM conference on Recommender systems, RecSys ’13, 315–318.. ACM, NY, USA. doi: 10.1145/2507157.2507233.
- Mirbakhsh, N., Ling C.X. (2015) Improving top-n recommendation for cold-start users via cross-domain information (accepted to publish) the Transactions on Knowledge Discovery from Data (TKDD).Google Scholar
- Rendle, S., Freudenthaler C., Gantner Z., Schmidt-Thieme L. (2009) Bpr: Bayesian personalized ranking from implicit feedback. Proceedings of the Twenty-Fifth Conference on Uncertainty in Artificial Intelligence, UAI ’09, 452–461.. AUAI Press, Virginia, US [http://dl.acm.org/citation.cfm?id=1795114.1795167].
- Steck, H. (2010) Training and testing of recommender systems on data missing not at random. Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining, KDD ’10, 713–722.. ACM, NY, USA. doi: 10.1145/1835804.1835895.
- Steck, H. (2010) Training and testing of recommender systems on data missing not at random. Proceedings of the 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD ’10, 713–722.. ACM, NY, USA. doi: 10.1145/1835804.1835895.
- Töscher, A., Jahrer M., Legenstein R. (2008) Improved neighborhood-based algorithms for large-scale recommender systems. Proceedings of the 2nd KDD Workshop on Large-Scale Recommender Systems and the Netflix Prize Competition, NETFLIX ’08, 4:1–4:6.. ACM, NY, USA. doi: 10.1145/1722149.1722153.
- Witten, I.H., Frank E. (2005) Data mining: Practical machine learning tools and techniques, second edition (morgan kaufmann series in data management systems). Morgan Kaufmann Publishers Inc., CA, USA.Google Scholar
- Xu, B., Bu J., Chen C., Cai D. (2012) An exploration of improving collaborative recommender systems via user-item subgroups. Proceedings of the 21st international conference on World Wide Web, WWW ’12, 21–30.. ACM, NY, USA. doi: 10.1145/2187836.2187840.