Additive Regression Applied to a Large-Scale Collaborative Filtering Problem

Frank, Eibe; Hall, Mark

doi:10.1007/978-3-540-89378-3_44


Conference paper


Part of the book series: Lecture Notes in Computer Science (LNAI, volume 5360)

Abstract

The much-publicized Netflix competition has put the spotlight on the application domain of collaborative filtering and has sparked interest in machine learning algorithms that can be applied to this sort of problem. The demanding nature of the Netflix data has led to some interesting and ingenious modifications to standard learning methods in the name of efficiency and speed. Three basic methods have been applied in most approaches to the Netflix problem so far: stand-alone neighborhood-based methods, latent factor models based on singular-value decomposition, and ensembles consisting of variations of these techniques. In this paper we investigate the application of forward stage-wise additive modeling to the Netflix problem, using two regression schemes as base learners: ensembles of weighted simple linear regressors and k-means clustering, the latter being interpreted as a tool for multivariate regression in this context. Experimental results show that our methods are competitive.
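For readers unfamiliar with forward stage-wise additive modeling, the following is a minimal generic sketch under squared-error loss, not the authors' implementation. Each stage fits a base learner to the current residuals and adds a shrunken copy of its prediction to the model. The base learner here is a single simple linear regressor on the best-fitting attribute, chosen only as a stand-in; the abstract does not specify how the paper's weighted ensembles or k-means learners are built. All function names, the `shrinkage` parameter, and the dense list-of-rows input format are assumptions made for illustration.

```python
from statistics import mean


def fit_simple_linear(xs, ys):
    """Least-squares fit of y ~ a + b*x for a single attribute."""
    mx, my = mean(xs), mean(ys)
    sxx = sum((x - mx) ** 2 for x in xs)
    if sxx == 0:  # constant attribute: fall back to predicting the mean
        return my, 0.0
    b = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / sxx
    return my - b * mx, b


def fit_best_attribute(X, residuals):
    """Hypothetical base learner: the one-attribute linear fit with lowest squared error."""
    best = None
    for j in range(len(X[0])):
        col = [row[j] for row in X]
        a, b = fit_simple_linear(col, residuals)
        sse = sum((r - (a + b * x)) ** 2 for x, r in zip(col, residuals))
        if best is None or sse < best[0]:
            best = (sse, j, a, b)
    return best[1:]  # (attribute index, intercept, slope)


def additive_regression(X, y, n_stages=50, shrinkage=0.5):
    """Forward stage-wise additive modeling for squared-error loss."""
    base = mean(y)  # stage 0: predict the global mean
    residuals = [t - base for t in y]
    stages = []
    for _ in range(n_stages):
        j, a, b = fit_best_attribute(X, residuals)
        stages.append((j, a, b))
        # Subtract the shrunken stage prediction; the next stage fits what remains.
        residuals = [r - shrinkage * (a + b * row[j]) for row, r in zip(X, residuals)]
    return base, shrinkage, stages


def predict(model, row):
    """Sum the global mean and the shrunken predictions of all stages."""
    base, shrinkage, stages = model
    return base + sum(shrinkage * (a + b * row[j]) for j, a, b in stages)
```

Calling `additive_regression(X, y)` on a list of numeric rows `X` and targets `y` returns a model that `predict(model, row)` can evaluate. Collaborative filtering data such as the Netflix ratings is sparse and far larger than this dense toy format can handle, so the sketch shows only the residual-fitting loop and not how such data would be represented.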



Author information

Authors and Affiliations

Department of Computer Science, University of Waikato, Hamilton, New Zealand
Eibe Frank
Pentaho Corporation, 5950 Hazeltine National Drive, Suite 340, Orlando, FL, USA
Mark Hall


Editor information

Editors and Affiliations

School of Computer Science and Engineering, University of New South Wales, Sydney, NSW 2052, Australia
Wayne Wobcke
School of Mathematics, Statistics and Computer Science, Victoria University of Wellington, P.O. Box 600, 6140, Wellington, New Zealand
Mengjie Zhang


Cite this paper

Frank, E., Hall, M. (2008). Additive Regression Applied to a Large-Scale Collaborative Filtering Problem. In: Wobcke, W., Zhang, M. (eds) AI 2008: Advances in Artificial Intelligence. AI 2008. Lecture Notes in Computer Science, vol 5360. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-89378-3_44


DOI: https://doi.org/10.1007/978-3-540-89378-3_44
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-89377-6
Online ISBN: 978-3-540-89378-3
eBook Packages: Computer Science, Computer Science (R0)
