Abstract
Collaborative Filtering (CF) is a popular technique employed by Recommender Systems, a term used to describe intelligent methods that generate personalized recommendations. The most common and accurate approaches to CF are based on latent factor models. Latent factor models can tackle two fundamental problems of CF, data sparsity and scalability and have received considerable attention in recent literature. In this work, we present an optimal scaling approach to address both of these problems using Categorical Principal Component Analysis for the low-rank approximation of the user-item ratings matrix, followed by a neighborhood formation step. The optimal scaling approach has the advantage that it can be easily extended to the case when there are missing data and restrictions for ordinal and numerical variables can be easily imposed. We considered different measurement levels for the user ratings on items, starting with a multiple nominal and consecutively applying nominal, ordinal and numeric levels. Experiments were executed on the MovieLens dataset, aiming to evaluate the aforementioned options in terms of accuracy. Results indicated that a combined approach (multiple nominal measurement level, ‘‘passive’’ missing data strategy) clearly outperformed the other tested options.
Chapter PDF
Similar content being viewed by others
Keywords
References
Sarwar, B.M., Karypis, G., Konstan, J.A., Riedl, J.T.: Application of dimensionality reduction in recommender systems - a case study. In: ACM WebKDD 2000 Web Mining for E-Commerce Workshop, pp. 82–90 (2000)
Goldberg, K., Roeder, T., Gupta, D., Perkins, C.: Eigentaste: A constant time collaborative filtering algorithm. Information Retrieval Journal 4, 133–151 (2001)
Hofmann, T.: Latent semantic models for collaborative filtering. ACM Transactions on Information Systems 22(1), 89–115 (2004)
Salakhutdinov, S., Mnih, A.: Probabilistic matrix factorization. In: Platt, J., Koller, D., Singer, Y., Roweis, S. (eds.) Advances in Neural Information Processing Systems, vol. 20, pp. 1257–1264. MIT Press, Cambridge (2008)
Bell, M., Koren, Y.: Scalable collaborative filtering with jointly derived neighborhood interpolation weights. In: Proceedings of 2007 Seventh IEEE International Conference on Data Mining (ICDM), pp. 43–52 (2007)
Tacacs, G., Pilaszy, I., Nemeth, B., Tikk, D.: Scalable collaborative filtering approaches for large recommender systems. The Journal of Machine Learning Research 10, 623–656 (2009)
Kim, D., Yum, B.J.: Collaborative filtering based on iterative principal component analysis. Expert Systems with Applications 28(4), 823–830 (2005)
Paterek, A.: Improving regularized singular value decomposition for collaborative filtering. In: Proceedings of 13th ACM International Conference on Knowledge Discovery and Data Mining (KDD 2007), San Jose, CA, USA, pp. 39–42 (2007)
de Leeuw, J.: Nonlinear principal component analysis and related techniques. In: Greenacre, M., Blasius, J. (eds.) Multiple Correspondence Analysis and Related Techniques, pp. 107–133. Chapman & Hall, Boca Raton (2006)
Costantini, P., Linting, M., Porzio, G.: Mining performance data through nonlinear pca with optimal scaling. Applied Stochastical Models in Business and Industry 26, 85–101 (2010)
Meulman, J., van der Kooij, A., Heiser, W.: Principal components analysis with nonlinear optimal scaling transformations for ordinal and nominal data. In: Kaplan, D. (ed.) Handbook of Quantitative Methods in the Social Sciences, pp. 49–70. Sage Publications, Newbury Park (2004)
Michailidis, G., de Leeuw, J.: The gifi system of descriptive multivariate analysis. Statistical Science 13(4), 307–336 (1998)
de Leeuw, J., Patrick, M.: Gifi methods for optimal scaling in r: The package homals. Journal of Statistical Software 31(4), 1–21 (2009)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 IFIP
About this paper
Cite this paper
Markos, A.I., Vozalis, M.G., Margaritis, K.G. (2010). An Optimal Scaling Approach to Collaborative Filtering Using Categorical Principal Component Analysis and Neighborhood Formation. In: Papadopoulos, H., Andreou, A.S., Bramer, M. (eds) Artificial Intelligence Applications and Innovations. AIAI 2010. IFIP Advances in Information and Communication Technology, vol 339. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-16239-8_6
Download citation
DOI: https://doi.org/10.1007/978-3-642-16239-8_6
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-16238-1
Online ISBN: 978-3-642-16239-8
eBook Packages: Computer ScienceComputer Science (R0)