Matrix Factorization

Aggarwal, Charu C.

doi:10.1007/978-3-030-40344-7_8

Charu C. Aggarwal²

10k Accesses

Abstract

Just as multiplication can be generalized from scalars to matrices, the notion of factorization can also be generalized from scalars to matrices. Exact matrix factorizations need to satisfy the size and rank constraints that are imposed on matrix multiplication. For example, when an n × d matrix A is factorized into two matrices B and C (i.e., A = BC), the matrices B and C must be of sizes n × k and k × d for some constant k. For exact factorization to occur, the value of k must be equal to at least the rank of A. This is because the rank of A is at most equal to the minimum of the ranks of B and C. In practice, it is common to perform approximate factorization with much smaller values of k than the rank of A.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 49.99; Price excludes VAT (USA)

Hardcover Book: USD 64.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
This will occur under the assumption that the top-k eigenvalues of D ^TD are distinct. Tied eigenvalues result in a non-unique solution for SVD, which might sometimes result in some differences in the subspace corresponding to the smallest eigenvalue within the rank-k solution.
2.
Strictly speaking, the objective function is not defined when p _ij is 0 or 1. However, the loss is zero when p _ij → x _ij in the limit. A logistic function will never yield values of exactly 0 or 1 for p _ij.
3.
The libraries libFM and libMF are different.

References

C. Aggarwal. Machine learning for text. Springer, 2018.
Book Google Scholar
C. Aggarwal. Recommender systems: The textbook. Springer, 2016.
Book Google Scholar
I. Bayer. Fastfm: a library for factorization machines. arXiv preprint arXiv:1505.00641, 2015. https://arxiv.org/pdf/1505.00641v2.pdf
C. Ding, T. Li, and W. Peng. On the equivalence between non-negative matrix factorization and probabilistic latent semantic indexing. Computational Statistics and Data Analysis, 52(8), pp. 3913–3927, 2008.
Article MathSciNet Google Scholar
C. Freudenthaler, L. Schmidt-Thieme, and S. Rendle. Factorization machines: Factorized polynomial regression models. GPSDAA, 2011.
Google Scholar
E. Gaussier and C. Goutte. Relation between PLSA and NMF and implications. ACM SIGIR Conference, pp. 601–602, 2005.
Google Scholar
A. Grover and J. Leskovec. node2vec: Scalable feature learning for networks. ACM KDD Conference, pp. 855–864, 2016.
Google Scholar
T. Hofmann. Probabilistic latent semantic indexing. ACM SIGIR Conference, pp. 50–57, 1999.
Google Scholar
Y. Hu, Y. Koren, and C. Volinsky. Collaborative filtering for implicit feedback datasets. IEEE ICDM, pp. 263–272, 2008.
Google Scholar
P. Jain, P. Netrapalli, and S. Sanghavi. Low-rank matrix completion using alternating minimization. ACM Symposium on Theory of Computing, pp. 665–674, 2013.
Google Scholar
C. Johnson. Logistic matrix factorization for implicit feedback data. NIPS Conference, 2014.
Google Scholar
Y. Koren, R. Bell, and C. Volinsky. Matrix factorization techniques for recommender systems. Computer, 8, pp. 30–37, 2009.
Article Google Scholar
A. Langville, C. Meyer, R. Albright, J. Cox, and D. Duling. Initializations for the nonnegative matrix factorization. ACM KDD Conference, pp. 23–26, 2006.
Google Scholar
D. Lay, S. Lay, and J. McDonald. Linear Algebra and its applications, Pearson, 2012.
Google Scholar
D. Lee and H. Seung. Algorithms for non-negative matrix factorization. Advances in Neural Information Processing Systems, pp. 556–562, 2001.
Google Scholar
P. McCullagh. Regression models for ordinal data. Journal of the royal statistical society. Series B (Methodological), pp. 109–142, 1980.
Google Scholar
T. Mikolov, K. Chen, G. Corrado, and J. Dean. Efficient estimation of word representations in vector space. arXiv:1301.3781, 2013. https://arxiv.org/abs/1301.3781
T. Mikolov, I. Sutskever, K. Chen, G. Corrado, and J. Dean. Distributed representations of words and phrases and their compositionality. NIPS Conference, pp. 3111–3119, 2013.
Google Scholar
J. Pennington, R. Socher, and C. Manning. Glove: Global Vectors for Word Representation. EMNLP, pp. 1532–1543, 2014.
Google Scholar
B. Perozzi, R. Al-Rfou, and S. Skiena. Deepwalk: Online learning of social representations. ACM KDD Conference, pp. 701–710, 2014.
Google Scholar
S. Rendle. Factorization machines. IEEE ICDM Conference, pp. 995–100, 2010.
Google Scholar
S. Rendle. Factorization machines with libfm. ACM Transactions on Intelligent Systems and Technology, 3(3), 57, 2012.
Google Scholar
A. Singh and G. Gordon. A unified view of matrix factorization models. Joint European Conference on Machine Learning and Knowledge Discovery in Databases, pp. 358–373, 2008.
Google Scholar
N. Srebro, J. Rennie, and T. Jaakkola. Maximum-margin matrix factorization. Advances in neural information processing systems, pp. 1329–1336, 2004.
Google Scholar
G. Strang. An introduction to linear algebra, Fifth Edition. Wellseley-Cambridge Press, 2016.
MATH Google Scholar
G. Strang. Linear algebra and its applications, Fourth Edition. Brooks Cole, 2011.
Google Scholar
M. Udell, C. Horn, R. Zadeh, and S. Boyd. Generalized low rank models. Foundations and Trends in Machine Learning, 9(1), pp. 1–118, 2016. https://github.com/madeleineudell/LowRankModels.jl
Article Google Scholar
H. Wendland. Numerical linear algebra: An introduction. Cambridge University Press, 2018.
MATH Google Scholar
H. Yu, C. Hsieh, S. Si, and I. S. Dhillon. Scalable coordinate descent approaches to parallel matrix factorization for recommender systems. IEEE ICDM, pp. 765–774, 2012.
Google Scholar
Y. Zhou, D. Wilkinson, R. Schreiber, and R. Pan. Large-scale parallel collaborative filtering for the Netflix prize. Algorithmic Aspects in Information and Management, pp. 337–348, 2008.
Google Scholar
https://www.csie.ntu.edu.tw/~cjlin/libmf/

Download references

Author information

Authors and Affiliations

IBM T.J. Watson Research Center, Yorktown Heights, NY, USA
Charu C. Aggarwal (Distinguished Research Staff Member)

Authors

Charu C. Aggarwal
View author publications
You can also search for this author in PubMed Google Scholar

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Aggarwal, C.C. (2020). Matrix Factorization. In: Linear Algebra and Optimization for Machine Learning. Springer, Cham. https://doi.org/10.1007/978-3-030-40344-7_8

Download citation

DOI: https://doi.org/10.1007/978-3-030-40344-7_8
Published: 13 May 2020
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-40343-0
Online ISBN: 978-3-030-40344-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics