An Evolutionary Matrix Factorization Approach for Missing Value Prediction

  • Sujoy Chatterjee
  • Anirban Mukhopadhyay
Part of the Studies in Computational Intelligence book series (SCI, volume 687)


Sparseness of data is a common problem in many fields such as data mining and pattern recognition. During the last decade, collecting opinions from people has been established to be an useful tool for solving different real-life problems. In crowdsourcing systems, prediction based on very few observations leads to complete disregard for the inherent latent features of the crowd workers corresponding to the items. Similarly in bioinformatics, sparsity has a major negative impact in finding relevant gene from gene expression data. Although this problem is being studied over the last decade, there are some benefits and pitfalls of the different proposed approaches. In this article, we have proposed a genetic algorithm-based matrix factorization technique to estimate the missing entries in the rating matrix of recommender systems. We have created four synthetic datasets and used two real-life gene expression datasets to show the efficacy of the proposed method in comparison with the other state-of-the-art approaches.


Matrix factorization Sparsity Genetic algorithm Judgment analysis 


  1. 1.
    Adomavicius, G., Tuzhilin, A.: Towards the next generation of recommender systems: a survey of the state-of-the-art and possible extensions. IEEE Trans. Knowl. Data Eng. 6, 734–749 (2005)CrossRefGoogle Scholar
  2. 2.
    Berry, M., Browne, M., Langville, A., Pauca, V., Plemmons, R.: Algorithms and applications for approximate nonnegative matrix factorization. Comput. Stat. Data Anal. 52(2), 155–173 (2007)MathSciNetCrossRefzbMATHGoogle Scholar
  3. 3.
    Brabham, D.C.: Detecting stable clusters using principal component analysis. Methods Mol. Biol. 224(10) (2013)Google Scholar
  4. 4.
    Brunet, J., Tamayo, P., Golub, T., Mesirov, J.: Metagenes and molecular pattern discovery using matrix factorization. Proc. Nat. Acad. Sci. 101, 1464–1469 (2004)CrossRefGoogle Scholar
  5. 5.
    Chatterjee, S., Mukhopadhyay, A.: A genetic algorithm-based matrix factorization in missing value prediction. In: Proceedings of the 1st International Conference on Computational Intelligence, Communications, and Business Analytics. Springer, CCIS, Kolkata, India (2017)Google Scholar
  6. 6.
    Christidis, K., Mentzas, G.: A topic-based recommender system for electronic marketplace platforms. Expert Syst. Appl. 40(11), 4370–4379 (2013)CrossRefGoogle Scholar
  7. 7.
    Cichocki, A., Zdunek, R., Amari, S.I.: Hierarchical als algorithms for nonnegative matrix and 3d tensor factorization. In: Lecture Notes in Computer Science, vol. 4666, pp. 169–176. Springer (2007)Google Scholar
  8. 8.
    Demartini, G., Difallah, D.E., Mauroax, C.: Zencrowd: leveraging probabilistic reasoning and crowdsourcing techniques for large scale entity linking. In: Proceedings of the 21st International Conference on World Wide Web, pp. 469–478. Lyon, France (2012)Google Scholar
  9. 9.
    Ding, C.H.Q., Li, T., Jordan, M.I.: Convex and semi-nonnegative matrix factorizations. IEEE Trans. Pattern Anal. Mach. Intell. 32(1), 45–55 (2010)CrossRefGoogle Scholar
  10. 10.
    Friedman, A., Berkovsky, S., Kaafar, M.A.: A differential privacy framework for matrix factorization recommender systems. User Model. User-Adap. Interact. 26(5), 425–458 (2016)CrossRefGoogle Scholar
  11. 11.
    Kim, J., He, Y., Park, H.: Algorithms for nonnegative matrix and tensor factorizations: a unified view based on block coordinate descent framework. J. Global Opt. 58(2), 285–319 (2014)MathSciNetCrossRefzbMATHGoogle Scholar
  12. 12.
    Kim, J., Park, H.: Fast nonnegative matrix factorization: an active-set-like method and comparisons. SIAM J. Sci. Comput. (SISC) 33(6), 3261–3281 (2011)MathSciNetCrossRefzbMATHGoogle Scholar
  13. 13.
    Kittur, A., Nickerson, J.V., Bernstein, M., Gerber, E., Shaw, A., Zimmerman, J., Lease, M., Horton, J.: The future of crowd work. In: Proceedings of the CSCW, pp. 1301–1318 (2013)Google Scholar
  14. 14.
    Lee, D., Seung, H.: Learning the parts of objects by non-negative matrix factorization. Nature 401, 788–791 (1999)CrossRefzbMATHGoogle Scholar
  15. 15.
    Lee, D., Seung, H.: Algorithms for non-negative matrix factorization. Proc. Adv. Neural Inf. Process. Syst. 13, 556–562 (2001)Google Scholar
  16. 16.
    Lin, C.J.: Projected gradient methods for nonnegative matrix factorization. Neural Comput. 19(10), 2756–2779 (2007)MathSciNetCrossRefzbMATHGoogle Scholar
  17. 17.
    Liu, J., Liu, J., Wonka, P., Ye, J.: Sparse non-negative tensor factorization using columnwise coordinate descent. Pattern Recognit. 45(1), 649–656 (2012)CrossRefzbMATHGoogle Scholar
  18. 18.
    Luo, X., Liu, H., Gou, G., Xia, Y., Zhu, Q.: A parallel matrix factorization based recommender by alternating stochastic gradient decent. Eng. Appl. Artif. Intell. 25(7), 1403–1412 (2012)CrossRefGoogle Scholar
  19. 19.
    Maulik, U., Bandyopadhyay, S.: Genetic algorithm based clustering technique. Pattern Recognit. 32, 1455–1465 (2000)CrossRefGoogle Scholar
  20. 20.
    Mukhopadhyay, A., Maulik, U., Bandyopadhyay, S.: Multiobjective genetic fuzzy clustering of categorical attributes. In: Proceedings 10th International Conference on Information Technology, pp. 74–79 (2007)Google Scholar
  21. 21.
    Mukhopadhyay, A., Maulik, U., Bandyopadhyay, S.: A survey of multiobjective evolutionary clustering. ACM Comput. Surv. 47(4), 61:1–61:46 (2015)CrossRefGoogle Scholar
  22. 22.
    Paatero, P.: Least squares formulation of robust non-negative factor analysis. Chem. Intell. Lab. Syst. 37(1), 23–35 (1997)CrossRefGoogle Scholar
  23. 23.
    Paatero, P., Tapper, U.: Positive matrix factorization: a non-negative factor model with optimal utilization of error estimates of data value. Environmetrics 5(2), 111–126 (1994)CrossRefGoogle Scholar
  24. 24.
    Phan, A.H., Tichavský, P., Cichocki, A.: Fast damped gauss-newton algorithm for sparse and nonnegative tensor factorization. In: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP, pp. 1988–1991. Prague, Czech Republic (2011)Google Scholar
  25. 25.
    Ross, J., Irani, L., Silberman, M., Zaldivar, A., Tomilson, B.: Who are the crowdworkers? shifting demographics in mechanical turk. In: Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, pp. 2863–2872 (2010)Google Scholar
  26. 26.
    Xu, W., Liu, X., Gong, Y.: Document clustering based on non-negative matrix factorization. In: Proceedings of the 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, vol. 101, pp. 267–273. ACM Press (2003)Google Scholar
  27. 27.
    Yoo, J., Choi, S.: Orthogonal nonnegative matrix tri-factorization for co-clustering: multiplicative updates on stiefel manifolds. Inf. Process. Manage. 46(5), 559–570 (2010)CrossRefGoogle Scholar
  28. 28.
    Zafeiriou, S.: Algorithms for Nonnegative Tensor Factorization, pp. 105–124. Springer, London (2009)CrossRefGoogle Scholar

Copyright information

© Springer Nature Singapore Pte Ltd. 2019

Authors and Affiliations

  1. 1.Department of Computer Science & EngineeringUniversity of KalyaniNadiaIndia

Personalised recommendations