Performance Analysis of Non-negative Matrix Factorization Methods on TCGA Data

  • Mi-Xiao Hou
  • Jin-Xing LiuEmail author
  • Junliang ShangEmail author
  • Ying-Lian Gao
  • Xiang-Zhen Kong
  • Ling-Yun Dai
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 10955)


Non-negative Matrix Factorization (NMF) is recognized as one of fundamentally important and highly popular methods for clustering and feature selection, and many related methods have been proposed so far. Nevertheless, their performances, especially on real data, are still unclear due to few studies focusing on their comparison. This study aims at a assessment study of several representative methods from clustering and feature selection, including NMF, GNMF, MD-NMF, L2,1NMF, LNMF, Convex-NMF and Semi-NMF, on the data of the Cancer Genome Atlas (TCGA), which is one of current research hotspot of bioinformatics. Specifically, three data types of four cancers are either separately or integratedly decomposed as the coefficient matrices and the basis matrices by these NMF methods. The coefficient matrices are evaluated by accuracies of clustered samples and the basis matrices are assessed by p-values of selected genes. Experiment results not only show merits and limitations of compared NMF methods, which may provide guidelines for applying them and proposing novel NMF methods, but also reveal several clues for the exploration of related cancers.


Non-negative Matrix Factorization Clustering Genomic data Dimensionality reduction 



This work was supported in part by the NSFC under grant Nos. 61572284, 61502272 and 61702299.


  1. 1.
    Lee, D.D., Seung, H.S.: Learning the parts of objects by non-negative matrix factorization. Nature 401(6755), 788–791 (1999)CrossRefGoogle Scholar
  2. 2.
    Lee, D.D., Seung, H.S.: Algorithms for non-negative matrix factorization. In: Advances in Neural Information Processing systems, pp. 556–562 (2001)Google Scholar
  3. 3.
    Wang, D., Liu, J.X., Gao, Y.L. et al.: Characteristic gene selection based on robust graph regularized non-negative matrix factorization. IEEE/ACM Trans. Comput. Biol. Bioinform. 13(6), 1059–1067 (2016)CrossRefGoogle Scholar
  4. 4.
    Wang, S., Tang, J., Liu, H.: Embedded unsupervised feature selection (2015)Google Scholar
  5. 5.
    Sumanta, R., Sanghamitra, B.: A NMF based approach for integrating multiple data sources to predict HIV-1–human PPIs. BMC Bioinf. 17(1), 1–13 (2016)Google Scholar
  6. 6.
    Zhang, W., Liu, X., Chen, Y., Wu, W., Wang, W., Li, X.: Feature-derived graph regularized matrix factorization for predicting drug side effects. Neurocomputing 287, 154–162 (2018)CrossRefGoogle Scholar
  7. 7.
    Yang, Z., Michailidis, G.: A non-negative matrix factorization method for detecting modules in heterogeneous omics multi-modal data. Bioinformatics 32(1), 325–342 (2015)Google Scholar
  8. 8.
    Gao, J., Aksoy, B.A., Dogrusoz, U., Dresdner, G., Gross, B., Sumer, S.O., Sun, Y., Jacobsen, A., Sinha, R., Larsson, E.: Integrative analysis of complex cancer genomics and clinical profiles using the cBioPortal. Sci. Sign. 6(269), 2383 (2013)Google Scholar
  9. 9.
    Zhu, Y., Qiu, P., Ji, Y.: TCGA-assembler: open-source software for retrieving and processing TCGA data. Nat. Meth. 11(6), 599–600 (2014)CrossRefGoogle Scholar
  10. 10.
    Hou, M.-X., Gao, Y.-L., Liu, J.-X., Shang, J.-L., Zheng, C.-H.: Comparison of Non-negative Matrix Factorization Methods for Clustering Genomic Data. In: International Conference on Intelligent Computing, pp. 290–299 (2016)CrossRefGoogle Scholar
  11. 11.
    Cai, D., He, X., Han, J., Huang, T.S.: Graph regularized nonnegative matrix factorization for data representation. IEEE Trans. Pattern Anal. Mach. Intell. 33(8), 1548–1560 (2011)CrossRefGoogle Scholar
  12. 12.
    Guan, N., Tao, D., Luo, Z., Yuan, B.: Manifold regularized discriminative nonnegative matrix factorization with fast gradient descent. IEEE Trans. Image Process. 20(7), 2030–2048 (2011)MathSciNetCrossRefGoogle Scholar
  13. 13.
    Kong, D., Ding, C., Huang, H.: Robust nonnegative matrix factorization using l21-norm. In: Proceedings of the 20th ACM International Conference on Information and Knowledge Management, pp. 673–682 (2011)Google Scholar
  14. 14.
    Li, S.Z., Hou, X.W., Zhang, H., Cheng, Q.: Learning spatially localized, parts-based representation. In: Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR 2001, vol. 1, I-207-I-212 (2001). vol. 201Google Scholar
  15. 15.
    Ding, C., Li, T., Jordan, M.I.: Convex and semi-nonnegative matrix factorizations. IEEE Trans. Softw. Eng. 32(1), 45–55 (2010)Google Scholar
  16. 16.
    Long, X., Lu, H., Peng, Y., Li, W.: Graph regularized discriminative non-negative matrix factorization for face recognition. Multimed. Tools Appl. 72(3), 2679–2699 (2014)CrossRefGoogle Scholar
  17. 17.
    Chung, F.R.: Spectral Graph Theory. Volume 92 of CBMS Regional Conference Series in Mathematics. American Mathematical Society, Providence (1997)zbMATHGoogle Scholar
  18. 18.
    Le, L., Yu-Jin, Z.: A survey on algorithms of non-negative matrix factorization. J. Acta Electronica Sinica. 36(4), 737–743 (2008)Google Scholar
  19. 19.
    Chen, X., Gu, L., Li, S.Z., Zhang, H.-J.: Learning representative local features for face detection. In: 2001 Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR 2001, vol. 1, I-1126-I-1131 (2001). vol. 1121Google Scholar
  20. 20.
    Pascual-Montano, A., Carazo, J.M., Kochi, K., Lehmann, D., Pascual-Marqui, R.D.: Nonsmooth nonnegative matrix factorization (nsNMF). IEEE Trans. Pattern Anal. Mach. Intell. 28(3), 403–415 (2006)CrossRefGoogle Scholar
  21. 21.
    Li, Y., Alioune, N.: The non-negative matrix factorization toolbox for biological data mining. Source Code Biol. Med. 8(1), 10 (2013)CrossRefGoogle Scholar
  22. 22.
    Wang, Q., Liu, X.D.: Genes and Cholangiocarcinoma Genesis and Development. Medical Recapitulate (2012)Google Scholar
  23. 23.
    Zou, S., Li, J., Zhou, H., Frech, C., Jiang, X., Chu, J.S., Zhao, X., Li, Y., Li, Q., Wang, H.: Mutational landscape of intrahepatic cholangiocarcinoma. Nat. Commun. 5, 5696 (2014)CrossRefGoogle Scholar
  24. 24.
    Biankin, A.V., Waddell, N., Kassahn, K.S., Gingras, M.C., Muthuswamy, L.B., Johns, A.L., Miller, D.K., Wilson, P.J., Patch, A.M., Wu, J.: Pancreatic cancer genomes reveal aberrations in axon guidance pathway genes. Nature 491(7424), 399–405 (2012)CrossRefGoogle Scholar
  25. 25.
    Zhang, S., Liu, C.C., Li, W., Shen, H., Laird, P.W., Zhou, X.J.: Discovery of multi-dimensional modules by integrative analysis of cancer genomic data. Nucleic Acids Res. 40(19), 9379–9391 (2012)CrossRefGoogle Scholar

Copyright information

© Springer International Publishing AG, part of Springer Nature 2018

Authors and Affiliations

  1. 1.School of Information Science and EngineeringQufu Normal UniversityRizhaoChina
  2. 2.Library of Qufu Normal UniversityQufu Normal UniversityRizhaoChina

Personalised recommendations