Abstract
Non-negative Matrix Factorization (NMF) is recognized as one of fundamentally important and highly popular methods for clustering and feature selection, and many related methods have been proposed so far. Nevertheless, their performances, especially on real data, are still unclear due to few studies focusing on their comparison. This study aims at a assessment study of several representative methods from clustering and feature selection, including NMF, GNMF, MD-NMF, L2,1NMF, LNMF, Convex-NMF and Semi-NMF, on the data of the Cancer Genome Atlas (TCGA), which is one of current research hotspot of bioinformatics. Specifically, three data types of four cancers are either separately or integratedly decomposed as the coefficient matrices and the basis matrices by these NMF methods. The coefficient matrices are evaluated by accuracies of clustered samples and the basis matrices are assessed by p-values of selected genes. Experiment results not only show merits and limitations of compared NMF methods, which may provide guidelines for applying them and proposing novel NMF methods, but also reveal several clues for the exploration of related cancers.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Lee, D.D., Seung, H.S.: Learning the parts of objects by non-negative matrix factorization. Nature 401(6755), 788–791 (1999)
Lee, D.D., Seung, H.S.: Algorithms for non-negative matrix factorization. In: Advances in Neural Information Processing systems, pp. 556–562 (2001)
Wang, D., Liu, J.X., Gao, Y.L. et al.: Characteristic gene selection based on robust graph regularized non-negative matrix factorization. IEEE/ACM Trans. Comput. Biol. Bioinform. 13(6), 1059–1067 (2016)
Wang, S., Tang, J., Liu, H.: Embedded unsupervised feature selection (2015)
Sumanta, R., Sanghamitra, B.: A NMF based approach for integrating multiple data sources to predict HIV-1–human PPIs. BMC Bioinf. 17(1), 1–13 (2016)
Zhang, W., Liu, X., Chen, Y., Wu, W., Wang, W., Li, X.: Feature-derived graph regularized matrix factorization for predicting drug side effects. Neurocomputing 287, 154–162 (2018)
Yang, Z., Michailidis, G.: A non-negative matrix factorization method for detecting modules in heterogeneous omics multi-modal data. Bioinformatics 32(1), 325–342 (2015)
Gao, J., Aksoy, B.A., Dogrusoz, U., Dresdner, G., Gross, B., Sumer, S.O., Sun, Y., Jacobsen, A., Sinha, R., Larsson, E.: Integrative analysis of complex cancer genomics and clinical profiles using the cBioPortal. Sci. Sign. 6(269), 2383 (2013)
Zhu, Y., Qiu, P., Ji, Y.: TCGA-assembler: open-source software for retrieving and processing TCGA data. Nat. Meth. 11(6), 599–600 (2014)
Hou, M.-X., Gao, Y.-L., Liu, J.-X., Shang, J.-L., Zheng, C.-H.: Comparison of Non-negative Matrix Factorization Methods for Clustering Genomic Data. In: International Conference on Intelligent Computing, pp. 290–299 (2016)
Cai, D., He, X., Han, J., Huang, T.S.: Graph regularized nonnegative matrix factorization for data representation. IEEE Trans. Pattern Anal. Mach. Intell. 33(8), 1548–1560 (2011)
Guan, N., Tao, D., Luo, Z., Yuan, B.: Manifold regularized discriminative nonnegative matrix factorization with fast gradient descent. IEEE Trans. Image Process. 20(7), 2030–2048 (2011)
Kong, D., Ding, C., Huang, H.: Robust nonnegative matrix factorization using l21-norm. In: Proceedings of the 20th ACM International Conference on Information and Knowledge Management, pp. 673–682 (2011)
Li, S.Z., Hou, X.W., Zhang, H., Cheng, Q.: Learning spatially localized, parts-based representation. In: Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR 2001, vol. 1, I-207-I-212 (2001). vol. 201
Ding, C., Li, T., Jordan, M.I.: Convex and semi-nonnegative matrix factorizations. IEEE Trans. Softw. Eng. 32(1), 45–55 (2010)
Long, X., Lu, H., Peng, Y., Li, W.: Graph regularized discriminative non-negative matrix factorization for face recognition. Multimed. Tools Appl. 72(3), 2679–2699 (2014)
Chung, F.R.: Spectral Graph Theory. Volume 92 of CBMS Regional Conference Series in Mathematics. American Mathematical Society, Providence (1997)
Le, L., Yu-Jin, Z.: A survey on algorithms of non-negative matrix factorization. J. Acta Electronica Sinica. 36(4), 737–743 (2008)
Chen, X., Gu, L., Li, S.Z., Zhang, H.-J.: Learning representative local features for face detection. In: 2001 Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR 2001, vol. 1, I-1126-I-1131 (2001). vol. 1121
Pascual-Montano, A., Carazo, J.M., Kochi, K., Lehmann, D., Pascual-Marqui, R.D.: Nonsmooth nonnegative matrix factorization (nsNMF). IEEE Trans. Pattern Anal. Mach. Intell. 28(3), 403–415 (2006)
Li, Y., Alioune, N.: The non-negative matrix factorization toolbox for biological data mining. Source Code Biol. Med. 8(1), 10 (2013)
Wang, Q., Liu, X.D.: Genes and Cholangiocarcinoma Genesis and Development. Medical Recapitulate (2012)
Zou, S., Li, J., Zhou, H., Frech, C., Jiang, X., Chu, J.S., Zhao, X., Li, Y., Li, Q., Wang, H.: Mutational landscape of intrahepatic cholangiocarcinoma. Nat. Commun. 5, 5696 (2014)
Biankin, A.V., Waddell, N., Kassahn, K.S., Gingras, M.C., Muthuswamy, L.B., Johns, A.L., Miller, D.K., Wilson, P.J., Patch, A.M., Wu, J.: Pancreatic cancer genomes reveal aberrations in axon guidance pathway genes. Nature 491(7424), 399–405 (2012)
Zhang, S., Liu, C.C., Li, W., Shen, H., Laird, P.W., Zhou, X.J.: Discovery of multi-dimensional modules by integrative analysis of cancer genomic data. Nucleic Acids Res. 40(19), 9379–9391 (2012)
Acknowledgement
This work was supported in part by the NSFC under grant Nos. 61572284, 61502272 and 61702299.
Author information
Authors and Affiliations
Corresponding authors
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer International Publishing AG, part of Springer Nature
About this paper
Cite this paper
Hou, MX., Liu, JX., Shang, J., Gao, YL., Kong, XZ., Dai, LY. (2018). Performance Analysis of Non-negative Matrix Factorization Methods on TCGA Data. In: Huang, DS., Jo, KH., Zhang, XL. (eds) Intelligent Computing Theories and Application. ICIC 2018. Lecture Notes in Computer Science(), vol 10955. Springer, Cham. https://doi.org/10.1007/978-3-319-95933-7_50
Download citation
DOI: https://doi.org/10.1007/978-3-319-95933-7_50
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-95932-0
Online ISBN: 978-3-319-95933-7
eBook Packages: Computer ScienceComputer Science (R0)