Abstract
In this paper, we develop a new feature extraction method based on sparse singular value decomposition (SSVD). We apply SSVD algorithm to select the characteristic genes from Colorectal Cancer (CRC) genomic dataset, and then the differentially expressed genes obtained are evaluated by the tools based on Gene Ontology. As a gene extraction method, SSVD is also compared with some existing feature extraction methods such as independent component analysis (ICA), the p-norm robust feature extraction (PREE) and sparse principal component analysis (SPCA). The experimental results show that SSVD method outperforms the existing algorithms.
Keywords
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Lee, D., Lee, W., Lee, Y., Pawitan, Y.: Super-sparse principal component analyses for high-throughput genomic data. BMC Bioinf. 11(1), 296 (2010)
Journée, M., Nesterov, Y., Richtárik, P., Sepulchre, R.: Generalized power method for sparse principal component analysis. J. Mach. Learn. Res. 11, 517–553 (2010)
Liu, J.X., Wang, Y.T., Zheng, C.H.: Robust PCA based method for discovering differentially expressed genes. BMC Bioinf. 14(Suppl 8), S3 (2013)
Liu, J.X., Zheng, C.H., Xu, Y.: Extracting plants core genes responding to abiotic stresses by penalized matrix decomposition. Comput. Biol. Med. 42(5), 582–589 (2012)
Huang, D.S., Zheng, C.H.: Independent component analysis-based penalized discriminant method for tumor classification using gene expression data. Bioinformatics 22(15), 1855–1862 (2006)
Liu, J., Liu, J.X., Gao, Y.L., Kong, X.Z., Wang, D.: A p-norm robust feature extraction method for identifying differentially expressed genes. PLoSONE 10(7), e0133124 (2015)
Lee, M., Shen, H.P., Huang, J.Z., Marron, J.S.: Biclustering via sparse value decomposition. Biometrics 66, 1087–1095 (2010)
Eckart, C., Young, G.: The approximation of one matrix by another of lower rank. Psychometrika 1, 211–218 (1936)
Zou, H.: The adaptive lasso and its oracle properties. J. Am. Stat. Assoc. 101(475), 1418–1429 (2006)
Schwarz, G.: Estimating the dimension of a model. Ann. Stat. 6, 461–464 (1978)
Zou, H., Hastie, T., Tibshirani, R.: On the “degrees of freedom” of the lasso. Ann. Stat. 35, 2173–2192 (2007)
Kilian, J., Whitehead, D., Horak, J., Wanke, D., Weinl, S., Batistic, O.: The AtGenExpress global stress expression data set: protocols, evaluation and model data analysis of UV-B light, drought and cold stress responses. Plant J. 50(2), 347–363 (2007)
Zheng, C.H., Huang, D.S., Zhang, L., Kong, X.Z.: Tumor clustering using nonnegative matrix factorization with gene selection. IEEE Trans. Inf. Technol. Biomed. 13(4), 599–607 (2009)
Sartor, M.A., Mahavisno, V., Keshamouni, V.G., Cavalcoli, J., Wright, Z., Karnovsky, A., Kuick, R., Jagadish, H., Mirel, B., Weymouth, T.: ConceptGen: a gene set enrichment and gene set relation mapping tool. Bioinformatics 26(4), 456–463 (2010)
Boyle, E.I., Weng, S.A., Gollub, J., Jin, H., Botstein, D., Cherry, J.M., Sherlock, G.: GO: termfinder-open source software for accessing gene ontology information and finding significantly enriched gene ontology terms associated with a list of genes. Bioinformatics 20(18), 3710–3715 (2004)
Chen, J., Bardes, E.E., Aronow, B.J., Jegga, A.G.: ToppGene suite for gene list enrichment analysis and candidate gene prioritization. Nucleic Acids Res. 37(suppl 2), W305–W311 (2009)
Wang, E.T., Sandberg, R., Luo, S., Khrebtukova, I., Zhang, L., Mayr, C., Kingsmore, S.F., Schroth, G.P., Burge, C.B.: Alternative isoform regulation in human tissue transcriptomes. Nature 456(7221), 470–476 (2008)
Acknowledgement
This work was supported in part by the grants of the National Science Foundation of China, Nos. 61572284, 61502272, 61572283; Shenzhen Municipal Science and Technology Innovation Council, No. JCYJ20140417172417174; Natural Science Foundation of Shandong Province, No. BS2014DX004.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Kong, X., Liu, J., Zheng, C., Shang, J. (2016). Gene Extraction Based on Sparse Singular Value Decomposition. In: Huang, DS., Bevilacqua, V., Premaratne, P. (eds) Intelligent Computing Theories and Application. ICIC 2016. Lecture Notes in Computer Science(), vol 9771. Springer, Cham. https://doi.org/10.1007/978-3-319-42291-6_28
Download citation
DOI: https://doi.org/10.1007/978-3-319-42291-6_28
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-42290-9
Online ISBN: 978-3-319-42291-6
eBook Packages: Computer ScienceComputer Science (R0)