Identifying Characteristic Genes Based on Robust Principal Component Analysis
In this paper, based on robust PCA, a novel method of characteristic genes identification is proposed. In our method, the differentially expressed genes and non-differentially expressed genes are treated as perturbation signals S 0 and low-rank matrix A 0, respectively, which can be recovered from the gene expression data using robust PCA. The scheme to identify the characteristic genes is as following. Firstly, the matrix S 0 of perturbation signals is discovered from gene expression data matrix D by using robust PCA. Secondly, the characteristic genes are selected according to matrix S 0. Finally, the characteristic genes are checked by the tool of Gene Ontology. The experimental results show that our method is efficient and effective.
Keywordsrobust PCA gene identification gene expression data
Unable to display preview. Download preview PDF.
- 5.Liu, J.X., Zheng, C.H., Xu, Y.: Extracting Plants Core Genes Responding to Abiotic Stresses by Penalized Matrix Decomposition. Comput. Biol. Med. (2012), doi:10.1016 /j.compbiomed.2012.1002.1002Google Scholar
- 7.Lin, Z., Chen, M., Wu, L., Ma, Y.: The Augmented Lagrange Multiplier Method for Exact Recovery of Corrupted Low-rank Matrices (2010), http://Arxiv.org/abs/1009.5055v2
- 8.Journée, M., Nesterov, Y., Richtarik, P., Sepulchre, R.: Generalized Power Method for Sparse Principal Component Analysis. The Journal of Machine Learning Research 11, 517–553 (2010)Google Scholar
- 9.Boyle, E.I., Weng, S.A., Gollub, J., Jin, H., Botstein, D., Cherry, J.M., Sherlock, G.: GO:TermFinder - Open Source Software for Accessing Gene Ontology Information and Finding Significantly Enriched Gene Ontology Terms Associated with a list of Genes. Bioinformatics 20, 3710–3715 (2004)CrossRefGoogle Scholar