Sparse Decomposition of Gene Expression Data to Infer Transcriptional Modules Guided by Motif Information
An important topic in computational biology is to identify transcriptional modules through sequence analysis and gene expression profiling. A transcriptional module is formed by a group of genes under control of one or several transcription factors (TFs) that bind to cis-regulatory elements in the promoter regions of those genes. In this paper, we develop an integrative approach, namely motif-guided sparse decomposition (mSD), to uncover transcriptional modules by combining motif information and gene expression data. The method exploits the interplay of co-expression and co-regulation to find regulated gene patterns guided by TF binding information. Specifically, a motif-guided clustering method is first developed to estimate transcription factor binding activities (TFBAs); sparse component analysis is then followed to further identify TFs’ target genes. The experimental results show that the mSD approach can successfully help uncover condition-specific transcriptional modules that may have important implications in endocrine therapy of breast cancer.
KeywordsMotif analysis sparse component analysis transcriptional modules gene regulatory networks estrogen receptor binding
Unable to display preview. Download preview PDF.
- 1.Speed, T.: Statistical Analysis of Gene Expression Microarray Data. Chapman & Hall/CRC (2003)Google Scholar
- 2.Nguyen, D.H., D’Haeseleer, P.: Deciphering principles of transcription regulation in eukaryotic genomes. Mol. Syst. Biol. 2 (2006)Google Scholar
- 12.Arash Ali, A., Massoud, B.-Z., Christian, J.: A Fast Method for Sparse Component Analysis Based on Iterative Detection-Estimation. In: AIP Conference Proceedings, vol. 872(1), pp. 123–130 (2006)Google Scholar