Gene co-expression network analysis identifies trait-related modules in Arabidopsis thaliana
A comprehensive network of the Arabidopsis transcriptome was analyzed and may serve as a valuable resource for candidate gene function investigations. A web tool to explore module information was also provided.
Arabidopsis thaliana is a widely studied model plant whose transcriptome has been substantially profiled in various tissues, development stages and other conditions. These data can be reused for research on gene function through a systematic analysis of gene co-expression relationships. We collected microarray data from National Center for Biotechnology Information Gene Expression Omnibus, identified modules of co-expressed genes and annotated module functions. These modules were associated with experiments/traits, which provided potential signature modules for phenotypes. Novel heat shock proteins were implicated according to guilt by association. A higher-order module networks analysis suggested that the Arabidopsis network can be further organized into 15 meta-modules and that a chloroplast meta-module has a distinct gene expression pattern from the other 14 meta-modules. A comparison with the rice transcriptome revealed preserved modules and KEGG pathways. All the module gene information was available from an online tool at http://bioinformatics.fafu.edu.cn/arabi/. Our findings provide a new source for future gene discovery in Arabidopsis.
KeywordsRice Conservation Hub gene Transcriptome
Gene co-expression network
Weighted gene co-expression network analysis
National Centre for Biotechnology Information
Gene Expression Omnibus
Relative standard deviation
Kyoto Encyclopedia of Genes and Genomes
There are so many insightful literatures about gene co-expression analysis. The authors apologize that not all related studies were cited due to lack of space.
This work was supported in part by the National Natural Science Foundation of China (Grant numbers 31270454 and 81502091) and Open Project of Key laboratory of Loquat Germplasm Innovation and Utilization, Putian University, Fujian Province (Grant number 2017003).
Compliance with ethical standards
Conflicts of interest
The authors have no conflicts of interest to declare.
- Boruc J, Van den Daele H, Hollunder J, Rombauts S, Mylle E, Hilson P, Inze D, De Veylder L, Russinova E (2010) Functional modules in the Arabidopsis core cell cycle binary protein–protein interaction network. Plant Cell 22(4):1264–1280. https://doi.org/10.1105/tpc.109.073635 CrossRefPubMedPubMedCentralGoogle Scholar
- Chang W, Cheng J, Allaire JJ, Xie Y, McPherson J (2015) Shiny: web application framework for R. R package version 011 1(4):106Google Scholar
- Huber W, Carey VJ, Gentleman R, Anders S, Carlson M, Carvalho BS, Bravo HC, Davis S, Gatto L, Girke T, Gottardo R, Hahne F, Hansen KD, Irizarry RA, Lawrence M, Love MI, MacDonald J, Obenchain V, Oles AK, Pages H, Reyes A, Shannon P, Smyth GK, Tenenbaum D, Waldron L, Morgan M (2015) Orchestrating high-throughput genomic analysis with Bioconductor. Nat Methods 12(2):115–121. https://doi.org/10.1038/nmeth.3252 CrossRefPubMedPubMedCentralGoogle Scholar
- Khan D, Millar JL, Girard IJ, Chan A, Kirkbride RC, Pelletier JM, Kost S, Becker MG, Yeung EC, Stasolla C, Goldberg RB, Harada JJ, Belmonte MF (2015) Transcriptome atlas of the Arabidopsis funiculus—a study of maternal seed subregions. Plant J 82(1):41–53. https://doi.org/10.1111/tpj.12790 CrossRefPubMedGoogle Scholar
- Lamb J, Crawford ED, Peck D, Modell JW, Blat IC, Wrobel MJ, Lerner J, Brunet JP, Subramanian A, Ross KN, Reich M, Hieronymus H, Wei G, Armstrong SA, Haggarty SJ, Clemons PA, Wei R, Carr SA, Lander ES, Golub TR (2006) The Connectivity Map: using gene-expression signatures to connect small molecules, genes, and disease. Science 313(5795):1929–1935. https://doi.org/10.1126/science.1132939 CrossRefPubMedGoogle Scholar
- Lee T, Yang S, Kim E, Ko Y, Hwang S, Shin J, Shim JE, Shim H, Kim H, Kim C, Lee I (2015) AraNet v2: an improved database of co-functional gene networks for the study of Arabidopsis thaliana and 27 other nonmodel plant species. Nucleic Acids Res 43:996–1002. https://doi.org/10.1093/nar/gku1053 (database issue) CrossRefGoogle Scholar
- Mutwil M, Klie S, Tohge T, Giorgi FM, Wilkins O, Campbell MM, Fernie AR, Usadel B, Nikoloski Z, Persson S (2011) PlaNet: combined sequence and expression comparisons across plant networks derived from seven species. Plant Cell 23(3):895–910. https://doi.org/10.1105/tpc.111.083667 CrossRefPubMedPubMedCentralGoogle Scholar
- Rajjou L, Belghazi M, Huguet R, Robin C, Moreau A, Job C, Job D (2006) Proteomic investigation of the effect of salicylic acid on Arabidopsis seed germination and establishment of early defense mechanisms. Plant Physiol 141(3):910–923. https://doi.org/10.1104/pp.106.082057 CrossRefPubMedPubMedCentralGoogle Scholar
- R Development Core Team (2013) R: a language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. http://www.R-project.org/. Accessed 1 May 2018
- Ruprecht C, Proost S, Hernandez-Coronado M, Ortiz-Ramirez C, Lang D, Rensing SA, Becker JD, Vandepoele K, Mutwil M (2017) Phylogenomic analysis of gene co-expression networks reveals the evolution of functional modules. Plant J 90(3):447–465. https://doi.org/10.1111/tpj.13502 CrossRefPubMedGoogle Scholar
- Usadel B, Obayashi T, Mutwil M, Giorgi FM, Bassel GW, Tanimoto M, Chow A, Steinhauser D, Persson S, Provart NJ (2009) Co-expression tools for plant biology: opportunities for hypothesis generation and caveats. Plant Cell Environ 32(12):1633–1651. https://doi.org/10.1111/j.1365-3040.2009.02040.x CrossRefPubMedGoogle Scholar
- van Veen H, Vashisht D, Akman M, Girke T, Mustroph A, Reinen E, Hartman S, Kooiker M, van Tienderen P, Schranz ME, Bailey-Serres J, Voesenek LA, Sasidharan R (2016) Transcriptomes of eight Arabidopsis thaliana accessions reveal core conserved, genotype- and organ-specific responses to flooding stress. Plant Physiol 172(2):668–689. https://doi.org/10.1104/pp.16.00472 CrossRefPubMedPubMedCentralGoogle Scholar
- Vlot AC, Liu PP, Cameron RK, Park SW, Yang Y, Kumar D, Zhou F, Padukkavidana T, Gustafsson C, Pichersky E, Klessig DF (2008) Identification of likely orthologs of tobacco salicylic acid-binding protein 2 and their role in systemic acquired resistance in Arabidopsis thaliana. Plant J 56(3):445–456. https://doi.org/10.1111/j.1365-313X.2008.03618.x CrossRefPubMedGoogle Scholar
- Yang Y, Xu R, Ma CJ, Vlot AC, Klessig DF, Pichersky E (2008) Inactive methyl indole-3-acetic acid ester can be hydrolyzed and activated by several esterases belonging to the AtMES esterase family of Arabidopsis. Plant Physiol 147(3):1034–1045. https://doi.org/10.1104/pp.108.118224 CrossRefPubMedPubMedCentralGoogle Scholar
- Zhang B, Horvath S (2005) A general framework for weighted gene co-expression network analysis. Stat Appl Genet Mol Biol 4:Article17. https://doi.org/10.2202/1544-6115.1128