Skip to main content

A Study of the Correlation Structure of Microarray Gene Expression Data Based on Mechanistic Modeling of Cell Population Kinetics

  • Chapter
  • First Online:
Statistical Modeling for Biological Systems

Abstract

Sample correlations between gene pairs within expression profiles are potentially informative regarding gene regulatory pathway structure. However, as is the case with other statistical summaries, observed correlation may be induced or suppressed by factors which are unrelated to gene functionality. In this paper, we consider the effect of heterogeneity on observed correlations, both at the tissue and subject level. Using gene expression profiles from highly enriched samples of three distinct embryonic glial cell types of the rodent neural tube, the effect of tissue heterogeneity on correlations is directly estimated for a simple two component model. Then, a stochastic model of cell population kinetics is used to assess correlation effects for more complex mixtures. Finally, a mathematical model for correlation effects of subject-level heterogeneity is developed. Although decomposition of correlation into functional and nonfunctional sources will generally not be possible, since this depends on nonobservable parameters, reasonable bounds on the size of such effects can be made using the methods proposed here.

On February 27, 2008, Dr. Andrei Yakovlev tragically passed away. We deeply grieve the loss of our colleague, advisor, and friend who was a source of inspiration for all around him.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 74.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 99.00
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 129.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Berger, S. L. (2007). The complex language of chromatin regulation during transcription. Nature, 447, 407–412.

    Article  Google Scholar 

  2. Braun, R., Cope, L., & Parmigiani, G. (2008). Identifying differential correlation in gene/pathway combinations. BMC Bioinformatics, 9, 1.

    Article  Google Scholar 

  3. Chen, L., Almudevar, A., & Klebanov, L. (2013). Aggregation effect in microarray data analysis. In A. Y. Yakovlev, L. Klebanov, D. Gaile (Eds.), Statistical methods for microarray data analysis. Methods in Molecular Biology (Vol. 972, pp. 177–191). New York: Springer.

    Google Scholar 

  4. Davies, J., Proschel, C., Zhang, N., Mayer-Proschel, M., & Davies, S. (2008). Transplanted astrocytes derived from BMP- or CNTF-treated glial-restricted precursors have opposite effects on recovery and allodynia after spinal cord injury. Brazilian Journal of Biology, 7, 24.

    Google Scholar 

  5. Dulac, C. (2010). Brain function and chromatin plasticity. Nature, 465, 728–35.

    Article  Google Scholar 

  6. Fraser, P., & Bickmore, W. (2007). Nuclear organization of the genome and the potential for gene regulation. Nature, 447, 413–417.

    Article  Google Scholar 

  7. Friedman, N. (2004). Inferring cellular networks using probabilistic graphical models. Science, 303, 799–805.

    Article  Google Scholar 

  8. Grewal, S. I., & Moazed, D. (2003). Heterochromatin and epigenetic control of gene expression. Science, 301, 798–802.

    Article  Google Scholar 

  9. Hyrien, O., Mayer-Proschel, M., Noble, M., & Yakovlev, A. (2005). A stochastic model to analyze clonal data on multi-type cell populations. Biometrics, 61, 199–207.

    Article  MathSciNet  Google Scholar 

  10. Klebanov, L., Jordan, C., & Yakovlev, A. (2006). A new type of stochastic dependence revealed in gene expression data. Statistical Applications in Genetics and Molecular Biology, 5, Article 7.

    Google Scholar 

  11. Klebanov, L., & Yakovlev, A. (2007). Diverse correlation structures in microarray gene expression data and their utility in improving statistical inference. The Annals of Applied Statistics, 1, 538–559.

    Article  MathSciNet  Google Scholar 

  12. Li, E. (2002). Chromatin modification and epigenetic reprogramming in mammalian development. Nature Reviews Genetics, 3, 662–673.

    Article  Google Scholar 

  13. Li, G., & Reinberg, D. (2011). Chromatin higher-order structures and gene regulation. Current Opinion in Genetics & Development, 21, 175–186.

    Article  Google Scholar 

  14. Numata, K., Kanai, A., Saito, R., Kondo, S., Adachi, J., Wilming, L. G., et al. (2003). Identification of putative noncoding RNAs among the RIKEN mouse full-length cDNA collection. Genome Research, 13, 1301–1306.

    Article  Google Scholar 

  15. Ponjavic, J., Ponting, C. P., & Lunter, G. (2007). Functionality or transcriptional noise? Evidence for selection within long noncoding RNAs. Genome Research, 17, 556–565.

    Article  Google Scholar 

  16. Qiu, X., Brooks, A., Klebanov, L., & Yakovlev, A. (2005). The effects of normalization on the correlation structure of microarray data. BMC Bioinformatics, 6, 120.

    Article  Google Scholar 

  17. Rao, M., Tanksale, A., Ghatge, M., & Deshpande, V. (1998). Molecular and biotechnological aspects of microbial proteases. Microbiology and Molecular Biology Reviews, 62, 597–635.

    Article  Google Scholar 

  18. Ravasi, T., Suzuki, H., Pang, K. C., Katayama, S., Furuno, M., Okunishi, R., et al. (2006). Experimental validation of the regulated expression of large numbers of non-coding RNA from the mouse genome. Genome Research, 16, 11–19.

    Article  Google Scholar 

  19. Saxena, A., & Carninci, P. (2011). Long non-coding RNA modifies chromatin: Epigenetic silencing by long non-coding RNAs. Bioessays, 33, 830–839.

    Article  Google Scholar 

  20. Soares, L., & Valcarcel, J. (2006). The expanding transcriptome: The genome as the “Book of Sand”. The EMBO Journal, 25, 923–931.

    Article  Google Scholar 

  21. Vogelmann, J., Valeri, A., Guillou, E., Cuvier, O., & Nollmann, M. (2011). Roles of chromatin insulator proteins in higher-order chromatin organization and transcription regulation. Nucleus, 2, 358–369.

    Article  Google Scholar 

  22. Werhli, A. V., Grzegorczyk, M., & Husmeier, D. (2006). Comparative evaluation of reverse engineering gene regulatory networks with relevance networks, graphical Gaussian models and Bayesian networks. Bioinformatics, 22, 2523–2531.

    Article  Google Scholar 

  23. Zhou, X., Kao, M., & Wong, W. (2002). Transitive functional annotation by shortest-path analysis of gene expression data. Proceedings of the National Academy of Sciences of the United States of America, 99, 12783–12788.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Linlin Chen .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Switzerland AG

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Chen, L., Klebanov, L., Almudevar, A., Proschel, C., Yakovlev, A. (2020). A Study of the Correlation Structure of Microarray Gene Expression Data Based on Mechanistic Modeling of Cell Population Kinetics. In: Almudevar, A., Oakes, D., Hall, J. (eds) Statistical Modeling for Biological Systems. Springer, Cham. https://doi.org/10.1007/978-3-030-34675-1_3

Download citation

Publish with us

Policies and ethics