Skip to main content

Gene Regulatory Networks from Single Cell Data for Exploring Cell Fate Decisions

  • Protocol
  • First Online:
Computational Stem Cell Biology

Part of the book series: Methods in Molecular Biology ((MIMB,volume 1975))

Abstract

Single cell experimental techniques now allow us to quantify gene expression in up to thousands of individual cells. These data reveal the changes in transcriptional state that occur as cells progress through development and adopt specialized cell fates. In this chapter we describe in detail how to use our network inference algorithm (PIDC)—and the associated software package NetworkInference.jl—to infer functional interactions between genes from the observed gene expression patterns. We exploit the large sample sizes and inherent variability of single cell data to detect statistical dependencies between genes that indicate putative (co-)regulatory relationships, using multivariate information measures that can capture complex statistical relationships. We provide guidelines on how best to combine this analysis with other complementary methods designed to explore single cell data, and how to interpret the resulting gene regulatory network models to gain insight into the processes regulating cell differentiation.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Protocol
USD 49.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 109.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 139.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 199.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Andrews TS, Hemberg M (2018) M3Drop: dropout-based feature selection for scRNASeq. Bioinformatics bty1044. https://doi.org/10.1093/bioinformatics/bty1044

  2. Babtie AC, Chan TE, Stumpf MPH (2017) Learning regulatory models for cell development from single cell transcriptomic data. Curr Opin Syst Biol 5:72–81

    Article  Google Scholar 

  3. Bacher R, Kendziorski C (2016) Design and computational analysis of single-cell RNA-sequencing experiments. Genome Biol 17:63

    Google Scholar 

  4. Bezanson J, Edelman A, Karpinski S, Shah VB (2014) Julia: a fresh approach to numerical computing. arXiv 1411.1607

    Google Scholar 

  5. Brennecke P, Anders S, Kim JK, Kołodziejczyk AA, Zhang X, Proserpio V, Baying B, Benes V, Teichmann SA, Marioni JC, Heisler MG (2013) Accounting for technical noise in single-cell RNA-seq experiments. Nat Methods 10(11):1093–1095

    Article  CAS  Google Scholar 

  6. Buettner F, Natarajan KN, Casale FP, Proserpio V, Scialdone A, Theis FJ, Teichmann SA, Marioni JC, Stegle O (2015) Computational analysis of cell-to-cell heterogeneity in single-cell RNA-sequencing data reveals hidden subpopulations of cells. Nat Biotechnol 33(2):155–160

    Article  CAS  Google Scholar 

  7. Butte AJ, Tamayo P, Slonim D, Golub TR, Kohane IS (2000) Discovering functional relationships between RNA expression and chemotherapeutic susceptibility using relevance networks. Proc Natl Acad Sci 97(22):12182–12186. https://doi.org/10.1073/pnas.220392197

    Article  CAS  Google Scholar 

  8. Chan TE, Stumpf MPH, Babtie AC (2017) Gene regulatory network inference from single-cell data using multivariate information measures. Cell Syst 5(3):251.e3–267.e3

    Google Scholar 

  9. Chan TE, Pallaseni A, Babtie AC, McEwen K, Stumpf MPH (2018) Empirical Bayes meets information theoretical network reconstruction from single cell data. bioRxiv. https://doi.org/10.1101/264853

  10. Delvenne JC, Yaliraki SN, Barahona M (2010) Stability of graph communities across time scales. Proc Natl Acad Sci 107(29):12755–12760. https://doi.org/10.1073/pnas.0903215107

    Article  CAS  Google Scholar 

  11. Efron B (2010) Large-scale inference. Empirical Bayes methods for estimation, testing, and prediction. Cambridge University Press, Cambridge

    Google Scholar 

  12. Faith JJ, Hayete B, Thaden JT, Mogno I, Wierzbowski J, Cottarel G, Kasif S, Collins JJ, Gardner TS (2007) Large-scale mapping and validation of Escherichia coli transcriptional regulation from a compendium of expression profiles. PLoS Biol 5(1):e8–e13

    Article  Google Scholar 

  13. Fan J, Salathia N, Liu R, Kaeser GE, Yung YC, Herman JL, Kaper F, Fan JB, Zhang K, Chun J, Kharchenko PV (2016) Characterizing transcriptional heterogeneity through pathway and gene set overdispersion analysis. Nat Methods 13(3):241–244

    Article  CAS  Google Scholar 

  14. Finak G, McDavid A, Yajima M, Deng J, Gersuk V, Shalek AK, Slichter CK, Miller HW, McElrath MJ, Prlic M, Linsley PS, Gottardo R (2015) MAST: a flexible statistical framework for assessing transcriptional changes and characterizing heterogeneity in single-cell RNA sequencing data. Genome Biol 16:278

    Google Scholar 

  15. Grün D, van Oudenaarden A (2015) Design and analysis of single-cell sequencing experiments. Cell 163(4):799–810

    Article  Google Scholar 

  16. Haghverdi L, Büttner M, Wolf FA, Buettner F, Theis FJ (2016) Diffusion pseudotime robustly reconstructs lineage branching. Nat Methods 13(10):845–848. https://doi.org/10.1038/nmeth.3971

  17. Hausser J, Strimmer K (2009) Entropy inference and the James-Stein estimator, with application to nonlinear gene association networks. J Mach Learn Res 10:1469–1484

    Google Scholar 

  18. Kharchenko PV, Silberstein L, Scadden DT (2014) Bayesian approach to single-cell differential expression analysis. Nat Methods 11(7):740–742

    Article  CAS  Google Scholar 

  19. Klein AM, Mazutis L, Akartuna I, Tallapragada N, Veres A, Li V, Peshkin L, Weitz DA, Kirschner MW (2015) Droplet barcoding for single-cell transcriptomics applied to embryonic stem cells. Cell 161(5):1187–1201

    Article  CAS  Google Scholar 

  20. Korthauer KD, Chu LF, Newton MA, Li Y, Thomson J, Stewart R, Kendziorski C (2016) A statistical approach for identifying differential distributions in single-cell RNA-seq experiments. Genome Biol 17:222. https://doi.org/10.1186/s13059-016-1077-y

  21. Kumar P, Tan Y, Cahan P (2017) Understanding development and stem cells using single cell-based analyses of gene expression. Development 144(1):17–32

    Article  CAS  Google Scholar 

  22. Lambiotte R, Delvenne JC, Barahona M (2014) Random walks, Markov processes and the multiscale modular organization of complex networks. IEEE Trans Netw Sci Eng 1(2):76–90. https://doi.org/10.1109/tnse.2015.2391998

    Article  Google Scholar 

  23. Liang KC, Wang X (2008) Gene regulatory network reconstruction using conditional mutual information. EURASIP J Bioinform Syst Biol 2008(1):253894. https://doi.org/10.1155/2008/253894

  24. Liu S, Trapnell C (2016) Single-cell transcriptome sequencing: recent advances and remaining challenges. F1000Research 5:182. https://doi.org/10.12688/f1000research.7223.1

  25. Lönnberg T, Svensson V, James KR (2017) Single-cell RNA-seq and computational analysis using temporal mixture modelling resolves Th1/Tfh fate bifurcation in malaria. Science 2(9):eaal2192

    Google Scholar 

  26. Lun ATL, Calero-Nieto FJ, Haim-Vilmovsky L, Göttgens B, Marioni JC (2017) Assessing the reliability of spike-in normalization for analyses of single-cell RNA sequencing data. Genome Res 27:1795-1806. https://doi.org/10.1101/gr.222877.117

  27. Macosko EZ, Basu A, Satija R, Nemesh J, Shekhar K, Goldman M, Tirosh I, Bialas AR, Kamitaki N, Martersteck EM, Trombetta JJ, Weitz DA, Sanes JR, Shalek AK, Regev A, McCarroll SA (2015) Highly parallel genome-wide expression profiling of individual cells using nanoliter droplets. Cell 161(5):1202–1214

    Article  CAS  Google Scholar 

  28. Marbach D, Costello JC, Küffner R, Vega NM, Prill RJ, Camacho DM, Allison KR, DREAM5 Consortium, Kellis M, Collins JJ, Stolovitzky G (2012) Wisdom of crowds for robust gene network inference. Nat Methods 9(8):796–804

    Article  Google Scholar 

  29. Margolin AA, Nemenman I, Basso K, Wiggins C, Stolovitzky G, Favera R, Califano A (2006) ARACNE: an algorithm for the reconstruction of gene regulatory networks in a mammalian cellular context. BMC Bioinf 7(Suppl 1):S7–S15

    Article  Google Scholar 

  30. McMahon SS, Sim A, Johnson R, Liepe J, Stumpf MPH (2014) Information theory and signal transduction systems: from molecular information processing to network inference. Semin Cell Dev Biol 35:98–108. https://doi.org/10.1016/j.semcdb.2014.06.011

    Article  CAS  Google Scholar 

  31. Meyer PE, Kontos K, Lafitte F, Bontempi G (2007) Information-theoretic inference of large transcriptional regulatory networks. EURASIP J Bioinform Syst Biol 2007(1):79879

    Google Scholar 

  32. Meyer PE, Lafitte F, Bontempi G (2008) minet: a R/Bioconductor package for inferring large transcriptional networks using mutual information. BMC Bioinf 9(1):461–510

    Article  Google Scholar 

  33. Moignard V, Göttgens B (2016) Dissecting stem cell differentiation using single cell expression profiling. Curr Opin Cell Biol 43:78–86

    Article  CAS  Google Scholar 

  34. Moignard V, Woodhouse S, Haghverdi L, Lilly AJ, Tanaka Y, Wilkinson AC, Buettner F, Macaulay IC, Jawaid W, Diamanti E, Nishikawa SI, Piterman N, Kouskoff V, Theis FJ, Fisher J, Göttgens B (2015) Decoding the regulatory network of early blood development from single-cell gene expression measurements. Nat Biotechnol 33(3):269–276. https://doi.org/10.1038/nbt.3154

    Article  CAS  Google Scholar 

  35. Newman MEJ (2002) Mixing patterns in networks. arXiv.org (2 Pt 2), 026,126

    Google Scholar 

  36. Oates CJ, Mukherjee S (2012) Network inference and biological dynamics. Ann Appl Stat 6(3):1209–1235. https://doi.org/10.1214/11-AOAS532

    Article  CAS  Google Scholar 

  37. Ocone A, Haghverdi L, Mueller NS, Theis FJ (2015) Reconstructing gene regulatory dynamics from high-dimensional single-cell snapshot data. Bioinformatics 31(12):i89–i96. https://doi.org/10.1093/bioinformatics/btv257

    Article  CAS  Google Scholar 

  38. Penfold CA, Wild DL (2011) How to infer gene networks from expression profiles, revisited. Interface Focus 1(6):857–870

    Article  Google Scholar 

  39. Rizvi AH, Camara PG, Kandror EK, Roberts TJ, Schieren I, Maniatis T, Rabadan R (2017) Single-cell topological RNA-seq analysis reveals insights into cellular differentiation and development. Nat Biotechnol 35(6):551–560

    Google Scholar 

  40. Salter Townshend M, White A, Gollini I, Murphy TB (2012) Review of statistical network analysis: models, algorithms, and software. Stat Anal Data Min ASA Data Sci J 5(4):243–264

    Article  Google Scholar 

  41. Scargle JD, Norris JP, Jackson B, Chiang J (2013) Studies in astronomical time series analysis. VI. Bayesian block representations. Astrophys J 764:167. https://doi.org/10.1088/0004-637X/764/2/167

  42. Setty M, Tadmor MD, Reich-Zeliger S, Angel O, Salame TM, Kathail P, Choi K, Bendall S, Friedman N, Pe’er D (2016) Wishbone identifies bifurcating developmental trajectories from single-cell data. Nat Biotechnol 34(6):637–714

    Google Scholar 

  43. Shannon CE (1948) A mathematical theory of communication. Bell Syst Tech J 27(3):379–423. http://dx.doi.org/10.1002/j.1538-7305.1948.tb01338.x

    Article  Google Scholar 

  44. Stegle O, Teichmann SA, Marioni JC (2015) Computational and analytical challenges in single-cell transcriptomics. Nat Rev Genet 16(3):133–145

    Article  CAS  Google Scholar 

  45. Stumpf MPH, Stumpf MPH, Kelly W, Thorne TW, Wiuf C, Wiuf C (2007) Evolution at the system level: the natural history of protein interaction networks. Trends Ecol Evol 22(7):366–373

    Article  Google Scholar 

  46. Stumpf PS, Smith RCG, Lenz M, Schuppert A, Müller FJ, Babtie A, Chan TE, Stumpf MPH, Please CP, Howison SD, Arai F, MacArthur BD (2017) Stem cell differentiation as a non-Markov stochastic process. Cell Syst 5(3):268.e7–282.e7

    Google Scholar 

  47. Tanay A, Regev A (2017) Scaling single-cell genomics from phenomenology to mechanism. Nat Biotechnol 541(7637):331–338

    Google Scholar 

  48. Timme N, Alford W, Flecker B, Beggs JM (2014) Synergy, redundancy, and multivariate information measures: an experimentalist’s perspective. J Comput Neurosci 36(2):119–140. https://doi.org/10.1007/s10827-013-0458-4

    Article  Google Scholar 

  49. Trapnell C, Cacchiarelli D, Grimsby J, Pokharel P, Li S, Morse M, Lennon NJ, Livak KJ, Mikkelsen TS, Rinn JL (2014) The dynamics and regulators of cell fate decisions are revealed by pseudotemporal ordering of single cells. Nat Biotechnol 32(4):381–U251. https://doi.org/10.1038/nbt.2859

    Article  CAS  Google Scholar 

  50. Vallejos C (2016) Beyond comparisons of means: understanding changes in gene expression at the single-cell level. Genome Biol 17:70. https://doi.org/10.1186/s13059-016-0930-3

  51. Vallejos CA, Risso D, Scialdone A, Dudoit S, Marioni JC (2017) Normalizing single-cell RNA sequencing data: challenges and opportunities. Nat Methods 14(6):565–571

    Google Scholar 

  52. van Dijk D, Sharma R, Nainys J, Yim K, Kathail P, Carr AJ, Burdziak C, Moon KR, Chaffer CL, Pattabiraman D, Bierie B, Mazutis L, Wolf G, Krishnaswamy S, Pe’er D (2018) Recovering gene interactions from single-cell data using data diffusion. Cell 174(3):716-729

    Google Scholar 

  53. Villaverde A, Ross J, Banga J (2013) Reverse engineering cellular networks with information theoretic methods. Cells 2(2):306–329

    Article  Google Scholar 

  54. Villaverde AF, Ross J, Morán F, Banga JR (2014) MIDER: network inference with mutual information distance and entropy reduction. PLoS ONE 9(5):e96,732

    Article  Google Scholar 

  55. Villaverde AF, Becker K, Banga JR (2017) PREMER: a tool to infer biological networks. IEEE/ACM Trans Comput Biol Bioinform 15(4):1193–1202. https://doi.org/10.1109/TCBB.2017.2758786

  56. Wagner A, Regev A, Yosef N (2016) Revealing the vectors of cellular identity with single-cell genomics. Nat Biotechnol 34(11):1145–1160

    Google Scholar 

  57. Watkinson J, Liang KC, Wang X, Zheng T, Anastassiou D (2009) Inference of regulatory gene interactions from expression data using three-way mutual information. Ann N Y Acad Sci 1158(1):302–313. https://doi.org/10.1111/j.1749-6632.2008.03757.x

    Article  CAS  Google Scholar 

  58. Welch JD, Hartemink AJ, Prins JF (2016) SLICER: inferring branched, nonlinear cellular trajectories from single cell RNA-seq data. Genome Biol 17(1):106

    Article  Google Scholar 

  59. Williams PL, Beer RD (2010) Nonnegative decomposition of multivariate information. arXiv.org, arXiv:1004.2515

    Google Scholar 

  60. Woodhouse S, Moignard V, Göttgens B, Fisher J (2016) Processing, visualising and reconstructing network models from single-cell data. Immunol Cell Biol 94(3):256–265

    Article  CAS  Google Scholar 

  61. Zhao J, Zhou Y, Zhang X, Chen L (2016) Part mutual information for quantifying direct associations in networks. Proc Natl Acad Sci USA 113(18):5130–5135

    Article  CAS  Google Scholar 

Download references

Acknowledgements

This work was supported by a Biotechnology and Biological Sciences Research Council (BBSRC) DTP Studentship to T.E.C., and a BBSRC Future Leaders Fellowship (grant reference BB/N011597/1) to A.C.B. We thank Joe Greener, Gal Horesh, and Ananth Pallaseni for sharing code with us, Suhail Islam for computing support, and Ben MacArthur, Patrick Stumpf, and members of the theoretical systems biology group for useful discussions.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ann C. Babtie .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Science+Business Media, LLC, part of Springer Nature

About this protocol

Check for updates. Verify currency and authenticity via CrossMark

Cite this protocol

Chan, T.E., Stumpf, M.P.H., Babtie, A.C. (2019). Gene Regulatory Networks from Single Cell Data for Exploring Cell Fate Decisions. In: Cahan, P. (eds) Computational Stem Cell Biology. Methods in Molecular Biology, vol 1975. Humana, New York, NY. https://doi.org/10.1007/978-1-4939-9224-9_10

Download citation

  • DOI: https://doi.org/10.1007/978-1-4939-9224-9_10

  • Published:

  • Publisher Name: Humana, New York, NY

  • Print ISBN: 978-1-4939-9223-2

  • Online ISBN: 978-1-4939-9224-9

  • eBook Packages: Springer Protocols

Publish with us

Policies and ethics