Incorporating Literature Knowledge in Bayesian Network for Inferring Gene Networks with Gene Expression Data

  • Eyad Almasri
  • Peter Larsen
  • Guanrao Chen
  • Yang Dai
Part of the Lecture Notes in Computer Science book series (LNCS, volume 4983)


The reconstruction of gene networks from microarray gene expression has been a challenging problem in bioinformatics. Various methods have been proposed for this problem. The incorporation of various genomic and proteomic data has been shown to enhance the learning ability in the Bayesian Network (BN) approach. However, the knowledge embedded in the large body of published literature has not been utilized in a systematic way. In this work, prior knowledge on gene interaction was derived based on the statistical analysis of published interactions between pairs of genes or gene products. This information was used (1) to construct a structure prior and (2) to reduce the search space in the BN algorithm. The performance of the two approaches was evaluated and compared with the BN method without prior knowledge on two time course microarray gene expression data related to the yeast cell cycle. The results indicate that the proposed algorithms can identify edges in learned networks with higher biological relevance. Furthermore, the method using literature knowledge for the reduction of the search space outperformed the method using a structure prior in the BN framework.


Bayesian Network Likelihood score Prior probability 


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Friedman, N., et al.: Using Bayesian networks to analyze expression data. J. Computational Biology, 601–620 (2000)Google Scholar
  2. 2.
    Spirtes, P., et al.: Causation, prediction, and search. Springer, New York (1993)zbMATHGoogle Scholar
  3. 3.
    Cooper, G., et al.: A Bayesian method for the induction of probabilistic networks from data. J. Machine Learning, 309–347 (1992)Google Scholar
  4. 4.
    Akutsu, T., et al.: Identification of genetic networks from a small number of gene expression patterns under the Boolean network model. In: Pacific Symp. Biocomputing, pp. 17–28 (1999)Google Scholar
  5. 5.
    Friedman, N., et al.: On the sample complexity of learning Bayesian networks. In: Proc. Twelfth Conference on Uncertainty in Artificial Intelligence (2001)Google Scholar
  6. 6.
    Pe’er, D., et al.: Inferring subnetworks from perturbed expression profiles. Bioinformatics, 215–S224 (2001)Google Scholar
  7. 7.
    Hartemink, A., et al.: Using graphical models and genomic expression data to statistically validate models of genetic regulatory networks. In: Pacific Symp. Biocomputing, pp. 422–433 (2002)Google Scholar
  8. 8.
    Imoto, S., et al.: Estimation of genetic networks and functional structures between genes by using Bayesian networks and nonparametric regression. In: Pacific Symp. Biocomputing, pp. 175–186 (2002)Google Scholar
  9. 9.
    Hartemink, A., et al.: Combining location and expression data for principled discovery of genetic regulatory network models. In: Pacific Symp. Biocomputing, pp. 437–449 (2002)Google Scholar
  10. 10.
    Chrisman, L., et al.: Incorporating biological knowledge into evaluation of causal regulatory hypotheses. In: Pacific Symp. Biocomputing, pp. 128–139 (2003)Google Scholar
  11. 11.
    Tamada, Y., et al.: Estimating gene networks from gene expression data by combining Bayesian network model with promoter element detection. Bioinformatics, 227–236 (2003)Google Scholar
  12. 12.
    Nariai, N., et al.: Using protein-protein interactions for refining gene networks estimated from microarray data by Bayesian networks. In: Pacific Symp. Biocomputing, pp. 336–347 (2004)Google Scholar
  13. 13.
    Yeang, C., et al.: Physical network models. J. of Computational Biology, 243–262 (2004)Google Scholar
  14. 14.
    Werhli, A., et al.: Reconstructing gene regulatory networks with bayesian networks by combining expression data with multiple sources of prior knowledge. Statistical Applications in Genetics and Molecular Biology 6(1), Article 15 (2007)Google Scholar
  15. 15.
    Larsen, P., et al.: A statistical method to incorporate biological knowledge for generating testable novel gene regulatory interactions from microarray experiments. BMC Bioinformatics 317 (2007)Google Scholar
  16. 16.
    Spirtes, P., et al.: Causation, prediction, and search. The MIT Press, New York (2000)Google Scholar
  17. 17.
    Spellman, P., et al.: Comprehensive identification of cell cycle-regulated genes of the yeast Saccharomyces cerevisiae by microarray hybridization. Molecular Cell, 3273–3297 (1998)Google Scholar
  18. 18.
    Zou, M., et al.: A new dynamic Bayesian network (DBN) approach for identifying gene regulatory networks from time course microarray data. Bioinformatic, 71–79 (2005)Google Scholar
  19. 19.
    Nikitin, A., et al.: Pathway studio–the analysis and navigation of molecular networks. Bioinformatic, 2155–2157 (2003)Google Scholar
  20. 20.
  21. 21.
    Battle, A., et al.: Probabilistic discovery of overlapping cellular processes and their regulation. In: Proc. of the Annual International Conference on Computational Molecular Biology, pp. 167–176 (2004)Google Scholar
  22. 22.
    Friedman, N., et al.: Being Bayesian about network structure: A Bayesian approach to structure discovery in Bayesian networks. J. Machine Learning, 601–620 (2003)Google Scholar
  23. 23.
    Castelo, R., et al.: Priors on network structures. Biasing the search for Bayesian networks, Technical Report (CWI) (Centre for Mathematics and Computer Science) (1998)Google Scholar
  24. 24.
    Norris, D., et al.: The effect of histone gene deletions on chromatin structure in Saccharomyces Cerevisiae. Science, 759–761 (1988)Google Scholar
  25. 25.
    Luger, K., et al.: Crystal structure of the nucleosome core particle at 2.8 A resolution. Nature, 251–260 (1997)Google Scholar
  26. 26.
    Briggs, S., et al.: Gene silencing: trans-histone regulatory pathway in chromatin. Nature, 498 (1997)Google Scholar
  27. 27.
    Krogan, N., et al.: The Paf1 complex is required for histone H3 methylation by COMPASS and Dot1p: linking transcriptional elongation to histone methylation. Molecular Cell, 721–729 (2003)Google Scholar
  28. 28.
    Castelo, R., et al.: Priors on network structures. Biasing the search for Bayesian networks. International Journal of Approximate Reasoning, 39–57 (2000)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2008

Authors and Affiliations

  • Eyad Almasri
    • 1
  • Peter Larsen
    • 1
  • Guanrao Chen
    • 1
  • Yang Dai
    • 1
  1. 1.University of Illinois at ChicagoChicagoUSA

Personalised recommendations