Abstract
We present an all-atom molecular modeling method that can predict the binding specificity of a transcription factor based on its 3D structure, with no further information required. We use molecular dynamics and free energy calculations to compute the relative binding free energies for a transcription factor with multiple possible DNA sequences. These sequences are then used to construct a position weight matrix to represent the transcription factor–binding sites. Free energy differences are calculated by morphing one base pair into another using a multi-copy representation in which multiple base pairs are superimposed at a single DNA position. Water-mediated hydrogen bonds between transcription factor side chains and DNA bases are known to contribute to binding specificity for certain transcription factors. To account for this important effect, the simulation protocol includes an explicit molecular water solvent and counter-ions. For computational efficiency, we use a standard additive approximation for the contribution of each DNA base pair to the total binding free energy. The additive approximation is not strictly necessary, and more detailed computations could be used to investigate non-additive effects.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Pabo CO, Sauer RT. Transcription factors: structural families and principles of DNA recognition. Annu Rev Biochem 1992, 61:1053–1095.
Tuerk C, Gold L. Systematic evolution of ligands by exponential enrichment: RNA ligands to bacteriophage T4 DNA polymerase. Science 1990, 249(4968):505–510.
Ren B, Robert F, Wyrick JJ, et al. Genome-wide location and function of DNA binding proteins. Science 2000, 290(5500):2306–2309.
Mukherjee S, Berger MF, Jona G, et al. Rapid analysis of the DNA-binding specificities of transcription factors with DNA microarrays. Nat Genet 2004, 36(12):1331–1339.
Morozov AV, Havranek JJ, Baker D, Siggia ED. Protein-DNA binding specificity predictions with structural models. Nucleic Acids Res 2005, 33(18):5781–5798.
Paillard G, Lavery R. Analyzing protein-DNA recognition mechanisms. Structure (Camb) 2004, 12(1):113–122.
Endres RG, Schulthess TC, Wingreen NS. Toward an atomistic model for predicting transcription-factor binding sites. Proteins 2004, 57(2):262–268.
Schneider TD, Stephens RM. Sequence logos: a new way to display consensus sequences. Nucleic Acids Res 1990, 18(20):6097–6100.
Leach A. Molecular Modelling: Principles and Applications, 2nd ed. Prentice Hall, Harlow, England; New York, 2001.
Frenkel D, Smit B. Understanding Molecular Simulations: From Algorithms to Applications, 2nd ed. San Diego: Academic Press, 2002.
Berman HM, Westbrook J, Feng Z, et al. The protein data bank. Nucleic Acids Res 2000, 28(1):235–242.
Olson WK, Bansal M, Burley SK, et al. A standard reference frame for the description of nucleic acid base-pair geometry. J Mol Biol 2001, 313(1):229–237.
Cheatham TE, III, Young MA. Molecular dynamics simulation of nucleic acids: successes, limitations, and promise. Biopolymers 2000, 56(4):232–256.
Mackerell AD, Jr. Empirical force fields for biological macromolecules: overview and issues. J Comput Chem 2004, 25(13):1584–1604.
Kollman P. Free energy calculations: applications to chemical and biochemical phenomena. Chem Rev 1993, 93:2395–2417.
Acknowledgments
LAL acknowledges funding from the Department of Energy (DE-FG0204ER25626). JSB acknowledges funding from NSF CAREER 0546446, NIH/NCRR U54RR020839, and the Whitaker foundation. We acknowledge a starter grant and an MRAC grant of computer time from the Pittsburgh Supercomputer Center, MCB060010P, MCB060033P, and MCB060056N.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Humana Press, a part of Springer Science+Business Media, LLC
About this protocol
Cite this protocol
Liu, L.A., Bader, J.S. (2009). Structure-Based Ab Initio Prediction of Transcription Factor–Binding Sites. In: Ireton, R., Montgomery, K., Bumgarner, R., Samudrala, R., McDermott, J. (eds) Computational Systems Biology. Methods in Molecular Biology, vol 541. Humana Press. https://doi.org/10.1007/978-1-59745-243-4_2
Download citation
DOI: https://doi.org/10.1007/978-1-59745-243-4_2
Published:
Publisher Name: Humana Press
Print ISBN: 978-1-58829-905-5
Online ISBN: 978-1-59745-243-4
eBook Packages: Springer Protocols