Abstract
Transcription of DNA generates single-stranded RNAs that may operate by virtue of their structures (tRNAs, rRNAs), or by virtue of their encoding of proteins (mRNAs). The former consist of a small number of types which, while quantitatively abundant, are collectively encoded by a small part of genomes. The latter consist of a large number of types which, while each is quantitatively sparse, are collectively encoded by a larger part of genomes. All types of RNA fold into highly significant (non-random) ‘stem-loop’ structures. While the selective forces affecting tRNA and rRNA structures mainly relate to their roles in the cytoplasm, those affecting mRNA structures largely relate to the need for structure (stem-loop potential) of the genes from which they were transcribed. This need may conflict with the need to optimize the sequence of an encoded protein. Since non-genic regions of genomes also have potential for highly significant stem-loop structure, the potential is likely to be a genome-wide response to some selective force. In 1972 Harold White proposed that the strain of an unwinding force acting on regular duplex DNA would be relieved by extrusion of stem-loop structures. The potentials for such higher ordered structures in single-stranded nucleic acids may be calculated from the base-pairing energies of overlapping dinucleotides, which are fundamental units of nucleic acid structure. Contributions to the energetics of such structures decompose into base composition-dependent and base order-dependent components that may either support or oppose each other. The latter, most important, component is determined by subtracting the base composition-dependent component from the total folding energy. The base composition-dependent component is itself determined by shuffling and refolding a sequence several times – thus destroying the base order-dependent component – and then taking the average folding energy of the resulting structures. Extrusion symmetry between the two strands of a DNA duplex is violated in regions rich in A and C (with complements T and G). Selection for nucleic acid structure provides an explanation for PR2.
I am puzzled by the frequent appearance of palindromes in these sequences. Naturally one would expect occasional short ones by chance, but they seem to occur too often. … I have wondered if there is any mechanism which would produce them, but have been unable to think of one.
F. Crick , 1974 [1]
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Crick F (1974) Letter to H. G. Khorana, 28th June. Archives of National Library of Medicine, Washington, DC
Delbrück M (1949) Transactions of the Connecticut Academy of Arts and Sciences 38:173–190
Salser W (1970) Discussion. Cold Spring Harbor Symposium in Quantitative Biology 35:19
Ball LA (1972) Implications of secondary structure in messenger RNA. Journal of Theoretical Biology 36:313–320
White HB, Laux BE, Dennis D (1972) Messenger RNA structure: compatibility of hairpin loops with protein sequence. Science 175:1264–1266
Forsdyke DR (1998) An alternative way of thinking about stem-loops in DNA. Journal of Theoretical Biology 192:489–504; Einert TR, Netz RR (2011) Theory for RNA folding, stretching, and melting including loops and salt. Biophysical Journal 100:2745–2753 [An early model of Poland and Scheraga did not take into account the possibility that stem-loop extrusion from duplex DNA could be either assisted or impeded by the order of bases.]
Seffens W, Digby D (1999) mRNAs have greater negative folding free energies than shuffled or codon choice randomized sequences. Nucleic Acids Research 27:1578–1584
Forsdyke DR (2007) Calculation of folding energies of single-stranded nucleic acid sequences: conceptual issues. Journal of Theoretical Biology 248:745–753 [For many purposes, the control randomization of sequences can be performed at the 1-mer level. This retains base compositions, but not nearest-neighbor associations.]
Tinoco I, Uhlenbeck OC, Levine MD (1971) Estimating secondary structure in ribonucleic acids. Nature 230:362–367
Allawi HT, SantaLucia J (1997) Thermodynamics and NMR of internal GT mismatches in DNA. Biochemistry 36:10581–10589 [Since the Table 5.1 values for DNA were not then available, in my early studies I assumed that the corresponding values for RNA (later shown to closely approximated the DNA values), would suffice for my purposes.]
Zuker M (1990) Predicting optimal and suboptimal secondary structure for RNA. Methods in Enzymology 183:281–306
Hundley HA, Bass BL (2010) ADAR editing in double-stranded UTRs and other non-coding RNA sequences. Trends in Biochemical Science 35:377–383
Yura K, Sulaiman S, Hatta Y, Shionyu M, Go M (2009) RESOPS: A database for analyzing the correspondence of RNA editing sites to protein three-dimensional structures. Plant and Cell Physiology 50:1865–1873
Kudla G, Murray AW, Tollervey D, Plotkin JB (2009) Coding-sequence determinants of gene expression in Escherichia coli. Science 324:255–258
Gu W, Zhou T, Wilke CO (2010) A universal trend of reduced mRNA stability near the translation initiation site in prokaryotes and eukaryotes. PLOS Computational Biology 6:e1000664
Zhang S-H, Huang Y-Z (2010) Limited contribution of stem-loop potential to symmetry of single-stranded genomic DNA. Bioinformatics 26:478–485
Fontana W, Konings DAM, Stadler PF, Schuster P (1994) Statistics of RNA secondary structures. Biopolymers 33:1389–1404
Arnold J, Cuticchia AJ, Newsome DA, Jennings WW, Ivarie R (1988) Mono- through hexanucleotide composition of the sense strand of yeast DNA: a Markov chain analysis. Nucleic Acids Research 16:7145–7157
Nussinov R (1982) Some indications for inverse DNA duplication. Journal of Theoretical Biology 95:783–791
Alff-Steinberger C (1984) Evidence for a coding pattern in the non-coding strand of the E. coli genome. Nucleic Acids Research 12:2235–2241
Kong S-G, Fan W-L, Chen H-D, Hsu Z-T, Zhou N, Zheng B, Lee H-C (2009) Inverse symmetry in complete genomes and whole-genome inverse duplication. PLOS One 4:e7553
Noort V van, Worning P, Ussery DW, Rosche WA, Sinden RR (2003) Strand misalignments lead to quasipalindrome correction. Trends in Genetics 19:365–369
Zhang C, Xu S, Wei J-F, Forsdyke DR (2008) Microsatellites that violate Chargaff's second parity rule have base order-dependent asymmetries in the folding energies of complementary DNA strands and may not drive speciation. Journal of Theoretical Biology 254:168–177
Author information
Authors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer Science+Business Media, LLC
About this chapter
Cite this chapter
Forsdyke, D.R. (2016). Stems and Loops. In: Evolutionary Bioinformatics. Springer, Cham. https://doi.org/10.1007/978-3-319-28755-3_5
Download citation
DOI: https://doi.org/10.1007/978-3-319-28755-3_5
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-28753-9
Online ISBN: 978-3-319-28755-3
eBook Packages: Biomedical and Life SciencesBiomedical and Life Sciences (R0)