Advertisement

Multiple Structural RNA Alignment with Lagrangian Relaxation

  • Markus Bauer
  • Gunnar W. Klau
  • Knut Reinert
Part of the Lecture Notes in Computer Science book series (LNCS, volume 3692)

Abstract

Many classes of functionally related RNA molecules show a rather weak sequence conservation but instead a fairly well conserved secondary structure. Hence, it is clear that any method that relates RNA sequences in form of (multiple) alignments should take structural features into account. Since multiple alignments are of great importance for subsequent data analysis, research in improving the speed and accuracy of such alignments benefits many other analysis problems.

We present a formulation for computing provably optimal, structure-based, multiple RNA alignments and give an algorithm that finds such an optimal (or near-optimal) solution. To solve the resulting computational problem we propose an algorithm based on Lagrangian relaxation which already proved successful in the two-sequence case. We compare our implementation, mLARA, to three programs (clustalW, MARNA, and pmmulti) and demonstrate that we can often compute multiple alignments with consensus structures that have a significant lower minimum free energy term than computed by the other programs. Our prototypical experiments show that our new algorithm is competitive and, in contrast to other methods, is applicable to long sequences where standard dynamic programming approaches must fail. Furthermore, the Lagrangian method is capable of handling arbitrary pseudoknot structures.

Keywords

Lagrangian Relaxation Minimum Free Energy Consensus Structure Lagrangian Problem Consensus Secondary Structure 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Althaus, E., Caprara, A., Lenhof, H.-P., Reinert, K.: Multiple sequence alignment with arbitrary gap costs: Computing an optimal solution using polyhedral combinatorics. Bioinformatics 18(90002), S4–S16 (2002)Google Scholar
  2. 2.
    Bafna, V., Muthukrishnan, S., Ravi, R.: Computing similarity between RNA strings. In: Galil, Z., Ukkonen, E. (eds.) CPM 1995. LNCS, vol. 937, pp. 1–16. Springer, Heidelberg (1995)Google Scholar
  3. 3.
    Bauer, M., Klau, G.W.: Structural Alignment of Two RNA Sequences with Lagrangian Relaxation. In: Fleischer, R., Trippen, G. (eds.) ISAAC 2004. LNCS, vol. 3341, pp. 113–123. Springer, Heidelberg (2004)CrossRefGoogle Scholar
  4. 4.
    Caprara, A., Lancia, G.: Structural Alignment of Large-Size Proteins via Lagrangian Relaxation. In: Proc. of RECOMB 2002, pp. 100–108. ACM Press, New York (2002)CrossRefGoogle Scholar
  5. 5.
    Eddy, S.P., Durbin, R.: RNA sequence analysis using covariance models. Nucl. Acids Research 22(11), 2079–2088 (1994)CrossRefGoogle Scholar
  6. 6.
    Evans, P.: Finding common subsequences with arcs and pseudoknots. In: Crochemore, M., Paterson, M. (eds.) CPM 1999. LNCS, vol. 1645, pp. 270–280. Springer, Heidelberg (1999)CrossRefGoogle Scholar
  7. 7.
    Gardner, P., Wilm, A., Washietl, S.: A benchmark of multiple sequence alignment programs upon structural RNAs. Nucl. Acids Res. 33(8), 2433–2439 (2005)CrossRefGoogle Scholar
  8. 8.
    Gorodkin, J., Heyer, L.J., Stormo, G.D.: Finding the most significant common sequence and structure motifs in a set of RNA sequences. Nucl. Acids Res. 25, 3724–3732 (1997)CrossRefGoogle Scholar
  9. 9.
    Held, M., Karp, R.: The traveling-salesman problem and minimum spanning trees: Part II. Mathematical Programming 1, 6–25 (1971)zbMATHCrossRefMathSciNetGoogle Scholar
  10. 10.
    Hofacker, I.L., Bernhart, S.H.F., Stadler, P.F.: Alignment of RNA base pairing probability matrices. Bioinformatics 20, 2222–2227 (2004)CrossRefGoogle Scholar
  11. 11.
    Hofacker, I.L., Fekete, M., Stadler, P.F.: Secondary structure prediction for aligned RNA sequences. J. Mol. Biol. 319, 1059–1066 (2002)CrossRefGoogle Scholar
  12. 12.
    Kececioglu, J., Lenhof, H.-P., Mehlhorn, K., Mutzel, P., Reinert, K., Vingron, M.: A polyhedral approach to sequence alignment problems. Discrete Applied Mathematics 104, 143–186 (2000)zbMATHCrossRefMathSciNetGoogle Scholar
  13. 13.
    Lenhof, H.-P., Reinert, K., Vingron, M.: A polyhedral approach to RNA sequence structure alignment. Journal of Comp. Biology 5(3), 517–530 (1998)CrossRefGoogle Scholar
  14. 14.
    Mathews, D.H., Turner, D.H.: Dynalign: An algorithm for finding secondary structures common to two RNA sequences. J. Mol. Biol. 317, 191–203 (2002)CrossRefGoogle Scholar
  15. 15.
    McCaskill, J.S.: The Equilibrium Partition Function and Base Pair Binding Probabilities for RNA Secondary Structure. Biopolymers 29, 1105–1119 (1990)CrossRefGoogle Scholar
  16. 16.
    Sankoff, D.: Simultaneous solution of the RNA folding, alignment, and proto-sequence problems. SIAM J. Appl. Math. 45, 810–825 (1985)zbMATHCrossRefMathSciNetGoogle Scholar
  17. 17.
    Siebert, S., Backofen, R.: MARNA: Multiple alignment and consensus structure prediction of RNAs based on sequence structure comparisons. Bioinformatics (2005), (In press) Google Scholar
  18. 18.
    Washietl, S., Hofacker, I.L.: Consensus folding of aligned sequences as a new measure for the detection of functional RNAs by comparative genomics. J. Mol. Biol. 342(1), 19–30 (2004)CrossRefGoogle Scholar
  19. 19.
    Waterman, M.S.: Consensus methods for folding single-stranded nucleic adds. In: Mathematical Methods for DNA Sequences, pp. 185–224 (1989)Google Scholar
  20. 20.
    Wolsey, L.A.: Integer Programming. Wiley-Interscience series in discrete mathematics and optimization. Wiley, Chichester (1998)zbMATHGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2005

Authors and Affiliations

  • Markus Bauer
    • 1
    • 2
  • Gunnar W. Klau
    • 3
  • Knut Reinert
    • 1
  1. 1.Institute of Computer ScienceFree University of BerlinGermany
  2. 2.International Max Planck Research School for Computational Biology and Scientific Computing 
  3. 3.Institute of MathematicsFree University of BerlinGermany

Personalised recommendations