Abstract
Multiple sequence alignment involves alignment of more than two sequences and is an NP-complete problem. Therefore, heuristic algorithms that use different criteria to find an approximation to the optimal solution are employed. At the heart of these approaches lie the scoring and objective functions that a given algorithm uses to compare competing solutions in constructing a multiple sequence alignment. These objective functions are often motivated by the biological paradigms that govern functional similarities and evolutionary relations. Most existing approaches utilize a progressive process where the final alignment is constructed sequentially by adding new sequences into an existing multiple sequence alignment matrix, which is dynamically updated. In doing this, the core scoring function to assess accuracies of pairwise alignments generally remains the same, while the objective functions used in intermediary steps differ. Nevertheless, the overall assessment of the final multiple sequence alignment is generally calculated by an extension of pairwise scorings. In this chapter, we explore different scoring and objective functions used in calculating the accuracy and optimization of a multiple sequence alignment and provide utilization of these criteria in popularly used multiple sequence alignment algorithms.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Setubal C, Meidanis J (1997) Introduction to computational molecular biology. PWS Publishing, Boston
Thompson JD, Poch O (2006) Multiple sequence alignment as a workbench for molecular systems biology. Curr Bioinform 1(1):95–104
Lipman DJ, Altschul SF, Kececioglu JD (1989) A tool for multiple sequence alignment. Proc Natl Acad Sci USA 86(12):4412–4415
Lesk AM (2008) Introduction to bioinformatics. Oxford University Press, USA
Dayhoff MO, Schwartz RM, Orcutt BC (1978) A model of evolutionary change in proteins. In: Atlas of protein sequence and structure, vol. 5, Suppl 3. National Biomedical Research Foundation, Washington, pp 345–352
Henikoff S, Henikoff JG (1992) Amino acid substitution matrices from protein blocks. Proc Natl Acad Sci USA 89(22):10915–10919
Shannon CE, Weaver W, Blahut RE, Hajek B (1949) The mathematical theory of communication, vol 117. University of Illinois Press, Urbana
Teodorescu O, Galor T, Pillardy J, Elber R (2003) Enriching the sequence substitution matrix by structural information. Proteins Struct Funct Bioinform 54(1):41–48
Thompson JD, Plewniak F, Ripp R, Thierry J-C, Poch O et al (2001) Towards a reliable objective function for multiple sequence alignments. J Mol Biol 314(4):937
Thompson JD, Higgins DG, Gibson TJ (1994) Clustal w: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res 22(22):4673–4680
Thompson JD (1995) Introducing variable gap penalties to sequence alignment in linear space. Comput Appl Biosci 11(2):181–186
Notredame C, Higgins DG, Heringa J et al (2000) T-coffee: a novel method for fast and accurate multiple sequence alignment. J Mol Biol 302(1):205–218
Huang X, Miller W (1991) Lalign-find the best local alignments between two sequences. Adv Appl Math 12:373–381
Armougom F, Moretti S, Poirot O, Audic S, Dumas P, Schaeli B, Keduas V, Notredame C (2006) Expresso: automatic incorporation of structural information in multiple sequence alignments using 3d-coffee. Nucleic Acids Res 34(Suppl 2):W604–W608
Katoh K, Toh H (2008) Recent developments in the mafft multiple sequence alignment program. Brief Bioinform 9(4):286–298
Katoh K, Misawa K, Kuma K-i, Miyata T (2002) Mafft: a novel method for rapid multiple sequence alignment based on fast fourier transform. Nucleic Acids Res 30(14):3059–3066
Katoh K, Kuma K-i, Toh H, Miyata T (2005) Mafft version 5: improvement in accuracy of multiple sequence alignment. Nucleic Acids Res 33(2):511–518
Edgar RC (2004) Muscle: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res 32(5):1792–1797
Kimura M, Weiss GH (1964) The stepping stone model of population structure and the decrease of genetic correlation with distance. Genetics 49(4):561
Clamp M, Cuff J, Searle SM, Barton GJ (2004) The jalview java alignment editor. Bioinformatics 20(3):426–427
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer Science+Business Media, LLC
About this protocol
Cite this protocol
Doğan, H., Otu, H.H. (2014). Objective Functions. In: Russell, D. (eds) Multiple Sequence Alignment Methods. Methods in Molecular Biology, vol 1079. Humana Press, Totowa, NJ. https://doi.org/10.1007/978-1-62703-646-7_3
Download citation
DOI: https://doi.org/10.1007/978-1-62703-646-7_3
Published:
Publisher Name: Humana Press, Totowa, NJ
Print ISBN: 978-1-62703-645-0
Online ISBN: 978-1-62703-646-7
eBook Packages: Springer Protocols