Abstract
The determination of the amino acid sequence of a peptide from its MS/MS spectrum is an important task in proteomics. The determination without the help of a protein database is called the de novo sequencing, which is especially useful in the identification of unknown proteins. Many studies on the de novo sequencing problem have been done but none proves to be practical. In this paper, we define a new model for this problem, and provide a sophisticated dynamic programming algorithm to solve it. Experiments on real MS/MS data demonstrated that the algorithm works very well on QTof MS/MS data.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Bartels, C. 1990. Fast algorithm for peptide sequencing by mass spectroscopy. Biomed. Environ. Mass Spectrom 19, 363–368.
Chen, T., Kao, M-Y., Tepel, M., Rush J., and Church, G. 2001. A Dynamic Programming Approach to de novo Peptide Sequencing via Tandem Mass Spectrometry. J. Comp. Biology 8(3), 325–337.
Dančík, V., Addona, T., Clauser, K., Vath, J., and Pevzner, P. 1999. De novo protein sequencing via tandem mass-spectrometry. J. Comp. Biology 6, 327–341.
Eng, J.K., McCormack, A.L., and Yates, J.R. 1994. An approach to correlate tandem mass spectral data pf peptides with amino acid sequences in a protein database. J. Am. Soc. Mass Spectrom 5, 976–989.
Fernández de Cossío, J., Gonzales, J., and Besada, V. 1995. A computer program to aid the sequencing of peptides in collision-activated decomposition experiments. CABIOS 11(4), 427–434.
Hamm, C.W., Wilson, W.E., and Harvan, D.J. 1986. Peptide sequencing program. CABIOS 2, 365.
Hines, W.M., Falick, A.M., Burlingame, A.L., and Gibson, B.W. 1992. Pattern-based algorithm for peptide sequencing from tandem high energy collision-induced dissociation mass spectra. J. Am. Sco. Mass. Spectrom. 3, 326–336.
Ishikawa K., and Niva, Y. 1986. Computer-aided peptide sequencing by fast atom bombardment mass spectrometry. Biomed. Environ. Mass Spectrom. 13, 373–380.
Johnson, R.J., and Biemann, K. 1989. Computer program (seqpep) to aid the interpretation of high-energy collision tandem mass spectra of peptides. Biomed. Environ. Mass. Spectrom. 18, 945–957.
Johnson, R.S., Martin, S.A., Biemann, K., Stults, J.T., and Watson, J.T. 1987. Novel fragmentation process of peptides by collision-induced decomposition in a tandem mass spectrometer: differentiation of leucine and isoleucine. Anal. Chem. 59(21), 2621–5.
Ma, B., Zhang, K., Lajoie, G., Doherty-Kirby, A., Liang, C., and Li, M. 2002. A powerful software tool for the de novo sequencing of peptides from MS/MS data. 50th ASMS Conference on Mass Spectrometry and Allied Topics.
Mann, M., and Wilm, M. 1994. Error-tolerant identification of peptides in sequence databases by peptide sequence tags. Anal. Chem. 66, 4390–4399.
Perkins, D.N., Pappin, D.J.C., Creasy, D.M., and Cottrell, J.S. 1999. Probability-based protein identification by searching sequence database using mass spectrometry data. Electrophoresis 20, 3551–3567.
Pevzner, P.A., Dančík, V., and Tang, C. 2000. Mutation Tolerant Protein Identification by Mass Spectrometry. Journal of Computational Biology 6, 777–787.
Roepstorff, P., and Fohlman J. 1984. Proposal for a common nomenclature for sequence ions in mass spectra of peptides. Biomed Mass Spectrom 11(11), 601.
Sakurai, T., Matsuo, T., Matsuda, H., and Katakuse, I. 1984. Paas3: A computer program to determine probable sequence of peptides from mass spectrometric data. Biomed. Mass spectrum 11(8), 396–399.
Siegel, M.M., and Bauman, N. 1988. An efficient algorithm for sequencing peptides using fast atom bombardment mass spectral data. Biomed. Environ. Mass Spectrom 15, 333–343.
Snyder, A.P. 2000. Interpreting Protein Mass Spectra: A Comprehensive Resource. Oxford University Press.
Taylor, J.A., and Johnson, R.S. 1997. Sequence Database Searches via de novo peptide sequencing by tandem mass spectrometry. Rapid Communications in Mass Spectrometry 11, 1067–1075.
Yates, J.R.I., Eng, J.K., McCormack, A.L., and Schieltz, D. 1995. Method to correlate tandem mass spectra of modified peptides to amino acid sequences in the protein database. Analytical Chemistry 67, 1426–36.
Yates, J.R., Griffin, P.R., Hood, L.E., and Zhou, J.X. 1991. Computer aided interpretation of low energy MS/MS mass spectra of peptides, 477–485. in J.J. Villafranca ed., Techniques in Protein Chmistry II, Academic Press, San Diego.
Zidarov, D., Thibault, P., Evans, M.J., and Bertrand, M.J. 1990. Determination of the primary structure of peptidesusing fast atom bombardment mass spectrometry. Biomed. Environ. Mass Spectrom 19, 13–16.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Ma, B., Zhang, K., Liang, C. (2003). An Effective Algorithm for the Peptide De Novo Sequencing from MS/MS Spectrum. In: Baeza-Yates, R., Chávez, E., Crochemore, M. (eds) Combinatorial Pattern Matching. CPM 2003. Lecture Notes in Computer Science, vol 2676. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-44888-8_20
Download citation
DOI: https://doi.org/10.1007/3-540-44888-8_20
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-40311-1
Online ISBN: 978-3-540-44888-4
eBook Packages: Springer Book Archive