Skip to main content

Comparison of Spectra in Unsequenced Species

  • Conference paper
Advances in Bioinformatics and Computational Biology (BSB 2009)

Part of the book series: Lecture Notes in Computer Science ((LNBI,volume 5676))

Included in the following conference series:

Abstract

We introduce a new algorithm for the mass spectrometric identification of proteins. Experimental spectra obtained by tandem MS/MS are directly compared to theoretical spectra generated from proteins of evolutionarily closely related organisms. This work is motivated by the need of a method that allows the identification of proteins of unsequenced species against a database containing proteins of related organisms. The idea is that matching spectra of unknown peptides to very similar MS/MS spectra generated from this database of annotated proteins can lead to annotate unknown proteins. This process is similar to ortholog annotation in protein sequence databases. The difficulty with such an approach is that two similar peptides, even with just one modification (i.e. insertion, deletion or substitution of one or several amino acid(s)) between them, usually generate very dissimilar spectra. In this paper, we present a new dynamic programming based algorithm: PacketSpectralAlignment. Our algorithm is tolerant to modifications and fully exploits two important properties that are usually not considered: the notion of inner symmetry, a relation linking pairs of spectrum peaks, and the notion of packet inside each spectrum to keep related peaks together. Our algorithm, PacketSpectralAlignment is then compared to SpectralAlignment [1] on a dataset of simulated spectra. Our tests show that PacketSpectralAlignment behaves better, in terms of results and execution time.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Pevzner, P.A., Dancík, V., Tang, C.L.: Mutation-tolerant protein identification by mass spectrometry. J. Comput. Biol. 7(6), 777–787 (2000)

    Article  CAS  PubMed  Google Scholar 

  2. Habermann, B., Oegema, J., Sunyaev, S., Shevchenko, A.: The power and the limitations of cross-species protein identification by mass spectrometry-driven sequence similarity searches. Mol. Cell. Proteomics 3(3), 238–249 (2004)

    Article  CAS  PubMed  Google Scholar 

  3. Grossmann, J., Fischer, B., Baerenfaller, K., Owiti, J., Buhmann, J.M., Gruissem, W., Baginsky, S.: A workflow to increase the detection rate of proteins from unsequenced organisms in high-throughput proteomics experiments. Proteomics 7(23), 4245–4254 (2007)

    Article  CAS  PubMed  Google Scholar 

  4. Pevtsov, S., Fedulova, I., Mirzaei, H., Buck, C., Zhang, X.: Performance evaluation of existing de novo sequencing algorithms. J. Proteome. Res. 5(11), 3018–3028 (2006)

    Article  CAS  PubMed  Google Scholar 

  5. Pitzer, E., Masselot, A., Colinge, J.: Assessing peptide de novo sequencing algorithms performance on large and diverse data sets. Proteomics 7(17), 3051–3054 (2007)

    Article  CAS  PubMed  Google Scholar 

  6. Eng, J., McCormack, A., Yates, J.: An approach to correlate tandem mass spectral data of peptides with amino acid sequences in a protein database. J. Am. Soc. Mass Spectrom 5(11), 976–989 (1994)

    Article  CAS  PubMed  Google Scholar 

  7. Perkins, D.N., Pappin, D.J., Creasy, D.M., Cottrell, J.S.: Probability-based protein identification by searching sequence databases using mass spectrometry data. Electrophoresis 20(18), 3551–3567 (1999)

    Article  CAS  PubMed  Google Scholar 

  8. Yates, J.R., Eng, J.K., McCormack, A.L., Schieltz, D.: Method to correlate tandem mass spectra of modified peptides to amino acid sequences in the protein database. Anal. Chem. 67(8), 1426–1436 (1995)

    Article  CAS  PubMed  Google Scholar 

  9. Pevzner, P.A., Mulyukov, Z., Dancik, V., Tang, C.L.: Efficiency of database search for identification of mutated and modified proteins via mass spectrometry. Genome Res. 11(2), 290–299 (2001)

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  10. Tsur, D., Tanner, S., Zandi, E., Bafna, V., Pevzner, P.A.: Identification of post-translational modifications by blind search of mass spectra. Nat. Biotechnol. 23(12), 1562–1567 (2005)

    Article  CAS  PubMed  Google Scholar 

  11. Dancik, V., Addona, T., Clauser, K., Vath, J., Pevzner, P.: De novo peptide sequencing via tandem mass spectrometry. Journal of Computational Biology 6(3-4), 327–342 (1999)

    Article  CAS  PubMed  Google Scholar 

  12. Frank, A.M., Savitski, M.M., Nielsen, M.L., Zubarev, R.A., Pevzner, P.A.: De novo peptide sequencing and identification with precision mass spectrometry. J. Proteome Res. 6(1), 114–123 (2007)

    Article  CAS  PubMed  PubMed Central  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2009 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Cliquet, F., Fertin, G., Rusu, I., Tessier, D. (2009). Comparison of Spectra in Unsequenced Species. In: Guimarães, K.S., Panchenko, A., Przytycka, T.M. (eds) Advances in Bioinformatics and Computational Biology. BSB 2009. Lecture Notes in Computer Science(), vol 5676. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-03223-3_3

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-03223-3_3

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-03222-6

  • Online ISBN: 978-3-642-03223-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics