Skip to main content

Evaluation of Peak-Picking Algorithms for Protein Mass Spectrometry

  • Protocol
  • First Online:

Part of the book series: Methods in Molecular Biology ((MIMB,volume 696))

Abstract

Peak picking is an early key step in MS data analysis. We compare three commonly used approaches to peak picking and discuss their merits by means of statistical analysis. Methods investigated encompass signal-to-noise ratio, continuous wavelet transform, and a correlation-based approach using a Gaussian template.

Functionality of the three methods is illustrated and discussed in a practical context using a mass spectral data set created with MALDI-TOF technology. Sensitivity and specificity are investigated using a manually defined reference set of peaks. As an additional criterion, the robustness of the three methods is assessed by a perturbation analysis and illustrated using ROC curves.

This is a preview of subscription content, log in via an institution.

Buying options

Protocol
USD   49.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   159.00
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD   109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Springer Nature is developing a new tool to find and evaluate Protocols. Learn more

References

  1. Kwon D, Vannucci M, Song JJ, Jeong J, Pfeiffer RM (2008) A novel wavelet-based thresholding method for the pre-processing of mass spectrometry data that accounts for heterogeneous noise. Proteomics 8:3019–3029

    Article  CAS  PubMed  Google Scholar 

  2. Savitzky A, Golay MJE (1964) Smoothing and differentiation of data by simplified least squares procedures. Anal Chem 36(8):1627–1639

    Article  CAS  Google Scholar 

  3. Coombes KR, Tsavachidis S, Morris JS, Baggerly KA, Hung MC, Kuerer HM (2005) Improved peak detection and quantification of mass spectrometry data acquired from surface-enhanced laser desorption and ionization by denoising spectra with the undecimated discrete wavelet transform. Proteomics 5:4107–4117

    Article  CAS  PubMed  Google Scholar 

  4. Sauve AC, Speed TP (2004) Normalization, baseline correction and alignment of high-throughput mass spectrometry data. Proceedings of the genomic signal processing and statistics, 2004

    Google Scholar 

  5. Cleveland WS, Grosse E, Shyu WM (1992) Local regression models. In: Chambers JM, Hastie T (eds) Statistical models in S. Wadsworth & Brooks/Cole, Pacific Grove, CA, pp 309–376

    Google Scholar 

  6. Lange E, Gropl C, Reinert K, Kohlbacher O, Hildebrandt A (2006) High-accuracy peak picking of proteomics data using wavelet techniques. Pac Symp Biocomput 11:243–254

    Article  Google Scholar 

  7. Du P, Kibbe WA, Lin SM (2006) Improved peak detection in mass spectrum by incorporating continuous wavelet transform-based pattern matching. Bioinformatics 22:2059–2065

    Article  CAS  PubMed  Google Scholar 

  8. Gentleman R, Carey V, Huber W, Irizarry R, Dudoit S (eds) (2005) Bioinformatics and computational biology solutions using r and bioconductor. Springer, New York

    Google Scholar 

  9. Mantini D, Petrucci F, Pieragostino D, Del Boccio P, Di Nicola M, Di Ilio C, Federici G, Sacchetta P, Comani S, Urbani A (2007) LFMPIC: a computational method for the separation of protein MALDI-TOF-MS signals from noise. BMC Bioinform 8:101

    Article  Google Scholar 

  10. Kohlbacher O, Reinert K, Gröpl C, Lange E, Pfeifer N, Schulz-Trieglaff O, Sturm M (2007) TOPP-the OpenMS proteomics pipeline. Bioinformatics 23:e191–197

    Article  CAS  PubMed  Google Scholar 

  11. Sturm M, Bertsch A, Gröpl C, Hildebrandt A, Hussong R, Lange E, Pfeifer N, Schulz-Trieglaff O, Zerck A, Reinert K, Kohlbacher O (2008) OpenMS – an open-source software framework for mass spectrometry. BMC Bioinform 9:163

    Article  Google Scholar 

  12. Yang C, He Z, Yu W (2009) Comparison of public peak detection algorithms for MALDI mass spectrometry data analysis. BMC Bioinform 10:4

    Article  Google Scholar 

  13. Liu Q, Sung AH, Qiao M, Chen Z, Yang JY, Yang MQ, Huang X, Deng Y (2009) Comparison of feature selection and classification for MALDI-MS data. BMC Genomics 10(Suppl 1):S3

    Article  PubMed  Google Scholar 

  14. Jeffries N (2005) Algorithms for alignment of mass spectrometry proteomic data. Bio-informatics 21:3066–3073

    Article  CAS  PubMed  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2011 Springer Science+Business Media, LLC

About this protocol

Cite this protocol

Bauer, C., Cramer, R., Schuchhardt, J. (2011). Evaluation of Peak-Picking Algorithms for Protein Mass Spectrometry. In: Hamacher, M., Eisenacher, M., Stephan, C. (eds) Data Mining in Proteomics. Methods in Molecular Biology, vol 696. Humana Press. https://doi.org/10.1007/978-1-60761-987-1_22

Download citation

  • DOI: https://doi.org/10.1007/978-1-60761-987-1_22

  • Published:

  • Publisher Name: Humana Press

  • Print ISBN: 978-1-60761-986-4

  • Online ISBN: 978-1-60761-987-1

  • eBook Packages: Springer Protocols

Publish with us

Policies and ethics