Abstract
This chapter guides the user through an analysis pipeline that includes preprocessing raw mass spectrometry data into a user-friendly quantitative protein report and statistical analysis. We use a publicly available dataset as a working example that covers two prominent strategies for mass spectrometry-based proteomics, the extensively used data-dependent acquisition (DDA) and the emerging data-independent acquisition (DIA) technology. We use MaxQuant for DDA data and Spectronaut for DIA data preprocessing. Both software packages are well-established tools in the field. We perform subsequent analysis in the R software environment which offers a large repertoire of tools for data analysis and visualization. The chapter will aid lab scientists with some familiarity with R to reproducibly analyze their experiments using state-of-the-art bioinformatics methods.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Aebersold R, Mann M (2003) Mass spectrometry-based proteomics. Nature 422:198
Gillet LC, Navarro P, Tate S, Röst H, Selevsek N, Reiter L, Bonner R, Aebersold R (2012) Targeted data extraction of the MS/MS spectra generated by data-independent acquisition: a new concept for consistent and accurate proteome analysis. Mol Cell Proteomics 11:O111–016717
Cox J, Mann M (2008) MaxQuant enables high peptide identification rates, individualized ppb-range mass accuracies and proteome-wide protein quantification. Nat Biotechnol 26:1367
Ma Z-Q, Dasari S, Chambers MC, Litton MD, Sobecki SM, Zimmerman LJ, Halvey PJ, Schilling B, Drake PM, Gibson BW, others (2009) IDPicker 2.0: improved protein assembly with high discrimination peptide identification filtering. J Proteome Res 8:3872–3881
Tabb DL, Fernando CG, Chambers MC (2007) MyriMatch: highly accurate tandem mass spectral peptide identification by multivariate hypergeometric analysis. J Proteome Res 6:654–661
Kim S, Pevzner PA (2014) MS-GF+ makes progress towards a universal database search tool for proteomics. Nat Commun 5:5277
Bruderer R, Bernhardt OM, Gandhi T, Miladinović SM, Cheng L-Y, Messner S, Ehrenberger T, Zanotelli V, Butscheid Y, Escher C, others (2015) Extending the limits of quantitative proteome profiling with data-independent acquisition and application to acetaminophen-treated three-dimensional liver microtissues. Mol Cell Proteomics 14:1400–1410
Röst HL, Rosenberger G, Navarro P, Gillet L, Miladinović SM, Schubert OT, Wolski W, Collins BC, Malmström J, Malmström L, Aebersold R (2014) OpenSWATH enables automated, targeted analysis of data-independent acquisition MS data. Nat Biotechnol 32:219
Tyanova S, Temu T, Sinitcyn P, Carlson A, Hein MY, Geiger T, Mann M, Cox J (2016) The Perseus computational platform for comprehensive analysis of (prote) omics data. Nat Methods 13:731
Pham TV, Piersma SR, Warmoes M, Jimenez CR (2009) On the beta-binomial model for analysis of spectral count data in label-free tandem mass spectrometry-based proteomics. Bioinformatics 26:363–369
Pham TV, Jimenez CR (2012) An accurate paired sample test for count data. Bioinformatics 28:i596–i602
Robinson MD, McCarthy DJ, Smyth GK (2010) EdgeR: a bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics 26:139–140
Love MI, Huber W, Anders S (2014) Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol 15:550
Pham TV, Jimenez CR (2016) Simulated linear test applied to quantitative proteomics. Bioinformatics 32:i702–i709
Smyth GK (2004) Linear models and empirical Bayes methods for assessing differential expression in microarray experiments. Stat Appl Genet Mol Biol 3:1–25
Choi M, Chang C-Y, Clough T, Broudy D, Killeen T, MacLean B, Vitek O (2014) MSstats: an R package for statistical analysis of quantitative mass spectrometry-based proteomic experiments. Bioinformatics 30:2524–2526
Shannon P, Markiel A, Ozier O, Baliga NS, Wang JT, Ramage D, Amin N, Schwikowski B, Ideker T (2003) Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res 13:2498–2504
Liu H, Sadygov RG, Yates JR (2004) A model for random sampling and estimation of relative protein abundance in shotgun proteomics. Anal Chem 76:4193–4201
Cox J, Hein MY, Luber CA, Paron I, Nagaraj N, Mann M (2014) Accurate proteome-wide label-free quantification by delayed normalization and maximal peptide ratio extraction, termed MaxLFQ. Mol Cell Proteomics 13:2513–2526
Tyanova S, Temu T, Cox J (2016) The MaxQuant computational platform for mass spectrometry-based shotgun proteomics. Nat Protoc 11:2301
Warnes GR, Bolker B, Bonebakker L, Gentleman R, Huber W, Liaw A, Lumley T, Maechler M, Magnusson A, Moeller S, Bill V (2009) Gplots: Various R programming tools for plotting data. R package version 2.1
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Science+Business Media, LLC, part of Springer Nature
About this protocol
Cite this protocol
Pham, T.V., Jimenez, C.R. (2019). Quantitative Analysis of Mass Spectrometry-Based Proteomics Data. In: Li, K. (eds) Neuroproteomics. Neuromethods, vol 146. Humana, New York, NY. https://doi.org/10.1007/978-1-4939-9662-9_12
Download citation
DOI: https://doi.org/10.1007/978-1-4939-9662-9_12
Published:
Publisher Name: Humana, New York, NY
Print ISBN: 978-1-4939-9661-2
Online ISBN: 978-1-4939-9662-9
eBook Packages: Springer Protocols