Bolt: a New Age Peptide Search Engine for Comprehensive MS/MS Sequencing Through Vast Protein Databases in Minutes
- 194 Downloads
Recent increases in mass spectrometry speed, sensitivity, and resolution now permit comprehensive proteomics coverage. However, the results are often hindered by sub-optimal data processing pipelines. In almost all MS/MS peptide search engines, users must limit their search space to a canonical database due to time constraints and q value considerations, but this typically does not reflect the individual genetic variations of the organism being studied. In addition, engines will nearly always assume the presence of only fully tryptic peptides and limit PTMs to a handful. Even on high-performance servers, these search engines are computationally expensive, and most users decide to dial back their search parameters. We present Bolt, a new cloud-based search engine that can search more than 900,000 protein sequences (canonical, isoform, mutations, and contaminants) with 41 post-translation modifications and N-terminal and C-terminal partial tryptic search in minutes on a standard configuration laptop. Along with increases in speed, Bolt provides an additional benefit of improvement in high-confidence identifications. Sixty-one percent of peptides uniquely identified by Bolt may be validated by strong fragmentation patterns, compared with 13% of peptides uniquely identified by SEQUEST and 6% of peptides uniquely identified by Mascot. Furthermore, 30% of unique Bolt identifications were verified by all three software on the longer gradient analysis, compared with only 20% and 27% for SEQUEST and Mascot identifications respectively. Bolt represents, to the best of our knowledge, the first fully scalable, cloud-based quantitative proteomic solution that can be operated within a user-friendly GUI interface. Data are available via ProteomeXchange with identifier PXD012700.
KeywordsMass spectrometry Proteomics Peptide Mutations Search engine MS/MS Sequencing Variants Cloud Bolt
We would like to acknowledge Simion Kreimer, Ph.D. (Johns Hopkins University) and Dragana Lagundzin, Ph.D. (University of Nebraska) for their help with the Mascot analysis.
- 4.Scheltema, R.A., Hauschild, J.-P., Lange, O., Hornburg, D., Denisov, E., Damoc, E., Kuehn, A., Makarov, A., Mann, M.: The Q Exactive HF, a benchtop mass spectrometer with a pre-filter, high-performance quadrupole and an ultra-high-field Orbitrap analyzer. Mol. Cell. Proteomics. 13(12), 3698–3708 (2014)CrossRefGoogle Scholar
- 6.Prakash, A., Peterman, S., Ahmad, S., Sarracino, D., Frewen, B., Vogelsang, M., Byram, G., Krastins, B., Vadali, G., Lopez, M.: Hybrid data acquisition and processing strategies with increased throughput and selectivity: PSMART analysis for global qualitative and quantitative analysis. J. Proteome Res. 13(12), 5415–5430 (2014)CrossRefGoogle Scholar
- 13.Friedman, D.B., Andacht, T.M., Bunger, M.K., Chien, A.S., Hawke, D.H., Krijgsveld, J., Lane, W.S., Lilley, K.S., Maccoss, M.J., Moritz, R.L., et al.: The ABRF proteomics research group studies: educational exercises for qualitative and quantitative proteomic analyses. Proteomics. 11(8), 1371–1381 (2011)CrossRefGoogle Scholar
- 14.Bekker-Jensen, D.B., Kelstrup, C.D., Batth, T.S., Larsen, S.C., Haldrup, C., Bramsen, J.B., Sorensen, K.D., Hoyer, S., Orntoft, T.F., Andersen, C.L., et al.: An optimized shotgun strategy for the rapid generation of comprehensive human proteomes. Cell Syst. 4(6), 587–599 (2017)CrossRefGoogle Scholar
- 23.Bern, M.; Kil, Y. J.; Becker, C. Byonic: Advanced peptide and protein identification software. Curr. Protoc. Bioinforma. 2012;13;Unit13.20Google Scholar