UV-POSIT: Web-Based Tools for Rapid and Facile Structural Interpretation of Ultraviolet Photodissociation (UVPD) Mass Spectra
- 307 Downloads
UV-POSIT (Ultraviolet Photodissociation Online Structure Interrogation Tools) is a suite of web-based tools designed to facilitate the rapid interpretation of data from native mass spectrometry experiments making use of 193 nm ultraviolet photodissociation (UVPD). The suite includes four separate utilities which assist in the calculation of fragment ion abundances as a function of backbone cleavage sites and sequence position; the localization of charge sites in intact proteins; the calculation of hydrogen elimination propensity for a-type fragment ions; and mass-offset searching of UVPD spectra to identify unknown modifications and assess false positive fragment identifications. UV-POSIT is implemented as a Python/Flask web application hosted at http://uv-posit.cm.utexas.edu. UV-POSIT is available under the MIT license, and the source code is available at https://github.com/jarosenb/UV_POSIT.
KeywordsUltraviolet photodissociation Charge site Protein Native MS
Native mass spectrometry has recently risen to prominence as a method for characterizing the three-dimensional structures of biomolecules. By using low voltages and non-denaturing buffers, electrospray ionization transfers proteins and protein complexes to the gas phase with structures that in many cases preserve features of their solution counterparts [1, 2]. These native-like species can be analyzed by mass spectrometry and MS/MS to obtain information on complex stoichiometry, charge state distribution, and structure .
Ultraviolet photodissociation (UVPD) exhibits distinct advantages for characterizing native proteins and complexes in the gas phase. Energy deposition upon absorption of 193 nm photons may activate ions into excited electronic states, resulting in production of a diverse array of fragment ions that reflect tertiary and quaternary structural features in addition to providing primary sequence information [4, 5, 6, 7, 8, 9, 10, 11, 12, 13]. The abundance of fragment ions in native UVPD mass spectra align with the B-factors derived from crystallography, a measure of backbone flexibility. Regions of proteins which participate in fewer hydrophobic interactions or which do not engage in extensive non-covalent interactions exhibit higher UVPD efficiencies, allowing fragment ion abundances to be used as a probe of protein flexibility . In complexes of proteins with small molecules, comparison of UVPD of the “holo” (ligand-containing) and “apo” (ligand-free) forms of proteins allow the characterization of structural changes at the residue level [4, 6, 9, 10, 11].
In addition to the information obtained from fragment ion abundances, the charge states of fragment ions can be used to probe gas-phase structure. The mechanism of formation of a-type and x-type fragment ions formed by UVPD does not involve extensive mobilization of protons across the protein backbone; therefore, the resulting fragment ions retain charges at the same residues where they were localized in the intact protein. Analysis of fragment ion charge state reveals the location of charges sites in the intact protein, giving deeper insight into the gas-phase structure . As well as charge site information, the a-type fragments give insight into the hydrogen-bonding structure of polypeptides [8, 13]. Residues in which the amide oxygen or amide nitrogen atoms participate in hydrogen-bonding interactions with adjacent residues exhibit minimal hydrogen loss and form a+1-type fragments; cleavage at residues which do not exhibit these interactions leads to more extensive hydrogen loss and the dominant formation of a-type fragment ions (1 Da lower in mass than the corresponding a+1 ions). Deconvolution of the a-ion isotope envelope can be applied to measure the extent of intramolecular hydrogen bonding for each residue.
Implementation and Data Input
The fragment abundance, charge state analysis, and mass-offset search utilities use two separate dialogs for input. The sequence dialog is used to upload an amino acid sequence, and the masses of static modifications can be added parenthetically after the modified residue. The data dialog is then used to input a set of tab-delimited mass and intensity data for each ion peak, as well as a set of options specific to the utility being used. Details of proper data formatting will be discussed in subsequent sections. Clicking the “Validate” button in these dialogs triggers validation of the input; if validation fails, the user is alerted. In the fragment abundance and charge site tools, if both validations pass then the user interface seamlessly updates with the results of the analysis (Figure S1). Within the mass-offset search utility, successful validation enables the user to submit the search parameters to a Celery background worker using a Heroku Redis instance as the broker, after which the user is prompted to download the search results as a .csv file. Data input in the HEM utility is similar but involves a single input dialog in which an mzML-formatted mass spectrum file is uploaded and an amino acid sequence is entered along with a set of configuration options.
Fragment Abundance Utility
The fragment abundance utility allows calculation and visualization of UVPD fragment ion abundances as a function of position along the peptide backbone. For an individual protein, this abundance information can be used as an indicator of backbone flexibility and conformational variations . Practical applications have included the probing of conformational changes in dihydrofolate reductase upon NADPH binding [6, 12] and in K-RAS as a function of point mutations . The utility supports normalization according to total ion chromatogram (TIC) intensity, so that spectra of the same protein under different conditions can be directly compared and sequence regions in which conformational changes occur can be easily identified.
To input spectral data, users paste tab-delimited list of monoisotopic masses and intensities generated from deconvolution of a high-resolution tandem mass spectrum (e.g., using THRASH or Thermo Xtract software). The program builds a library of potential fragment ions based on the input sequence containing R residues (where R is a whole number), then searches the uploaded mass list against this library with a user-specified mass or ppm error tolerance. For each amino acid position n in the sequence, the algorithm sums all an, bn, and cn ions originating from cleavage immediately C-terminal to the given residue along with all C-terminal xR-n + 1, yR-n + 1, and zR-n + 1 ions originating from cleavage N-terminal to the residue. The summed fragment ion abundances for backbone cleavages adjacent to each residue are visualized as a bar plot which can be filtered to represent contributions from all fragment ion series or from N-terminal or C-terminal series alone (Figure S1B). The search results are also displayed in a spreadsheet view showing abundance information for each ion series at each residue position, which can be pasted into a spreadsheet analysis program for further data workup (Figure S1A). The abundances of “apo” and “holo” fragment ions can be displayed individually or their contributions can be summed to show backbone cleavage efficiency at each residue.
Charge State Analysis
The charge state analysis utility allows a user to map and visualize the charge sites in a peptide or protein based on the charge states of a and x fragment ions generated from UVPD. This method has application in the identification of charge site isomers in native mass spectrometry, which can be an indicator of conformational heterogeneity . The utility takes as spectral input a list of tab-delimited values for each experimentally observed fragment ion containing its monoisotopic mass, m/z value, and abundance in the UVPD spectrum (e.g., as output by the implementation of THRASH in ProsightPC 3.0). Using this information, the program assigns a charge to each fragment ion and constructs an array of masses corresponding to each possible charge state of that fragment. Each array is searched against a library of theoretical fragment ions generated from the input sequence. The result of the search is an array with the summed abundance of all experimentally observed fragment ions for each combination of sequence position and charge state (Figure S2A).
When the search is completed, the display interface updates with a constant-height bar plot visualization of the charge states (Figure S2B). The charge state contribution at each residue position is visualized as a set of stacked bars, with the height of the bars corresponding to the abundance of fragment ions found at each charge state.
Hydrogen Elimination Monitoring
Hydrogen loss from a-type fragment ions produced by UVPD is related to the hydrogen-bonding character of the amino acid directly N-terminal to the cleavage site that resulted in production of a-type ions from the selected peptide or protein precursor ion . Quantitation of the extent of this hydrogen loss can therefore be used as a probe of hydrogen bonding interactions in gas-phase protein ions, giving insight into secondary and tertiary structure . The hydrogen-elimination monitoring (HEM) utility searches for a-ion isotope envelopes in raw UVPD mass spectral data and determines the extent of hydrogen loss using a fitting routine. This program takes as the input an amino acid sequence and a profile-mode mzML file corresponding to the UVPD mass spectrum of the peptide or protein of interest. The user specifies the charge of the precursor as well as a noise cutoff and minimum score for fitting. The Python library Pyteomics is used to read the mzML file into memory for processing .
The mass-offset searching utility is used to search a mass spectrum against a range of mass offset values in order to determine the masses of unidentified modifications (e.g., post-translational modifications, bound ligands, and metal adducts) or to gauge the number of false positive fragment ion identifications generated by an absolute mass search under a certain set of tolerances [6, 9, 12]. The large number of fragment ion types generated by UVPD, as compared to ETD or CID, results in an immense search space of possible fragment ions and necessitates special care in choosing search parameters. This utility assists in parameter selection by generating a map of fragment identifications at a variety of offset masses (Figure S4A). The noise level in this map can be evaluated for different search conditions (e.g., PPM error tolerance) in order to gauge the trade-off between the number of fragment identifications and their confidence (Figure S4B).
To perform a mass offset search, the user inputs a sequence and deconvoluted peak list using the same formatting as in the abundance calculator. A search range is then defined by entering a first mass, last mass, and mass increment. When the input is validated, the search job is submitted to the back-end server. The algorithm begins by constructing a library of mass-shifted fragment ions by adding the first mass in the search range to each canonical fragment ion, then searching the library against the inputted spectral data. The fragment ion matches are counted in total and for each individual ion series and recorded. Each mass in the search space is then raised by the user-defined mass increment, and the process is repeated iteratively until the mass increment exceeds the final mass in the range. The result is a tabulation of the number of fragment identifications as a function of the mass shift value. This result can be downloaded by the user as a CSV-formatted text file. Maxima in the plot of fragment identifications vs. mass shift correlate to potential modifications which can be further scrutinized using a fragment-mapping tool such as Prosight Lite . The noise between peaks corresponds to a baseline of false positive fragment matches arising from searching with mass shifts that do not correspond to real chemical modifications.
The suite of tools available in UV-POSIT is available to streamline data analysis for those utilizing ultraviolet photodissociation for native MS applications.
Funding from the NIH (R01GM121714 and 1K12GM102745 (fellowship to MBC)) and the Robert A. Welch Foundation (F-1155) is acknowledged.
- 2.Lorenzen, K., van Duijn, E.: Native mass spectrometry as a tool in structural biology. In: Coligan, J.E., Dunn, B.M., Speicher, D.W., Wingfield, P.T. (eds.) Current Protocols in Protein Science. John Wiley & Sons, Inc., Hoboken (2010)Google Scholar
- 12.Cammarata, M., Thyer, R., Lombardo, M., Anderson, A., Wright, D., Ellington, A., Brodbelt, J.S.: Characterization of trimethoprim resistant E. coli dihydrofolate reductase mutants by mass spectrometry and inhibition by propargyl-linked antifolates. Chem. Sci. 8, 4062–4072 (2017)CrossRefPubMedPubMedCentralGoogle Scholar
- 13.Morrison, L.J., Chai, W., Rosenberg, J.A., Henkelman, G., Brodbelt, J.S.: Characterization of hydrogen bonding motifs in proteins: hydrogen elimination monitoring by ultraviolet photodissociation mass spectrometry. Phys. Chem. Chem. Phys. 19, 20057–20074 (2017)CrossRefPubMedPubMedCentralGoogle Scholar
- 14.Oliphant, T.E.: Python for Scientific Computing. Comput. Sci. Eng. 9, 10–20 (2007)Google Scholar