A Novel Two-Stage Alignment Method for Liquid Chromatography Mass Spectrometry-Based Metabolomics

  • Xiaoli Wei
  • Xue Shi
  • Seongho Kim
  • Craig McClain
  • Xiang Zhang
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 7389)


We report a novel two-stage alignment algorithm that contains full alignment and partial alignment, for the analysis of LC-MS based metabolomics data. The purpose of full alignment is to detect landmark peaks that present in all peak lists to be aligned. These peaks were first selected based on m/z value and isotopic peak profile matching. After removing peaks with large Euclidian distance of retention time from the potential landmark peaks, a mixture score was calculated to measure the matching quality of each landmark peak pair between reference peak list and a test peak list. After optimizing the weight factor in the mixture score, the value of minimum mixture score of all landmark peaks was used as the threshold for peak matching in the partial alignment. A local optimization based retention time correction method was used to correct the retention time changes between peak lists during partial alignment. The two-stage alignment method was used to analyze a spiked-in experimental data and further compared with literature reported algorithm RANSAC implemented in MZmine.


LC-MS two-stage peak list alignment local optimization 


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Zhang, X., et al.: Data pre-processing in Liquid Chromatography-mass Spectrometry-based Proteomics. Bioinformatics 21(21), 4054–4059 (2005)CrossRefGoogle Scholar
  2. 2.
    Wang, B., et al.: DISCO: Distance and Spectrum Correlation Optimization Alignment for two-dimensional Gas Chromatography time-of-flight Mass Spectrometry-based Metabolomics. Anal. Chem. 82(12), 5069–5081 (2010)CrossRefGoogle Scholar
  3. 3.
    Benton, H.P., et al.: XCMS2: Processing Tandem Mass Spectrometry Data for Metabolite Identification and Structural Characterization. Anal. Chem. 80(16), 6382–6389 (2008)CrossRefGoogle Scholar
  4. 4.
    Tautenhahn, R., Bottcher, C., Neumann, S.: Highly Sensitive Feature Detection for High Resolution LC/MS. BMC Bioinformatics 9, 504 (2008)CrossRefGoogle Scholar
  5. 5.
    Pluskal, T., et al.: MZmine 2: Modular Framework for Processing, Visualizing, and Analyzing Mass Spectrometry-based Molecular Profile Data. BMC Bioinformatics 11, 395 (2010)CrossRefGoogle Scholar
  6. 6.
    Draper, J., et al.: Metabolite Signal Identification in Accurate Mass Metabolomics Data with MZedDB, an Interactive m/z Annotation Tool Utilising Predicted Ionisation Behaviour ’rules’. BMC Bioinformatics 10, 227 (2009)CrossRefGoogle Scholar
  7. 7.
    Sturm, M., et al.: OpenMS-An Open-source Software Framework for Mass Spectrometry. BMC Bioinformatics 9 (2008)Google Scholar
  8. 8.
    Wei, X.L., et al.: MetSign: A Computational Platform for High-Resolution Mass Spectrometry-Based Metabolomics. Analytical Chemistry 83(20), 7668–7675 (2011)CrossRefGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2012

Authors and Affiliations

  • Xiaoli Wei
    • 1
  • Xue Shi
    • 1
  • Seongho Kim
    • 2
  • Craig McClain
    • 3
    • 4
    • 5
    • 6
  • Xiang Zhang
    • 1
  1. 1.Departments of ChemistryUniversity of LouisvilleLouisvilleUSA
  2. 2.Bioinformatics and BiostatisticsUniversity of LouisvilleLouisvilleUSA
  3. 3.MedicineUniversity of LouisvilleLouisvilleUSA
  4. 4.Pharmacology & ToxicologyUniversity of LouisvilleLouisvilleUSA
  5. 5.Alcohol Research CenterUniversity of LouisvilleLouisvilleUSA
  6. 6.Robley Rex Louisville VAMCUniversity of LouisvilleLouisvilleUSA

Personalised recommendations