Abstract
Mass spectrometry (MS) is a technology recently used for high dimensionality detection of proteins in proteomics. However, due to the high resolution and noise of MS data (MALDI-TOF), almost existing MS analysis algorithms are not robust with noise and run slowly. Developing new ones is necessary to analyze such data. In this paper, we propose a novel feature extraction method considering the inherent noise of mass spectra. The proposed method combines stationary wavelet transformation (SWT) and bivariate shrinkage estimator for MS feature extraction and denoising. Then, statistical feature testing is applied to denoised wavelet coefficients to select significant features used for biomarker identification. To evaluate the effectiveness of proposed method, a double cross-validation support vector machine classifier, which has high generalizability, and a fast Modest AdaBoost classifier, which improves significantly experimental runtime, are applied for cancer classification based on selected features by proposed method. Several experiments are carried out to evaluate the performance of our proposed methods. The results show that our proposed method can be an effective tool for analyzing MS data.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Morris JS, Coombes KR, Koomen J, Baggerly KA, Kobayashi R (2005) Feature extraction and quantification for mass spectrometry in biomedical applications using the mean spectrum. Bioinformatics 21(9):1764–1775
Petricoin EF 3rd, Ornstein DK, Paweletz CP, Ardekani A, Hackett PS, Hitt BA, Velassco A, Trucco C, Wiegand L, Wood K, Simone CB, Levine PJ, Linehan WM, Emmert-Buck MR, Steinberg SM, Kohn EC, Liotta LA (2002) Serum proteomic patterns for detection of prostate cancer. J Natl Cancer Inst 94(20):1576–1578
Ransohoff DF (2004) Rules of evidence for cancer molecular-marker discovery and validation. Nat Rev Cancer 4(4):309–314
Yu JS, Ongarello S, Fiedler R, Chen XW, Toffolo G, Cobelli C, Trajanoski Z (2005) Ovarian cancer identification based on dimensionality reduction for high-throughput mass spectrometry data. Bioinformatics 21(10):2200–2209
de Noo ME, Mertens BJA, Ozalp A, Bladergroen MR, van der Werff MPJ, van de Velde CJH, Deelder AM, Tollenaar RAEM (2006) Detection of colorectal cancer using maldi-tof serum protein profiling. Eur J Cancer 42(8):1068–1076
Alexandrov T, Decker J, Mertens B, Deelder AM, Tollenaar RAEM, Maass P, Thiele H (2009) Biomarker discovery in maldi-tof serum protein profiles using discrete wavelet transformation. Bioinformatics 25(5):643–649
Schleif FM, Lindemann M, Diaz M, Maa P, Decker J, Elssner T, Kuhn M, Thiele H (2009) Support vector classification of proteomic profile spectra based on feature extraction with the bi-orthogonal discrete wavelet transform. Comput Visual Sci 12:189–199
Ressom HW, Varghese RS, Drake SK, Hortin GL, Abdel-Hamid M, Loffredo CA, Goldman R (2007) Peak selection from maldi-tof mass spectra using ant colony optimization. Bioinformatics 23(5):619–626
Coifman RR, Donoho DL (1995) Translation-invariant de-noising. Technical report, Department of Statistics
Donoho DL (1995) De-noising by soft-thresholding. Info Theory IEEE Trans 41(3):613–627
Sendur L, Selesnick IW (2002) Bivariate shrinkage with local variance estimation. IEEE Signal Process Lett 9(12):438–441
Sendur L, Selesnick IW (2002) Bivariate shrinkage functions for wavelet-based denoising exploiting interscale dependency. IEEE Trans Signal Process 50(11):2744–2756
Donoho DL, Johnstone JM (1994) Ideal spatial adaptation by wavelet shrinkage. Biometrika 81(3):425–455
Mertens BJA, De Noo ME, Tollenaar RAEM, Deelder AM (2006) Mass spectrometry proteomic diagnosis: enacting the double cross-validatory paradigm. J Comput Biol 13(9):1591–1605
Schapire RE (1999) A brief introduction to boosting. In: Ijcai-99: Proceedings of the sixteenth international joint conference on artificial intelligence, Vols 1 and 2. pp 1401–1406, 1452
Friedman J, Hastie T, Tibshirani R (2000) Additive logistic regression: a statistical view of boosting. Ann Stat 28(2):337–374
Vezhnevets A, Vezhnevets V (2005) Modest adaboost-teaching adaboost to generalize better. Graphicon-2005. Novosibirsk Akademgorodok, Russia
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer Science+Business Media B.V.
About this paper
Cite this paper
Pham, P., Yu, L., Nguyen, M., Nguyen, N. (2011). Fast Cancer Classification Based on Mass Spectrometry Analysis in Robust Stationary Wavelet Domain. In: Park, J., Arabnia, H., Chang, HB., Shon, T. (eds) IT Convergence and Services. Lecture Notes in Electrical Engineering, vol 107. Springer, Dordrecht. https://doi.org/10.1007/978-94-007-2598-0_21
Download citation
DOI: https://doi.org/10.1007/978-94-007-2598-0_21
Published:
Publisher Name: Springer, Dordrecht
Print ISBN: 978-94-007-2597-3
Online ISBN: 978-94-007-2598-0
eBook Packages: EngineeringEngineering (R0)