Abstract
Gene time series microarray experiments have been widely used to unravel the genetic machinery of biological process. However, most temporal gene expression data often contain noise, missing data points, and non-uniformly sampled time points, which will make the traditional analyzing methods to be unapplicable. One main approach to solve this problem is to reconstruct each gene expression profile as a continuous function of time. Then the continuous representation enables us to overcome problems related to sampling rate differences and missing values. In this paper, we introduce a novel reconstruction approach based on the support vector regression method. The proposed approach utilizes a framelet based kernel, which has the ability to approximate functions with multiscale structure and can reduce the influence of noise in data. To compensate the inadequate information from noisy and short gene expression data, we use its correlated genes as the test set to choose the optimal parameters. We show that this treatment can help to avoid over-fitting. Experimental results demonstrate that our method can improve the reconstruction accuracy.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Bar-Joseph, Z.: Analyzing time series gene expression data. Bioinformatics 20, 2493–2503 (2004)
Wang, X., Wu, M., Li, Z., Chan, C.: Short time-series microarray analysis: methods and challenges. BMC Syst. Biol. 2 (2008)
Bar-Joseph, Z., Gerber, G.K., Jaakkola, T.S., Gifford, D.K., Simon, I.: Continuous representations of time series gene expression data. J. Comput. Biol. 10, 341–356 (2003)
Luan, Y., Li, H.: Clustering of time-course gene expression data using a mixed-effects model with b-splines. Bioinformatics 19, 474–482 (2003)
Song, J.J., Lee, H.J., Morris, J.S., Kang, S.: Clustering of time-course gene expression data using functional data analysis. Comput. Biol. Chem. 31, 265–274 (2007)
Leng, X.Y., Müller, H.G.: Classification using functional data analysis for temporal gene expression data. Bioinformatics 22, 68–76 (2006)
Song, J.J., Deng, W.G., Lee, H.J., Kwon, D.: Optimal classification for time-course gene expression data using functional data analysis. Comput. Biol. Chem. 32, 426–432 (2008)
Bar-Joseph, Z., Gerber, G.K., Simon, I., Gifford, D.K., Jaakkola, T.: Comparing the continuous representation of time-series expression profiles to identify differentially expressed genes. Proc. Nat. Acad. Sci. U.S.A. 100, 10146–10151 (2003)
Liu, X.L., Yang, M.C.K.: Identifying temporally differentially expressed genes through functional principal components analysis. Biostatistics 10, 667–679 (2009)
Zhang, W.F., Dai, D.Q.: Spectral reflectance estimation from camera responses by support vector regression and a composite model. J. Opt. Soc. Am. A. 25, 2286–2296 (2008)
Smola, A., Schölkopf, B.: A tutorial on support vector regression. Stat. Comput. 14, 199–222 (2004)
Zhang, W.F., Dai, D.Q., Yan, H.: On a new class of framelet kernels for support vector regression and regularization networks. In: Zhou, Z.-H., Li, H., Yang, Q. (eds.) PAKDD 2007. LNCS (LNAI), vol. 4426, pp. 355–366. Springer, Heidelberg (2007)
Zhang, W.F., Dai, D.Q., Yan, H.: Framelet kernels with applications to support vector regression and regularization networks. IEEE Trans. Syst. Man Cybern. Part B Cybern. (2009) (in press)
Vapnik, V.N.: The Nature of Statistical Learning Theory. Springer, New York (1995)
Vapnik, V.N.: Statistical Learning Theory. Wiley, New York (1998)
Daubechies, I., Han, B., Ron, A., Shen, Z.: Framelets: Mra-based constructions of wavelet frames. Appl. Comput. Harmon. Anal. 124, 44–88 (2003)
Spellman, P.T., Sherlock, G., Zhang, M., Iyer, V., Anders, K., Eisen, M., Brown, P., Botstein, D., Fucher, B.: Comprehensive identification of cell cycle-regulated genes of the yeast saccharomyces cerevisiae by microarray hybridization. Mol. Biol. Cell. 9, 3273–3297 (1998)
Aach, J., Church, G.M.: Aligning gene expression time series with time warping algorithms. Bioinformatics 174, 495–508 (2001)
Chang, C.C., Lin, C.J.: LIBSVM: a library for support vector machines (2001), http://www.csie.ntu.edu.tw/~cjlin/libsvm
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Zhang, WF., Liu, CC., Yan, H. (2010). Temporal Gene Expression Profiles Reconstruction by Support Vector Regression and Framelet Kernel. In: Zhang, L., Lu, BL., Kwok, J. (eds) Advances in Neural Networks - ISNN 2010. ISNN 2010. Lecture Notes in Computer Science, vol 6064. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-13318-3_9
Download citation
DOI: https://doi.org/10.1007/978-3-642-13318-3_9
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-13317-6
Online ISBN: 978-3-642-13318-3
eBook Packages: Computer ScienceComputer Science (R0)