A Novel Hybrid Method of Gene Selection and Its Application on Tumor Classification

You, Zhuhong; Wang, Shulin; Gui, Jie; Zhang, Shanwen

doi:10.1007/978-3-540-85984-0_127

Zhuhong You^1,2,
Shulin Wang¹,
Jie Gui^1,2 &
…
Shanwen Zhang¹

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 5227))

Included in the following conference series:

International Conference on Intelligent Computing

2149 Accesses
4 Citations

Abstract

Microarray gene expression profile data is used to accurately predict different tumor types, which has great value in providing better treatment and toxicity minimization on the patients. However, it is difficult to classify different tumor types using microarray data because the number of samples is much smaller than the number of genes. It has been proved that a small feature gene subset can improve classification accuracy, so feature gene selection and extraction algorithm is very important in tumor classification. In this paper, a novel hybrid gene selection method is proposed to find a feature gene subset so that the feature genes related to certain cancer can be kept and the redundant genes can be leave out. In the proposed method, we combine the advantages of the PCA and the LDA and proposed a novel feature gene extraction scheme. We also compared several kinds of parametric and non-parametric feature gene selection methods. We use the SVM as the classifier in the experiment and compare the performance of three common SVM kernels. Their differences are analyzed. Using the n-fold cross validation, the proposed algorithm is carried out on three published benchmark tumor datasets and experimental results show that this algorithm leads to better classification performance than other methods.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 189.00; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Harrington, C.A., Rosenow, C., Retief, J.: Monitoring Gene Expression Using DNA Microarrays. Int. J. Current Opinion in Microbiology 3(3), 285–291 (2000)
Article Google Scholar
Patra, J.C., Lim, G.P., Meher, P.K.: DNA Microarray Data Analysis: Effective Feature Selection for Accurate Cancer Classification. In: IJCNN 2007, pp. 260–265 (2007)
Google Scholar
Kohavi, R., John, G.H.: Wrapper for Feature Subset Selection. Artif. Intell. 97(1/2), 273–324 (1997)
Article MATH Google Scholar
Zhang, H.P., Yu, C.Y., Singer, B., Xiong, M.M.: Recursive Partitioning for Tumor Classification with Gene Expression Microarray Data. PNAS 98(12), 6730–6735 (2001)
Article Google Scholar
Chu, W., Ghahramani, Z., Falciani, F., Wild, D.L.: Biomarker Discovery in Microarray Gene Expression Data with Gaussian Processes. Bioinformatics 21(16), 3385–3393 (2005)
Article Google Scholar
Brown, M.P.S., Grundy, W.N., Lin, D., Cristianini, N., Sugnet, C., Agnes, J.M., Haussler, D.: Support Vector Machine Classification of Microarray Gene Expression Data. Technical Report, U. California (Santa Cruz) (1999)
Google Scholar
Golub, T.R., Slonim, D.K., Tamayo, P., Huard, C., Gaasenbeek, M., Mesirov, J.P., Coller, H., Loh, M.L., Downing, J.R., Caligiuri, M.A., Bloomfield, C.D., Lander, E.S.: Molecular Classification of Cancer: Class Discovery and Class Prediction by Gene Expression Monitoring. Science 286, 531–537 (1999)
Article Google Scholar
Guyon, I., Weston, J., Barnhill, S.: Gene Selection for Cancer Classification Using Support Vector Machines. Mach. Learn. 46, 389–422 (2002)
Article MATH Google Scholar
Guyon, I., Elisseeff, A.: An Introduction to Variable and Feature Selection. Journal of Machine Learning Research, 1157–1182 (2003)
Google Scholar
Wang, Y.H., Makedon, F.S., Ford, J.C., Pearlman, J.: HykGene: A Hybrid Approach for Selecting Marker Genes for Phenotype Classification Using Microarray Gene Expression Data. Bioinformatics 21(8), 1530–1537 (2005)
Article Google Scholar
Deng, L., Pei, J., Ma, J., Lee, D.L.: A Rank Sum Test Method for Informative Gene Discovery. In: Proceedings of the 10th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD 2004), Seattle, WA, USA, pp. 22–25 (2004)
Google Scholar
Lehmann, E.L.: Non-parametrics: Statistical Methods Based on Ranks. Holden-Day, San Francisco (1975)
Google Scholar
Liu, Z.Q., Chen, D.C., Bensmail, H.: Gene Expression Data Classification with Kernel Principal Component Analysis. Journal of Biomedicine and Biotechnology, 155–159 (2005)
Google Scholar
Joliffe, I.T.: Principal Component Analysis, 2nd edn. Springer, New York (2002)
Google Scholar
Niijima, S., Okuno, Y.: Laplacian Linear Discriminant Analysis Approach to Unsupervised Feature Selection. IEEE/ACM Transactions on Computational Biology and Bioinformatics (to appear, 2008)
Google Scholar
Vapnik, V.N.: Statistical Learning Theory. Wiley, New York (1992)
Google Scholar
Burges, C.: A Tutorial on Support Vector Machines for Pattern Recognition. Kluwer Academic Publishers, Dordrecht (1998)
Google Scholar
Wang, S.L., Wang, J., Chen, H.W., Tang, W.S.: The Classification of Tumor Using Gene Expression Profile Based on Support Vector Machines and Factor Analysis. In: Intelligent Systems Design and Applications, Jinan, China, pp. 471–476. IEEE Computer Society Press, Los Alamitos (2006)
Chapter Google Scholar
Chang, C.C., Lin, C.J.: LIBSVM: A Library for Support Vector Machines (2001), http://www.csie.ntu.edu.tw/~cjlin/libsvm

Download references

Author information

Authors and Affiliations

Intelligent Computing Lab, Hefei Institute of Intelligent Machines, Chinese Academy of Sciences, P.O. Box 1130, Hefei, Anhui, 230031, China
Zhuhong You, Shulin Wang, Jie Gui & Shanwen Zhang
Department of Automation, University of Science and Technology of China, Hefei, Anhui, 230027, China
Zhuhong You & Jie Gui

Authors

Zhuhong You
View author publications
You can also search for this author in PubMed Google Scholar
Shulin Wang
View author publications
You can also search for this author in PubMed Google Scholar
Jie Gui
View author publications
You can also search for this author in PubMed Google Scholar
Shanwen Zhang
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

De-Shuang Huang Donald C. Wunsch II Daniel S. Levine Kang-Hyun Jo

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

You, Z., Wang, S., Gui, J., Zhang, S. (2008). A Novel Hybrid Method of Gene Selection and Its Application on Tumor Classification. In: Huang, DS., Wunsch, D.C., Levine, D.S., Jo, KH. (eds) Advanced Intelligent Computing Theories and Applications. With Aspects of Artificial Intelligence. ICIC 2008. Lecture Notes in Computer Science(), vol 5227. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-85984-0_127

Download citation

DOI: https://doi.org/10.1007/978-3-540-85984-0_127
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-85983-3
Online ISBN: 978-3-540-85984-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics