Abstract
Correct classification of tumors is an important problem in clinical oncology. Availability of gene expression profiles from DNA microarray experiments has made it possible to use computational techniques to analyze these profiles to identify the molecular biomarkers in samples and hence classify them according to various tumor types. Since gene expression data is very high dimensional and the number of samples is very small compared to the number of dimensions, selection of relevant genes from available gene expression dataset, called as feature selection, is very important to classify samples correctly and efficiently. Various approaches to feature selection have been applied by researchers in literature. In the current work, we have used a method called support vector machine recursive feature elimination for gene ranking with feed-forward neural network as classifier for evaluation of relevance of selected features for the problem of classification of three cancer types using three publicly available gene expression datasets. We compared the classification accuracy and f-score obtained using the proposed method with conventional feature selection-classification pairs—mutual information-random forest classifier, mutual information-support vector machines, particle swarm optimization-support vector machines, principal component analysis-naïve Bayes. The comparison on the basis of confusion matrix parameters—classification accuracy and f-score shows that the proposed method has a better performance as compared to other methods.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Golub TR, Slonim DK, Tamayo P, Huard C, Gaasenbeek M, Mesirov JP, Coller H, Loh H, Downi JR (1999) Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. Science 286(5439):531–538
Ang JC, Mirzal A, Haron H, Hamed H (2016) Supervised, unsupervised, and semi-supervised feature selection: a review on gene selection. IEEE/ACM Trans Comput Biol Bioinf 13(5):971–989
Sharma A, Imoto S, Miyano S (2011) A top-r feature selection algorithm for microarray gene expression data. IEEE/ACM Trans Comput Biol Bioinf 9(3):754–764
Mundra PA, Rajapakse JC (2010) SVM-RFE with MRMR filter for gene selection. IEEE/ACM Trans Nanobiosci 9(1):31–37. https://doi.org/10.1109/TNB.2009.2035284
Wang L, Chu F, Xie W (2007) Accurate cancer classification using expressions of very few genes. IEEE/ACM Trans Comput Biol Bioinf 4(1):40–53
Leung Y, Hung Y (2010) A multiple-filter-multiple-wrapper approach to gene selection and microarray data classification. IEEE/ACM Trans Comput Biol Bioinf 7(1):108–117. https://doi.org/10.1109/TCBB.2008.46
Danaee P, Ghaeini R, Hendrix DA (2017) A deep learning approach for cancer detection and relevant gene identification. In: Proceedings of symposium on biocomputing, pp 219–229
Syafiandini F, Wasito I, Yazid S, Fitriawan A, Amien M (2017) Multimodal deep boltzmann machines for feature selection on gene expression data. Proc Int Conf Adv Comput Sci Inf Syst 37:293–303
Lee Y, Lee CK (2003) Classification of multiple cancer types by multi-category support vector machines using gene expression data. Bioinformatics 19:1132–1139
Guyon I, Weston J, Barnhill S, Vapnik V (2002) Gene selection for cancer classification using support vector machines. Mach Learn 46(1):389–422.
Zhang F, Kaufman HL, Deng Y, Drabier R (2013) Recursive SVM biomarker selection for early detection of breast cancer in peripheral blood. BMC Med Genomics 6(Suppl 1):S4
Dettling M, Bühlmann P (2002) Supervised clustering of genes, Genome Biol 3(12), Article number: research0069.1–research0069.15
Acknowledgements
The authors thankfully acknowledge the financial grant received from Department of Science & Technology (DST), Government of India, under the ICPS Scheme 2017.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Koul, N., Manvi, S.S. (2020). Feature Selection from Gene Expression Data Using SVMRFE and Feed-Forward Neural Network Classifier. In: Kalya, S., Kulkarni, M., Shivaprakasha, K. (eds) Advances in Communication, Signal Processing, VLSI, and Embedded Systems. Lecture Notes in Electrical Engineering, vol 614. Springer, Singapore. https://doi.org/10.1007/978-981-15-0626-0_12
Download citation
DOI: https://doi.org/10.1007/978-981-15-0626-0_12
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-0625-3
Online ISBN: 978-981-15-0626-0
eBook Packages: EngineeringEngineering (R0)