Feature Extraction and Classification of Microarray Cancer Data Using Intelligent Techniques

Bai, Anita; Pradhan, Anima

doi:10.1007/978-81-322-1665-0_133


Part of the book series: Advances in Intelligent Systems and Computing (AISC, volume 243)


Abstract

Feature extraction plays an important role in improving classifier performance. Microarray data contain a large number of features but only a small number of samples. In this paper, we address dimension reduction of DNA microarray features, extracting the relevant features from among thousands of irrelevant ones. This improves both the speed and the accuracy of the classifiers. Principal component analysis (PCA) is a powerful statistical technique for representing \( I \)-dimensional data in a lower-dimensional space without significant loss of information. The aim is to project the original \( I \)-dimensional space onto an \( I_{0} \)-dimensional linear subspace, where \( I > I_{0} \), such that the variance in the data is maximally explained within the smaller \( I_{0} \)-dimensional space. This addresses the curse of dimensionality, which arises when the number of features is large and the number of samples is small. A support vector machine (SVM) is implemented, and its performance is measured in terms of predictive accuracy, specificity, and sensitivity. First, we apply PCA to extract significant features and then train an SVM on the reduced feature set. Second, we attempt to validate our results on two public data sets (ovarian and colon).
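The PCA projection and the three evaluation measures named in the abstract can be illustrated with a minimal sketch. It assumes NumPy and uses synthetic random data; the function names `pca_project` and `binary_metrics` are hypothetical and chosen for illustration, and nothing here is taken from the authors' implementation. The SVM training step is left out: the projected matrix `Z` would simply be passed to whichever SVM library is used.

```python
import numpy as np


def pca_project(X, n_components):
    """Project the rows of X onto the top principal components.

    X has shape (n_samples, n_features). Returns (Z, explained_ratio):
    Z has shape (n_samples, n_components) and explained_ratio[i] is the
    fraction of the total variance captured by component i.
    """
    X = np.asarray(X, dtype=float)
    Xc = X - X.mean(axis=0)  # centre each feature (gene) at zero
    # A thin SVD avoids forming the n_features x n_features covariance
    # matrix, which is costly when features far outnumber samples.
    _, s, Vt = np.linalg.svd(Xc, full_matrices=False)
    Z = Xc @ Vt[:n_components].T  # coordinates in the reduced space
    var = s ** 2  # proportional to the variance along each component
    explained_ratio = var[:n_components] / var.sum()
    return Z, explained_ratio


def _ratio(num, den):
    """Return num / den, or NaN when the denominator is zero."""
    return num / den if den else float("nan")


def binary_metrics(y_true, y_pred, positive=1):
    """Accuracy, sensitivity and specificity for a two-class problem."""
    y_true = np.asarray(y_true)
    y_pred = np.asarray(y_pred)
    tp = int(np.sum((y_true == positive) & (y_pred == positive)))
    tn = int(np.sum((y_true != positive) & (y_pred != positive)))
    fp = int(np.sum((y_true != positive) & (y_pred == positive)))
    fn = int(np.sum((y_true == positive) & (y_pred != positive)))
    return {
        "accuracy": _ratio(tp + tn, tp + tn + fp + fn),
        "sensitivity": _ratio(tp, tp + fn),  # true positive rate
        "specificity": _ratio(tn, tn + fp),  # true negative rate
    }


if __name__ == "__main__":
    # Synthetic stand-in for a microarray matrix: 20 samples, 500 features.
    rng = np.random.default_rng(0)
    X = rng.normal(size=(20, 500))
    Z, ratio = pca_project(X, n_components=5)
    print("reduced shape:", Z.shape)
    print("explained variance ratio:", ratio)
    print(binary_metrics([1, 1, 1, 0, 0], [1, 1, 0, 0, 1]))
```

In practice the number of retained components \( I_{0} \) is often chosen by inspecting the cumulative explained variance ratio; the abstract does not state which criterion the authors used.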




Author information

Authors and Affiliations

Department of Computer Science and Engineering, NIT Rourkela, Rourkela, India
Anita Bai & Anima Pradhan


Corresponding author

Correspondence to Anita Bai.

Editor information

Editors and Affiliations

Computer Science and Engineering, National Institute of Technology Rourkela, Rourkela, Orissa, India
Durga Prasad Mohapatra
Computer Science and Engineering, SOA University, Bhubaneswar, India
Srikanta Patnaik


Cite this paper

Bai, A., Pradhan, A. (2014). Feature Extraction and Classification of Microarray Cancer Data Using Intelligent Techniques. In: Mohapatra, D.P., Patnaik, S. (eds) Intelligent Computing, Networking, and Informatics. Advances in Intelligent Systems and Computing, vol 243. Springer, New Delhi. https://doi.org/10.1007/978-81-322-1665-0_133


DOI: https://doi.org/10.1007/978-81-322-1665-0_133
Publisher Name: Springer, New Delhi
Print ISBN: 978-81-322-1664-3
Online ISBN: 978-81-322-1665-0
eBook Packages: Engineering, Engineering (R0)
