Comparative Analysis of Discretization Methods for Gene Selection of Breast Cancer Gene Expression Data

Sathishkumar, E. N.; Thangavel, K.; Nishama, A.

doi:10.1007/978-81-322-1680-3_40

E. N. Sathishkumar⁸,
K. Thangavel⁸ &
A. Nishama⁸

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 246))

1534 Accesses
3 Citations

Abstract

DNA microarrays provide an enormous amount of information about genetically conditioned susceptibility to diseases. However, their analysis is uneasy because the number of genes is extremely large with respect to the number of experiments. The problem is that all genes are not essential in gene expression data. Some of the genes may be redundant, and others may be irrelevant and noisy. This research paper studies the gene expression data using rough set theory; it is an intelligent computing method. In this paper, we studied and implemented following discretization methods such as rough discretization (RD), naïve Bayes, max–min, equal width intervals, K-means-based discretization, and entropy-based discretization (EBD) for gene selection using rough set quick reduct (QR) for breast cancer gene expression data. Further, the performance of the above algorithms has been evaluated using classification tools available in Weka software and BPN classifier.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 299.00; Price excludes VAT (USA)

Softcover Book: USD 379.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

D. Slezak and J Wroblewski, “Rough Discretization of Gene Expression Data” International Conference on Hybrid Information Technology, Nov 9, 2006.
Google Scholar
E. N. Sathishkumar, K. Thangavel and T. Chandrasekhar,”A Novel Approach for Single Gene Selection Using Clustering and Dimensionality Reduction”, International Journal of Scientific & Engineering Research, Volume 4, Issue 5, May-2013. ISSN 2229–5518.
Google Scholar
Hu, K., Lu, Y and Shi C., “Feature ranking in Rough sets”, AI Communications 2003, pp 41–50.
Google Scholar
Jensen, R. and Shen, Q., “Finding Rough Set Reducts with ant colony optimization”. Proceeding of the 2003 UK Workshop on Computing Intelligence 2003, pp. 15–22.
Google Scholar
T. Chandrasekhar, K. Thangavel and E.N. Sathishkumar, “Verdict Accuracy of Quick Reduct Algorithm using Clustering and Classification Techniques for Gene Expression Data”, International Journal of Computer Science Issues, Vol. 9, Issue 1, No 1, January 2012. ISSN (Online): 1694–0814.
Google Scholar
Yong Li et.al., “Comparative study of discretization methods of microarray data for inferring transcriptional regulatory networks”, BMC Bioinformatics 2010, 1471–2105/11/520.
Google Scholar
Zhi-yong Yan, Cong-fu Xu, Yun-he Pan “Improving naive Bayes classifier by dividing its decision regions”, Journal of Zhejiang University-SCIENCE C (Computers & Electronics), Apr. 8, 2011.
Google Scholar

Download references

Acknowledgments

The first author gratefully acknowledges the partial financial assistance under University Research Fellowship, Periyar University, Salem-11, Tamil Nadu, India.

The second author gratefully acknowledges the UGC, New Delhi, for partial financial assistance under UGC-SAP (DRS) Grant No. F.3–50/2011.

Author information

Authors and Affiliations

Department of Computer Science, Periyar University, Salem, 636011, India
E. N. Sathishkumar, K. Thangavel & A. Nishama

Authors

E. N. Sathishkumar
View author publications
You can also search for this author in PubMed Google Scholar
K. Thangavel
View author publications
You can also search for this author in PubMed Google Scholar
A. Nishama
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to E. N. Sathishkumar .

Editor information

Editors and Affiliations

Applied Mathematics and Computational Sciences, PSG College of Technology, Coimbatore, Tamil Nadu, India
G. Sai Sundara Krishnan
Applied Mathematics and Computational Sciences, PSG College of Technology, Coimbatore, Tamil Nadu, India
R. Anitha
Applied Mathematics and Computational Sciences, PSG College of Technology, Coimbatore, Tamil Nadu, India
R. S. Lekshmi
Applied Mathematics and Computational Sciences, PSG College of Technology, Coimbatore, Tamil Nadu, India
M. Senthil Kumar
Department of Mathematics, Ryerson University, Toronto, Ontario, Canada
Anthony Bonato
University of Basque Country, Paseo Manuel De Lardizalbal 1, San Sebastian, Spain
Manuel Graña

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Sathishkumar, E.N., Thangavel, K., Nishama, A. (2014). Comparative Analysis of Discretization Methods for Gene Selection of Breast Cancer Gene Expression Data. In: Krishnan, G., Anitha, R., Lekshmi, R., Kumar, M., Bonato, A., Graña, M. (eds) Computational Intelligence, Cyber Security and Computational Models. Advances in Intelligent Systems and Computing, vol 246. Springer, New Delhi. https://doi.org/10.1007/978-81-322-1680-3_40

Download citation

DOI: https://doi.org/10.1007/978-81-322-1680-3_40
Published: 27 November 2013
Publisher Name: Springer, New Delhi
Print ISBN: 978-81-322-1679-7
Online ISBN: 978-81-322-1680-3
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics