Abstract
DNA microarrays provide an enormous amount of information about genetically conditioned susceptibility to diseases. However, their analysis is uneasy because the number of genes is extremely large with respect to the number of experiments. The problem is that all genes are not essential in gene expression data. Some of the genes may be redundant, and others may be irrelevant and noisy. This research paper studies the gene expression data using rough set theory; it is an intelligent computing method. In this paper, we studied and implemented following discretization methods such as rough discretization (RD), naïve Bayes, max–min, equal width intervals, K-means-based discretization, and entropy-based discretization (EBD) for gene selection using rough set quick reduct (QR) for breast cancer gene expression data. Further, the performance of the above algorithms has been evaluated using classification tools available in Weka software and BPN classifier.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
D. Slezak and J Wroblewski, “Rough Discretization of Gene Expression Data” International Conference on Hybrid Information Technology, Nov 9, 2006.
E. N. Sathishkumar, K. Thangavel and T. Chandrasekhar,”A Novel Approach for Single Gene Selection Using Clustering and Dimensionality Reduction”, International Journal of Scientific & Engineering Research, Volume 4, Issue 5, May-2013. ISSN 2229–5518.
Hu, K., Lu, Y and Shi C., “Feature ranking in Rough sets”, AI Communications 2003, pp 41–50.
Jensen, R. and Shen, Q., “Finding Rough Set Reducts with ant colony optimization”. Proceeding of the 2003 UK Workshop on Computing Intelligence 2003, pp. 15–22.
T. Chandrasekhar, K. Thangavel and E.N. Sathishkumar, “Verdict Accuracy of Quick Reduct Algorithm using Clustering and Classification Techniques for Gene Expression Data”, International Journal of Computer Science Issues, Vol. 9, Issue 1, No 1, January 2012. ISSN (Online): 1694–0814.
Yong Li et.al., “Comparative study of discretization methods of microarray data for inferring transcriptional regulatory networks”, BMC Bioinformatics 2010, 1471–2105/11/520.
Zhi-yong Yan, Cong-fu Xu, Yun-he Pan “Improving naive Bayes classifier by dividing its decision regions”, Journal of Zhejiang University-SCIENCE C (Computers & Electronics), Apr. 8, 2011.
Acknowledgments
The first author gratefully acknowledges the partial financial assistance under University Research Fellowship, Periyar University, Salem-11, Tamil Nadu, India.
The second author gratefully acknowledges the UGC, New Delhi, for partial financial assistance under UGC-SAP (DRS) Grant No. F.3–50/2011.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer India
About this paper
Cite this paper
Sathishkumar, E.N., Thangavel, K., Nishama, A. (2014). Comparative Analysis of Discretization Methods for Gene Selection of Breast Cancer Gene Expression Data. In: Krishnan, G., Anitha, R., Lekshmi, R., Kumar, M., Bonato, A., Graña, M. (eds) Computational Intelligence, Cyber Security and Computational Models. Advances in Intelligent Systems and Computing, vol 246. Springer, New Delhi. https://doi.org/10.1007/978-81-322-1680-3_40
Download citation
DOI: https://doi.org/10.1007/978-81-322-1680-3_40
Published:
Publisher Name: Springer, New Delhi
Print ISBN: 978-81-322-1679-7
Online ISBN: 978-81-322-1680-3
eBook Packages: EngineeringEngineering (R0)