Skip to main content

Comparative Analysis of Discretization Methods for Gene Selection of Breast Cancer Gene Expression Data

  • Conference paper
  • First Online:
Computational Intelligence, Cyber Security and Computational Models

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 246))

Abstract

DNA microarrays provide an enormous amount of information about genetically conditioned susceptibility to diseases. However, their analysis is uneasy because the number of genes is extremely large with respect to the number of experiments. The problem is that all genes are not essential in gene expression data. Some of the genes may be redundant, and others may be irrelevant and noisy. This research paper studies the gene expression data using rough set theory; it is an intelligent computing method. In this paper, we studied and implemented following discretization methods such as rough discretization (RD), naïve Bayes, max–min, equal width intervals, K-means-based discretization, and entropy-based discretization (EBD) for gene selection using rough set quick reduct (QR) for breast cancer gene expression data. Further, the performance of the above algorithms has been evaluated using classification tools available in Weka software and BPN classifier.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 299.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 379.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. D. Slezak and J Wroblewski, “Rough Discretization of Gene Expression Data” International Conference on Hybrid Information Technology, Nov 9, 2006.

    Google Scholar 

  2. E. N. Sathishkumar, K. Thangavel and T. Chandrasekhar,”A Novel Approach for Single Gene Selection Using Clustering and Dimensionality Reduction”, International Journal of Scientific & Engineering Research, Volume 4, Issue 5, May-2013. ISSN 2229–5518.

    Google Scholar 

  3. Hu, K., Lu, Y and Shi C., “Feature ranking in Rough sets”, AI Communications 2003, pp 41–50.

    Google Scholar 

  4. Jensen, R. and Shen, Q., “Finding Rough Set Reducts with ant colony optimization”. Proceeding of the 2003 UK Workshop on Computing Intelligence 2003, pp. 15–22.

    Google Scholar 

  5. T. Chandrasekhar, K. Thangavel and E.N. Sathishkumar, “Verdict Accuracy of Quick Reduct Algorithm using Clustering and Classification Techniques for Gene Expression Data”, International Journal of Computer Science Issues, Vol. 9, Issue 1, No 1, January 2012. ISSN (Online): 1694–0814.

    Google Scholar 

  6. Yong Li et.al., “Comparative study of discretization methods of microarray data for inferring transcriptional regulatory networks”, BMC Bioinformatics 2010, 1471–2105/11/520.

    Google Scholar 

  7. Zhi-yong Yan, Cong-fu Xu, Yun-he Pan “Improving naive Bayes classifier by dividing its decision regions”, Journal of Zhejiang University-SCIENCE C (Computers & Electronics), Apr. 8, 2011.

    Google Scholar 

Download references

Acknowledgments

The first author gratefully acknowledges the partial financial assistance under University Research Fellowship, Periyar University, Salem-11, Tamil Nadu, India.

The second author gratefully acknowledges the UGC, New Delhi, for partial financial assistance under UGC-SAP (DRS) Grant No. F.3–50/2011.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to E. N. Sathishkumar .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer India

About this paper

Cite this paper

Sathishkumar, E.N., Thangavel, K., Nishama, A. (2014). Comparative Analysis of Discretization Methods for Gene Selection of Breast Cancer Gene Expression Data. In: Krishnan, G., Anitha, R., Lekshmi, R., Kumar, M., Bonato, A., Graña, M. (eds) Computational Intelligence, Cyber Security and Computational Models. Advances in Intelligent Systems and Computing, vol 246. Springer, New Delhi. https://doi.org/10.1007/978-81-322-1680-3_40

Download citation

  • DOI: https://doi.org/10.1007/978-81-322-1680-3_40

  • Published:

  • Publisher Name: Springer, New Delhi

  • Print ISBN: 978-81-322-1679-7

  • Online ISBN: 978-81-322-1680-3

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics