Classification of DNA Microarray Data with Random Forests

Stokowy, Tomasz

doi:10.1007/978-3-642-13105-9_31

Tomasz Stokowy³

Part of the book series: Advances in Intelligent and Soft Computing ((AINSC,volume 69))

1377 Accesses
1 Citations

Abstract

The article includes information about the advantages of Random Forests in DNA microarray data classification. The experiment presented as a background for the publication was performed on the data devoted to Barrett’s Esophagus and two types of Reflux Disease - Erosive and Nonerosive. An original idea of estimation of a quality of the classification evolved during studies on the problem and resulted in many interesting conclusions. There are presented topics such as advantages of Random Forests in supervised DNA microarray analysis, application of bootstrap resampling used for calculation of average quality results and comparison of classification quality for Random Forests, Support Vector Machines and Linear Discriminant Analysis. Proposed solutions are said to be a good measure of quality of classification with Random Forests method.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 259.00; Price excludes VAT (USA)

Softcover Book: USD 329.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Breiman, L., Cutler, A.: Random Forests. Machine Learning, 5–32, doi:10.1023/A:1010933404324
Google Scholar
Koronacki, J., Cwik, J.: Statystyczne systemy uczące sie. Oficyna wydawnicza EXIT, Warszawa, p. 136, pp. 162–164 (2008) (in Polish)
Google Scholar
Hastie, T., Tibshirani, R., Friedman, J.: The Elements of Statistical Learning: Data Mining, Inference, and Prediction, July 30. Springer, Heidelberg (2003) (Corrected edition), ISBN-13:978-0387952840
Google Scholar
Jarzab, B., et al.: Gene expression profile of papillary thyroid cancer: Sources of variability and diagnostic implications. Cancer Res. 65, 1587–1597 (2005)
Article Google Scholar
Efron, B., Tibshirani, R.J.: Improvements on cross-validation: the 632+ bootstrap method. J. American Statistical Association 92, 548–560 (1997)
Article MATH MathSciNet Google Scholar
Diaz-Uriarte, R., Alvarez de Andres, S.: Gene selection and classification of microarray data using random forest. BMC Bioinformatics 7, 3 (2006)
Article Google Scholar
Liaw, A., Wiener, M.: Classification and Regression by random Forest. R News 2(3), 18–22 (2002), http://CRAN.R-project.org/doc/Rnews/
Google Scholar

Download references

Author information

Authors and Affiliations

Institute of Automatic Control, Silesian University of Technology, ul. Akademicka 16, 44-100, Gliwice
Tomasz Stokowy

Authors

Tomasz Stokowy
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Faculty of Automatic Control, Electronics and Computer Science, Silesian University of Technology, ul. Akademicka 16, 44-100, Gliwice, Poland
Ewa Piȩtka & Jacek Kawa &

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Stokowy, T. (2010). Classification of DNA Microarray Data with Random Forests. In: Piȩtka, E., Kawa, J. (eds) Information Technologies in Biomedicine. Advances in Intelligent and Soft Computing, vol 69. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-13105-9_31

Download citation

DOI: https://doi.org/10.1007/978-3-642-13105-9_31
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-13104-2
Online ISBN: 978-3-642-13105-9
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics