Feature Selection in DNA Microarray Classification
- 2.4k Downloads
Microarray data classification is a difficult challenge for machine learning researchers due to its large number of features and small sample sizes. Since its introduction, feature selection has been considered a de facto standard in the field, and a huge number of feature selection methods were utilized trying to reduce the input dimensionality while improving the classification performance. This chapter is devoted to reviewing the most up-to-date feature selection methods developed in this field. Section 4.1 introduces the background and first attempts to deal with this type of data. Next, Section 4.2 provides a description of the inherent problematics of microarray data, such as the small sample size, the imbalance of the data, the dataset shift or the presence of outliers. In Section 4.3 we review the state of the art on feature selection methods applied to this type of data. In Section 4.4 we present an experimental study of the most significant algorithms and evaluation techniques. A deep analysis of the findings of this study is also provided (Section 4.5). Finally, in Section 4.6, we summarize the contents of this chapter.
KeywordsFeature Selection Microarray Data Feature Subset Feature Selection Method Microarray Dataset
Unable to display preview. Download preview PDF.