Abstract
Feature selection is an important data pre-processing step that comes before applying a machine learning algorithm. It removes irrelevant and redundant attributes from the dataset with an aim of improving the algorithm performance. There exist feature selection methods which focus on discovering features that are most suitable. These methods include wrappers, a subroutine of the learning algorithm itself, and filters, which discover features according to heuristics, based on the data characteristics and not tied to a specific algorithm. This paper improves the filter approach by enabling it to select strongly relevant and weakly relevant features and gives room to the researcher to decide which of the weakly relevant features to include. This new approach brings clarity and understandability to the feature selection preprocessing step.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Bratu, C.V., Muresan, T., Potolea, R.: Improving Classification Accuracy through Feature Selection. In: Proceedings of the 4th IEEE International Conference on Intelligent Computer Communication and Processing, pp. 25–32. IEEE Press, New York (2008)
Garner, S.R.: Weka, The Waikato Environment for Knowledge Analysis. In: Proceedings of the New Zealand Computer Science Research Students Conference, New Zealand, pp. 57–64 (1995)
Hall, M.A.: Feature Selection for Discrete and Numeric Class Machine Learning. Technical report, University of Waikato, Department of Computer Science, New Zealand (1999)
Kotsiantis, S., Pierrakeas, C., Pintelas, P.: Efficiency of Machine Learning Techniques in Predicting Students Performance in Distance Learning Systems. Applied Artificial Intelligence 18(5), 411–426 (2004)
Kohavi, R., John, G., Long, R., Manley, D., Pfleger, K.: MLC++: A Machine Learning Library in C++. In: Proceedings of the Sixth International Conference on Tools with Artificial Intelligence, pp. 740–743. IEEE Press, New York (1994)
Mitchell, T.M.: Machine Learning. McGraw-Hill, Inc., New York (1997)
Quinlan, J.R.: C4.5: Programs for Machine Learning. Morgan Kaufmann Publishers Inc., San Francisco (1993)
Yu, L., Liu, H.: Efficient Feature Selection via Analysis of Relevance and Redundancy. Journal of Machine Learning Research 5, 1205–1224 (2004)
Hassanien, A.E., Suraj, Z., Slezak, D., Lingras, P.: Rough computing: Theories, technologies and applications. IGI Publishing Hershey, PA (2008)
Kessler, F.C.: Understanding Descriptive Statistics, http://nationalatlas.gov/articles/mapping/a_statistics.html
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Mgala, M., Mbogho, A. (2014). Selecting Relevant Features for Classifier Optimization. In: Hassanien, A.E., Tolba, M.F., Taher Azar, A. (eds) Advanced Machine Learning Technologies and Applications. AMLTA 2014. Communications in Computer and Information Science, vol 488. Springer, Cham. https://doi.org/10.1007/978-3-319-13461-1_21
Download citation
DOI: https://doi.org/10.1007/978-3-319-13461-1_21
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-13460-4
Online ISBN: 978-3-319-13461-1
eBook Packages: Computer ScienceComputer Science (R0)