Abstract
Decision Tree is the most influential machine learning technique for the classification of real world datasets. Entropy based traditional Decision Tree classification methods often produce poor results due to the vagueness, uncertainty and noise in the data. In the past few decades, Rough Set Theory (RST) has been succeeded in dealing with uncertainty and vagueness. Thus, incorporating Rough Set concepts in the construction of Decision Tree for the classification of vague and uncertain data, yields better results. But RST based Decision trees sometimes classifies data too excessively and leads to a major problem called model Overfitting. This problem can be rectified by the Variable Precision Rough Set Theory (VPRST); which allows some level of uncertainty and misclassification in the construction of decision tree and classifies data more accurately. Even though the classification technique is very efficient, due to the increasing dimensionalities of the data sets the accuracy of the classification model is affected. So, from this high dimensional datasets with huge set of features, there is a need to select the most promising ones that contribute maximum for classification. In this paper, the most popular swarm intelligence technique namely Artificial Bee Colony algorithm is used for selecting the robust features and the reduced data set is submitted to the VPRST based Decision Tree for classification. In turn, a Reduced VPRSDT was obtained with promising results and it outperformed the traditional methods of tree induction, both in terms of reduced tree size and significant increase in accuracy.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Quinlan JR (1995) Introduction of decision trees. Alex Publishing Corporation, USA
Mitchell TM (1997) Machine learning. Mc Graw Hill, New York
Pawlak Z (1982) Rough sets. Int J Comput Inf Sci 11:341–356
Wei J (2003) Rough set based approach to selection of node. Int J comput Cogn 1(2):25–40
Ziarko W (1993) Variable precision roughset model. J Comput Syst Sci 46(1):39–59
Gong Z, Shi Z, Yao H (2012) Variable precision rough set model for incomplete information systems and its Β-reducts. Comput Inform 31:1385–1399
Ladha L, Deepa T (2011) Feature selection methods and algorithms. Int J Comput Sci Eng (IJCSE) 3(5):1787–1797
Wei J-M, Wang M-Y, You J-P (2007) VPRSM based decision tree classifier. Comput Inform 26:663–677
Wei J-M, Wang S-Q, Wang M-Y, You J-P, Liu D-Y (2007) Rough set based approach for inducing decision trees. Knowl-Based Syst 20:695–702
Karaboga D, Akay B (2009) A comparitive study of ABC. Appl Math Comput 214(1):108–132
Jia D, Duan X, Khan MK (2014) Binary Artificial Bee Colony optimization using bitwise operation. Comput Ind Eng 76:360–365
Schiezaro M, Pedrini H (2013) Data feature selection based on Artificial Bee Colony algorithm. EURASIP J Image Video Process 2013:47
Quinlan JR (1993) C4.5: programs for machine learning. Morgan Kaufmann, CA
Liu H, Hussain F, Tan CL, Dash M (2002) Discretization: an enabling technique. Data Min Knowl Disc 6:393–423
Karaboga D, Basturk B (2007) A powerful and efficient algorithm for numerical function optimization: Artificial Bee Colony (ABC) algorithm. J Glob Optim 39:459–471
Bazan JG (2000) Rough set Algorithms in Classification Problem (Chapter 2). In: Rough set Methods and Applications. Physica-Verlag, Heidelberg
Son CS, Kim YN, Kim HS, Park HS, Kim MS (2012) Decision-making model for early diagnosis of congestive heart failure using rough set and decision tree approaches. J Biomed Inform 45:999–1008
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 The Author(s)
About this chapter
Cite this chapter
Surekha, S., Jaya Suma, G. (2016). Swarm Intelligence and Variable Precision Rough Set Model: A Hybrid Approach for Classification. In: Lakshmi, P., Zhou, W., Satheesh, P. (eds) Computational Intelligence Techniques in Health Care. SpringerBriefs in Applied Sciences and Technology(). Springer, Singapore. https://doi.org/10.1007/978-981-10-0308-0_7
Download citation
DOI: https://doi.org/10.1007/978-981-10-0308-0_7
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-0307-3
Online ISBN: 978-981-10-0308-0
eBook Packages: EngineeringEngineering (R0)