Abstract
Data mining has gained immense popularity in various fields of medical, education and industry as well. Data mining is a process of predicting the result and extraction of useful information from huge dataset. In this paper, we have surveyed various data mining techniques. Further, performance of various data mining techniques, namely decision tree, random forest, naive Bayes, AdaBoost, multilayer perception neural network, radial basis function, sequential minimal optimization and decision stump, have been evaluated using UCI communities and crime dataset for classifying crime in US states. On the basis of results obtained, we found that the decision tree outperforms with 96.4% accuracy and minimal false-positive rate.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Kirkos E, Spathis C (2007) Data mining techniques for the detection of fraudulent financial statements. Expert Syst Appl: Int J 995–1003
Merceron A, Yacef K (2005) Educational data mining: a case study. In: Proceedings of the 2005 conference on artificial intelligence in education: supporting learning through intelligent and socially informed technology. IOS Press, Amsterdam, Netherland, pp 467–474
Bâra A, Lungu I (2012) Improving decision support systems. In: Advances in data mining knowledge discovery and applications, pp 397–417
Lakshmi BN, Raghunandhan G (2011) A conceptual overview of data mining. In: 2011 national conference on innovations in emerging technology (NCOIET). IEEE, Erode, Tamil Nadu, pp 27–32
Purwar A, Singh SK (2014) Issues in data mining: a comprehensive survey. In: 2014 IEEE international conference on computational intelligence and computing research (ICCIC). IEEE, Coimbatore, pp 1–6
Chen L, Li X, Yang Y (2016) Personal health indexing based on medical examinations. Decis Support Syst 54–65
Shouman M, Turner T (2012) Using data mining techniques in heart disease diagnosis and treatment. In: 2012 Japan-Egypt conference electronics, communications and computers (JEC-ECC). IEEE, Alexandria, pp 173–177
Kumar S, Toshniwal D (2015) A data mining framework to analyze road accident data. J Big Data
Bahari TF, Elayidom MS (2015) An efficient CRM-data mining framework for the prediction of customer behaviour. In: Proceedings of the international conference on information and communication technologies, ICICT 2014. Elsevier, Kochi, pp 725–731
Anand SS, Grobelnik M (2007) Knowledge discovery standards. Artif Intell Rev 21–56
Han J, Kamber M (2012) Data mining concept and techniques. Elsevier, USA
Crone SF, Lessmann S (2006) The impact of preprocessing on data mining: an evaluation of classifier sensitivity in direct marketing. Eur J Oper Res 781–800
Ramaswami M, Bhaskaran R (2009) A study on feature selection techniques in educational data mining. J Comput 7–11
Barros RC, Basgalupp MP (2012) A survey of evolutionary algorithms for decision-tree induction. IEEE Trans Syst Man Cybern Part C: Appl Rev
Mantaras RL (1991) A distance-based attribute selection measure. Mach Learn 81–92
Quinlan JR (1986) Induction of decision trees. Mach Learn 81–106
Breiman L (2001) Random forests. Mach Learn 5–32
Kulkarni VY, Sinha PK (2013) Random forest classifiers: a survey and future research direction. Int J Adv Comput
Breiman L (1996) Bagging predictor. Mach Learn 123–140
Schapire RE (2002) The boosting approach to machine learning: an overview. In: Nonlinear estimation and classification, pp 149–171
Platt JC (1999) Fast training of support vector machines using sequential minimal optimization. MIT Press, Cambridge, MA, USA
Verma B (2002) Fast training of multilayer perceptrons. IEEE Trans Neural Netw 1314–1320
Delashmit WH, Manry MT (2005) Recent developments in multilayer perceptron neural networks. In: Proceedings of the 7th annual memphis area engineering and science conference
Orr MJ (1996) Introduction to radial basis function network
Oyang Y-J, Hwang S-C, Ou Y-Y, Chen CY, Chen ZW (2005) Data classification with radial basis function networks based on a novel kernel density estimation algorithm. IEEE Trans Neural Netw 225–236
Schapire RE (2013) Explaining AdaBoost. In: Empirical inference, pp 37–52
Choy M (2010) Building decision trees from decision stumps
Iba W, Langley P (1992) Induction of one-level decision tree. In: Proceedings of the ninth international workshop on machine learning, ML ’92, USA, pp 233–240
Akinola OS, Afolabi AC (2012) Evaluating classification effectiveness on sequential minimal optimization (SMO) algorithm chemical parameterization of granitoids. IJRRAS
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Mani, Bharti Suri, Manoj Kumar (2018). Performance Evaluation of Data Mining Techniques. In: Mishra, D., Nayak, M., Joshi, A. (eds) Information and Communication Technology for Sustainable Development. Lecture Notes in Networks and Systems, vol 9. Springer, Singapore. https://doi.org/10.1007/978-981-10-3932-4_39
Download citation
DOI: https://doi.org/10.1007/978-981-10-3932-4_39
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-3931-7
Online ISBN: 978-981-10-3932-4
eBook Packages: EngineeringEngineering (R0)