Skip to main content

Performance Evaluation of Evolutionary and Decision Tree Based Classifiers in Diversity of Datasets

  • Conference paper
  • 1719 Accesses

Abstract

The large databases of digital information are ubiquitous. Data from the neighborhood store’s checkout register, your bank’s credit card authorization device, records in your doctor’s office, patterns in your telephone calls and many more applications generate streams of digital records archived in huge databases, sometimes in so-called data warehouses A new generation of computational techniques and tools is required to support the extraction of useful knowledge from the rapidly growing volumes of data. These techniques and tools are the subject of the emerging field of knowledge discovery in databases (KDD) and data mining. Data mining plays an important role to discover important information to help in decision making of a decision support system. It has been the active area of research in the last decade. The classification is one of the important tasks of data mining. Different kind of classifiers have been suggested and tested to predict the future events based on unseen data. This paper compares the performance evaluation of evolutionary based genetic algorithm and decision tree based classifiers in diversity of datasets. The performance evaluation metrics are predictive accuracy, training time and comprehensibility. Evolutionary based classifier shows better comprehensibility over decision tree based classifiers. These classifiers show almost same predictive accuracy. Experimental results demonstrate that evolutionary approach based classifiers are slower than decision tree based classifiers. This research is helpful for organizations to select the classifiers as information generator for their decision support systems to make future policies.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Smith, U.M.F., Shapiro, G.P., Smyth, P.: The KDD process for extracting useful knowledge from volumes from data. Communication of ACM 39(11), 27–34 (1996)

    Article  Google Scholar 

  2. King, R.D., Feng, C., Sutherland, A.: STATLOG. Comparison of classification algorithms on large real-world problems. Applied Artificial Intelligence 9(3), 289–333 (1995)

    Article  Google Scholar 

  3. Breslow, L.A., Aha, D.W.: Simplifying decision trees: A survey. Knowledge Engineering Review 12, 1–40 (1997)

    Article  Google Scholar 

  4. Brodley, C.E., Utgoff, P.E.: Multivariate versus univariate decision trees, Department of Computer Science, University of Massachusetts, Amherst, MA. Technical Report 92-8 (1992)

    Google Scholar 

  5. Brown, D.E., Corruble, V., Pittard, C.L.: A comparison of decision tree classifiers with back propagation neural networks for multimodal classification problems. Pattern Recognition 26, 953–961 (1993)

    Article  Google Scholar 

  6. Curram, S.P., Mingers, J.: Neural networks, decision tree induction and discriminant analysis: An empirical comparison. Journal of the Operational Research Society 45, 440–450 (1994)

    Article  MATH  Google Scholar 

  7. Shavlik, J.W., Mooney, R.J., Towell, G.G.: Symbolic and neural learning algorithms: an empirical comparison. Machine Learning 6, 111–144 (1991)

    Google Scholar 

  8. Saroj, Bhardwaj, K.K.: A parallel genetic algorithm approach for automated discovery of censored production rules. In: AIAP 2007 Proceedings of the 25th Conference on Proceedings of the 25th IASTED International Multi-Conference: Artificial Intelligence and Applications, pp. 435–441 (2007)

    Google Scholar 

  9. UCI Repository of Machine Learning Databases. Department of Information and Computer Science University of California (1994), http://www.ics.uci.edu/~mlearn/MLRepositry.html

  10. Han, J., Kamber, M.: Data mining: concepts and techniques: Book (Illustrated), 550 pages (January 2001) ISBN-10: 1558604898, ISBN-13: 9781558604896

    Google Scholar 

  11. Quinlan, J.R.: Induction in decision trees. Journal of Machine Learning 1(1), 81–106 (2003)

    Google Scholar 

  12. Lim, T.S., Loh, W.Y., Shih, Y.S.: A Comparison of Prediction Accuracy, Complexity, and Training Time of Thirty-Three Old and New Classification Algorithms. Journal of Machine Learning 40, 203–228 (2000)

    Article  MATH  Google Scholar 

  13. Goldberg, D.E.: Genetic algorithms in search, optimization and machine learning. Addison-Wesley (1989)

    Google Scholar 

  14. Deb, K.: Genetic Algorithm in search and optimization: The techniques and Applications. In: Proceeding of Advanced Study Institute on Computational Methods for Engineering Analysis and Design, pp. 12.1–12.25 (1993)

    Google Scholar 

  15. Saroj: Genetic Algorithm: A technique to search complex space. In: Proceedings of National Seminar on Emerging Dimension in Information Technology, August 10-11, pp. 100–105 (2002)

    Google Scholar 

  16. Frietas, A.A.: A survey of evolutionary algorithms for data mining and knowledge discovery, pp. 819–845. Springer, New York (2003)

    Google Scholar 

  17. Frietas, A.A.: Data mining and knowledge discovery with evolutionary algorithms, 265 pages (2002) ISBN: 978-3-540-43331-6

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2012 ICST Institute for Computer Science, Social Informatics and Telecommunications Engineering

About this paper

Cite this paper

Kumar, P., Sehgal, V.K., Nitin, Chauhan, D.S. (2012). Performance Evaluation of Evolutionary and Decision Tree Based Classifiers in Diversity of Datasets. In: Das, V.V., Stephen, J. (eds) Advances in Communication, Network, and Computing. CNC 2012. Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering, vol 108. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-35615-5_47

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-35615-5_47

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-35614-8

  • Online ISBN: 978-3-642-35615-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics