An Experimental Study on Decision Tree Classifier Using Discrete and Continuous Data

Jena, Monalisa; Dehuri, Satchidananda

doi:10.1007/978-981-15-1451-7_35

Monalisa Jena¹⁸ &
Satchidananda Dehuri¹⁸

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 1040))

613 Accesses
1 Citations

Abstract

Classification is one of the fundamental tasks of pattern recognition, data mining, and big data analysis. It spans across the domain for classifying novel instances whose class labels are unknown prior to the development of model. Decision trees like ID3, C4.5, and other variants for the task of classification have been widely studied in pattern recognition and data mining. The reason is that decision tree classifier is simple to understand, and its performance has been comparable with many promising classifiers. Therefore, in this work, we have developed a two-phase method of decision tree classifier for classifying continuous and discrete data effectively. In phase one, our method examines the database, whether it is a continuous-valued or discrete-valued database. If it is a continuous-valued database, then the database is discretized in this phase. In the second phase, the classifier is built and then classifies an unknown instance. To measure the performance of these two phases, we have experimented on a few datasets from the University of California, Irvine (UCI) Machine Learning repository and one artificially created dataset. The experimental evidence shows that this two-phase method of constructing a decision tree to classify an unknown instance is effective in both continuous and discrete cases.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Phyu, TN.: Survey of classification techniques in data mining. In: Proceedings of the International Multi Conference of Engineers and Computer Scientists, vol. 1, pp. 18–20 (2009)
Google Scholar
Wang, R., Kwong, S., Wang, X.Z., Jiang, Q.: Segment based decision tree induction with continuous valued attributes. IEEE Trans. Cybern. 45(7), 1262–1275 (2015)
Article Google Scholar
Loh, W.Y.: Fifty years of classification and regression trees. Int. Stat. Rev. 82(3), 329–348 (2014)
Article MathSciNet Google Scholar
Quinlan, J.R.: Decision trees and decision-making. IEEE Trans. Syst. Man Cybern. 20(2), 339–346 (1990)
Article Google Scholar
Garcia, S., Luengo, J., Saez, J.A., Lopez, V., Herrera, F.: A survey of discretization techniques: taxonomy and empirical analysis in supervised learning. IEEE Trans. Knowl. Data Eng. 25(4), 734–750 (2013)
Article Google Scholar
Quinlan, J.R.: Improved use of continuous attributes in c4.5. J. Artif. Intell. Res. 4, 77–90 (1996)
Article Google Scholar
Breiman, L.: Classification and regression trees. Routledge (2017)
Google Scholar
Han, J., Pei, J., Kamber, M.: Data mining: concepts and techniques. Elsevier (2011)
Google Scholar
Jearanaitanakij, K.: Classifying continuous data set by id3 algorithm. In: Information, Communications and Signal Processing, 2005 Fifth International Conference, pp. 1048–1051. IEEE (2005)
Google Scholar
De Sa, C.R., Soares, C., Knobbe, A.: Entropy-based discretization methods for ranking data. Inf. Sci. 329, 921–936 (2016)
Article Google Scholar
Ching, J.Y., Wong, A.K.C., Chan, K.C.C.: Class-dependent discretization for inductive learning from continuous and mixed-mode data. IEEE Trans. Pattern Anal. Mach. Intell. 17(7), 641–651 (1995)
Article Google Scholar
Liu, L., Wong, A.K.C., Wang, Y.: A global optimal algorithm for class-dependent discretization of continuous data. Intell. Data Anal. 8(2), 151–170 (2004)
Article Google Scholar
Uther, W.T., Veloso, M.M.: Tree based discretization for continuous state space reinforcement learning. In: Aaai/iaai, pp. 769–774 (1998)
Google Scholar
Chen, Y.C., Wheeler, T.A., Kochenderfer, M.J.: Learning discrete bayesian networks from continuous data. J. Artif. Intell. Res. 59, 103–132 (2017)
Article MathSciNet Google Scholar
Dheeru, D., Taniskidou, E.K.: UCI machine learning repository (2017)
Google Scholar

Download references

Acknowledgements

Thanks to Mr. Sagar Muduli, MCA student, Dept. of I & CT, F. M. University, Balasore, Odisha, for his notable contribution in this work.

Author information

Authors and Affiliations

Department of I & CT, Fakir Mohan University, 756019, Balasore, Odisha, India
Monalisa Jena & Satchidananda Dehuri

Authors

Monalisa Jena
View author publications
You can also search for this author in PubMed Google Scholar
Satchidananda Dehuri
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Monalisa Jena .

Editor information

Editors and Affiliations

School of Computer Engineering, Kalinga Institute of Industrial Technology (KIIT) Deemed to be University, Bhubaneswar, Odisha, India
Pradeep Kumar Mallick
Faculty of Engineering, Aurel Vlaicu University of Arad, Arad, Romania
Valentina Emilia Balas
Department of Electrical and Electronics Engineering, Sikkim Manipal Institute of Technology, Sikkim Manipal University, Rangpo, India
Akash Kumar Bhoi
Division of Information and Communication, Baekseok University, Cheonan-si, Ch’ungch’ong-namdo, Korea (Republic of)
Gyoo-Soo Chae

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Jena, M., Dehuri, S. (2020). An Experimental Study on Decision Tree Classifier Using Discrete and Continuous Data. In: Mallick, P., Balas, V., Bhoi, A., Chae, GS. (eds) Cognitive Informatics and Soft Computing. Advances in Intelligent Systems and Computing, vol 1040. Springer, Singapore. https://doi.org/10.1007/978-981-15-1451-7_35

Download citation

DOI: https://doi.org/10.1007/978-981-15-1451-7_35
Published: 15 January 2020
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-1450-0
Online ISBN: 978-981-15-1451-7
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics