Abstract
A decision tree is a basic classification and regression method that uses a tree-structured model of decisions and their possible consequences, including chance event outcomes, resource costs, and utility. Decision trees are an effective approach to classification and also a way to display an algorithm. As a classical classification algorithm, the decision tree has many optimized variants. Although these approaches achieve high performance, they usually ignore the cost of acquiring attributes. In some cases, acquisition costs differ greatly between attributes and are important, so they cannot be ignored when building the tree. Existing approaches to constructing cost-sensitive decision trees fail to generate the tree dynamically according to the given data object and cost constraint. In this paper, we attempt to solve this problem. We propose a global decision tree as the model, from which a suitable decision tree is derived dynamically according to the data object and cost constraint. To generate these dynamic decision trees, we propose a cost-constraint-based pruning algorithm. Experimental results demonstrate that our approach outperforms C4.5 in both accuracy and cost. Even though the attribute acquisition cost of our approach is much smaller, the accuracy gap between our approach and C4.5 is small. Additionally, for large data sets, our approach outperforms the C4.5 algorithm in both cost and accuracy.
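The abstract does not spell out the pruning algorithm, so the following is only a minimal sketch of the general idea of deriving a smaller tree from a global one under a cost budget; it is not the authors' method. It assumes each internal node stores a majority label, so that a subtree whose next attribute would exceed the budget can be treated as a leaf. The names `Node`, `classify_with_budget`, and the toy attributes and costs are hypothetical.

```python
from dataclasses import dataclass, field
from typing import Dict, Optional, Tuple


@dataclass
class Node:
    """One node of the (hypothetical) global decision tree."""
    majority_label: str                  # label returned if traversal stops here
    attribute: Optional[str] = None      # attribute tested here; None marks a leaf
    children: Dict[str, "Node"] = field(default_factory=dict)


def classify_with_budget(
    root: Node, obj: Dict[str, str], costs: Dict[str, float], budget: float
) -> Tuple[str, float]:
    """Walk the global tree, paying for each attribute on first use.

    When the next attribute would push total cost over the budget, the
    subtree is pruned on the fly and the current node's majority label
    is returned. Returns (predicted label, total cost spent).
    """
    node, spent, acquired = root, 0.0, set()
    while node.attribute is not None:
        attr = node.attribute
        extra = 0.0 if attr in acquired else costs[attr]
        if spent + extra > budget:
            break                        # cannot afford this test: stop here
        spent += extra
        acquired.add(attr)
        child = node.children.get(obj[attr])
        if child is None:
            break                        # unseen attribute value: stop here
        node = child
    return node.majority_label, spent


# Toy global tree: a cheap test at the root, an expensive one below it.
tree = Node(
    majority_label="healthy",
    attribute="temperature",
    children={
        "normal": Node(majority_label="healthy"),
        "high": Node(
            majority_label="sick",
            attribute="blood_test",
            children={
                "positive": Node(majority_label="sick"),
                "negative": Node(majority_label="healthy"),
            },
        ),
    },
)
costs = {"temperature": 1.0, "blood_test": 5.0}
patient = {"temperature": "high", "blood_test": "negative"}

full = classify_with_budget(tree, patient, costs, budget=10.0)
cheap = classify_with_budget(tree, patient, costs, budget=2.0)
```

With a budget of 10 the full path is followed, while with a budget of 2 only the cheap test is paid for and the prediction falls back to the majority label of the node where traversal stopped. This illustrates the cost and accuracy trade-off the abstract refers to, under the stated assumptions.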
References
Quinlan, J.R.: Induction of decision trees. Mach. Learn. 1(1), 81–106 (1986)
Basgalupp, M.P., et al.: LEGAL-tree: a lexicographic multi-objective genetic algorithm for decision tree induction. In: ACM Symposium on Applied Computing, pp. 1085–1090 (2009)
Lopes, C.M., Zaverucha, G.: HTILDE: scaling up relational decision trees for very large databases. In: ACM Symposium on Applied Computing, pp. 1475–1479 (2009)
Rodriguez, J.J., Alonso, C.J.: Interval and dynamic time warping-based decision trees. In: ACM Symposium on Applied Computing, pp. 548–552 (2004)
Ren, C., King, B.R.: Predicting protein contact maps by bagging decision trees. In: International Conference on Bioinformatics, pp. 649–650 (2014)
Blockeel, H., Struyf, J.: Efficient algorithms for decision tree cross-validation. J. Mach. Learn. Res. 3(1), 621–650 (2003)
Hong, J.: AE1: an extension matrix approximate method for the general covering problem. Int. J. Parallel Prog. 14(6), 421–437 (1985)
Tu, P., Chung, J.: A new decision-tree classification algorithm for machine learning. In: International Conference on Tools with Artificial Intelligence, pp. 370–377 (1992)
Lomax, S., Vadera, S.: A survey of cost-sensitive decision tree induction algorithms. ACM Comput. Surv. 45(2), 16–25 (2013)
Barros, R.C., et al.: Towards the automatic design of decision tree induction algorithms. In: Proceedings of the 13th Annual Conference Companion on Genetic and Evolutionary Computation, pp. 567–574. ACM (2011)
Quinlan, J.R.: C4.5: Programs for Machine Learning. Elsevier, Amsterdam (2014)
Acknowledgment
This work was supported in part by the National Natural Science Foundation of China (No. 61370222), the Natural Science Foundation of Heilongjiang Province (No. F201430), the Innovation Talents Project of Science and Technology Bureau of Harbin (No. 2017RAQXJ094), and the fundamental research funds of universities in Heilongjiang Province, special fund of Heilongjiang University (No. HDJCCX-201608).
Copyright information
© 2018 ICST Institute for Computer Sciences, Social Informatics and Telecommunications Engineering
About this paper
Cite this paper
Wang, N., Li, J., Liu, Y., Zhu, J., Su, J., Peng, C. (2018). Accurate Decision Tree with Cost Constraints. In: Sun, G., Liu, S. (eds) Advanced Hybrid Information Processing. ADHIP 2017. Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering, vol 219. Springer, Cham. https://doi.org/10.1007/978-3-319-73317-3_19
DOI: https://doi.org/10.1007/978-3-319-73317-3_19
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-73316-6
Online ISBN: 978-3-319-73317-3
eBook Packages: Computer Science, Computer Science (R0)