Skip to main content

Accurate Decision Tree with Cost Constraints

  • Conference paper
  • First Online:
Book cover Advanced Hybrid Information Processing (ADHIP 2017)

Abstract

A decision tree is a basic classification and regression method that uses a tree structure or model of decisions and their possible consequences, including chance event outcomes, resource costs, and utility. Decision tree is an effective approach for classification. At the same time, it is also a way to display an algorithm. It serving as a classical algorithm of classification has many optimization algorithms. Even though these approaches achieve high performance, the acquirement costs of attributes are usually ignored. In some cases, the acquired costs are very different and important, the acquirement cost of attributes in decision tree could not be ignored. Existing construction approaches of cost-sensitive decision tree fail to generate the decision tree dynamically according to the given data object and cost constraint. In this paper, we attempt to solve this problem. We propose a global decision tree as the model. The proper decision tree is derived from the model dynamically according to the data object and cost constraint. For the generation of dynamic decision trees, we propose the cost-constraint-based pruning algorithm. Experimental results demonstrate that our approach outperforms C4.5 in both accuracy and cost. Even though the attribute acquirement cost in our approach is much smaller, the accuracy gap between our approach and C4.5 is also small. Additionally, for large data set, our approach outperforms C4.5 algorithm in both cost and accuracy.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Quinlan, J.R.: Induction of decision trees. Mach. Learn. 1(1), 81–106 (1986)

    Google Scholar 

  2. Basgalupp, M.P., et al.: LEGAL-tree: a lexicographic multi-objective genetic algorithm for decision tree induction. In: ACM Symposium on Applied Computing, pp. 1085–1090 (2009)

    Google Scholar 

  3. Lopes, C.M., Zaverucha, G.: HTILDE: scaling up relational decision trees for very large databases. In: ACM Symposium on Applied Computing, pp. 1475–1479 (2009)

    Google Scholar 

  4. Rodriguez, J.J., Alonso, C.J.: Interval and dynamic time warping-based decision trees. In: ACM Symposium on Applied Computing, pp. 548–552 (2004)

    Google Scholar 

  5. Ren, C., King, B.R.: Predicting protein contact maps by bagging decision trees. In: International Conference on Bioinformatics, pp. 649–650 (2014)

    Google Scholar 

  6. Blockeel, H., Struyf, J.: Efficient algorithms for decision tree cross-validation. J. Mach. Learn. Res. 3(1), 621–650 (2003)

    MATH  Google Scholar 

  7. Hong, J.: AE1: an extension matrix approximate method for the general covering problem. Int. J. Parallel Prog. 14(6), 421–437 (1985)

    MathSciNet  MATH  Google Scholar 

  8. Tu, P., Chung, J.: A new decision-tree classification algorithm for machine learning. In: International Conference on Tools with Artificial Intelligence, pp. 370–377 (1992)

    Google Scholar 

  9. Lomax, S., Vadera, S.: A survey of cost-sensitive decision tree induction algorithms. ACM Comput. Surv. 45(2), 16–25 (2013)

    Article  MATH  Google Scholar 

  10. Barros, R.C., et al.: Towards the automatic design of decision tree induction algorithms. In: Proceedings of the 13th Annual Conference Companion on Genetic and Evolutionary Computation, pp. 567–574. ACM (2011)

    Google Scholar 

  11. Quinlan, J.R.: C4.5: Programs for Machine Learning. Elsevier, Amsterdam (2014)

    Google Scholar 

Download references

Acknowledgment

This work was supported in part by the National Natural Science Foundation of China (No. 61370222), the Natural Science Foundation of Heilongjiang Province (No. F201430), the Innovation Talents Project of Science and Technology Bureau of Harbin (No. 2017RAQXJ094), and the fundamental research funds of universities in Heilongjiang Province, special fund of Heilongjiang University (No. HDJCCX-201608).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jinbao Li .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 ICST Institute for Computer Sciences, Social Informatics and Telecommunications Engineering

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Wang, N., Li, J., Liu, Y., Zhu, J., Su, J., Peng, C. (2018). Accurate Decision Tree with Cost Constraints. In: Sun, G., Liu, S. (eds) Advanced Hybrid Information Processing. ADHIP 2017. Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering, vol 219. Springer, Cham. https://doi.org/10.1007/978-3-319-73317-3_19

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-73317-3_19

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-73316-6

  • Online ISBN: 978-3-319-73317-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics