Cost-Time Sensitive Decision Tree with Missing Values

  • Conference paper
Knowledge Science, Engineering and Management (KSEM 2007)

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 4798)

Abstract

Cost-sensitive decision tree learning is important and popular in the machine learning and data mining communities. Much of the existing literature focuses on misclassification cost and test cost. In real-world applications, however, time sensitivity should also be considered in cost-sensitive learning. In this paper, we treat the cost of time sensitivity in cost-sensitive learning as a waiting cost (WC) and propose a novel splitting criterion for constructing a cost-time sensitive (CTS) decision tree that maximally decreases the intangible cost. A hybrid test strategy that combines the sequential test and batch test strategies is then adopted in CTS learning. Finally, extensive experiments show that our algorithm outperforms the others with respect to the decrease in misclassification cost.
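The abstract does not state the splitting criterion itself, so the following is only a minimal sketch of the general idea, assuming the total cost of a split is the sum of misclassification cost, test cost, and waiting cost, and that the attribute with the largest cost reduction is chosen. The function names, the binary-class setup, the per-example cost model, and the toy data are all hypothetical, and missing values are ignored.

```python
from collections import Counter


def leaf_cost(labels, fp_cost, fn_cost):
    """Misclassification cost of a leaf that predicts the cheaper class."""
    counts = Counter(labels)
    pos, neg = counts.get(1, 0), counts.get(0, 0)
    # Predicting positive turns every negative into a false positive;
    # predicting negative turns every positive into a false negative.
    return min(neg * fp_cost, pos * fn_cost)


def split_cost(rows, labels, attr, test_cost, wait_cost, fp_cost, fn_cost):
    """Assumed total cost after splitting on attr (not the paper's formula)."""
    groups = {}
    for row, y in zip(rows, labels):
        groups.setdefault(row[attr], []).append(y)
    # Every example pays the test cost and the waiting cost of the attribute.
    acquisition = len(labels) * (test_cost[attr] + wait_cost[attr])
    return acquisition + sum(leaf_cost(g, fp_cost, fn_cost) for g in groups.values())


def best_split(rows, labels, attrs, test_cost, wait_cost, fp_cost, fn_cost):
    """Return (attribute, cost reduction), or (None, 0.0) if no split helps."""
    base = leaf_cost(labels, fp_cost, fn_cost)
    best_attr, best_gain = None, 0.0
    for attr in attrs:
        gain = base - split_cost(rows, labels, attr, test_cost, wait_cost,
                                 fp_cost, fn_cost)
        if gain > best_gain:
            best_attr, best_gain = attr, gain
    return best_attr, best_gain


# Hypothetical toy data: "scan" separates the classes perfectly but is slow.
rows = [
    {"blood": "high", "scan": "abnormal"},
    {"blood": "high", "scan": "abnormal"},
    {"blood": "high", "scan": "abnormal"},
    {"blood": "low", "scan": "normal"},
    {"blood": "low", "scan": "normal"},
    {"blood": "low", "scan": "abnormal"},
]
labels = [1, 1, 1, 0, 0, 1]
test_cost = {"blood": 5, "scan": 20}
no_wait = {"blood": 0, "scan": 0}
with_wait = {"blood": 1, "scan": 50}

print(best_split(rows, labels, ["blood", "scan"], test_cost, no_wait, 200, 300))
print(best_split(rows, labels, ["blood", "scan"], test_cost, with_wait, 200, 300))
```

On this toy data the sketch selects `scan` when waiting costs are zero and switches to `blood` once the assumed waiting costs are charged, which illustrates why a waiting cost can change the chosen split.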

This work is partially supported by Australian large ARC grants (DP0559536 and DP0667060), a China NSF Major Research Program (60496327), a China NSF grant for Distinguished Young Scholars (60625204), a China NSF grant (60463003), an Overseas Outstanding Talent Research Program of the Chinese Academy of Sciences (06S3011S01), an Overseas-Returning High-level Talent Research Program of the China Ministry of Personnel, a Guangxi NSF grant, and an Innovation Project of Guangxi Graduate Education (2006106020812M35).

Editor information

Zili Zhang Jörg Siekmann

Copyright information

© 2007 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Zhang, S., Zhu, X., Zhang, J., Zhang, C. (2007). Cost-Time Sensitive Decision Tree with Missing Values. In: Zhang, Z., Siekmann, J. (eds) Knowledge Science, Engineering and Management. KSEM 2007. Lecture Notes in Computer Science, vol 4798. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-76719-0_44

  • DOI: https://doi.org/10.1007/978-3-540-76719-0_44

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-76718-3

  • Online ISBN: 978-3-540-76719-0

  • eBook Packages: Computer Science, Computer Science (R0)
