Abstract
Cost-sensitive decision tree learning is very important and popular in machine learning and data mining community. There are many literatures focusing on misclassification cost and test cost at present. In real world application, however, the issue of time-sensitive should be considered in cost-sensitive learning. In this paper, we regard the cost of time-sensitive in cost-sensitive learning as waiting cost (referred to WC), a novelty splitting criterion is proposed for constructing cost-time sensitive (denoted as CTS) decision tree for maximal decrease the intangible cost. And then, a hybrid test strategy that combines the sequential test with the batch test strategies is adopted in CTS learning. Finally, extensive experiments show that our algorithm outperforms the other ones with respect to decrease in misclassification cost.
This work is partially supported by Australian large ARC grants (DP0559536 and DP0667060), a China NSF major research Program (60496327), China NSF grant for Distinguished Young Scholars (60625204), China NSF grants (60463003), an Overseas Outstanding Talent Research Program of Chinese Academy of Sciences (06S3011S01), an Overseas-Returning High-level Talent Research Program of China Ministry of Personnel, a Guangxi NSF grant, and an Innovation Project of Guangxi Graduate Education ( 2006106020812M35).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Quinlan, J.R.: C4.5: Programs for Machine Learning. Morgan Kaufmann Publishers, San Francisco (1993)
Mitchell, T.M.: Machine Learning. McGraw Hill, New York (1997)
Turney, P.D.: Types of cost in inductive concept learning. In: Workshop on Cost-Sensitive Learning at the Seventeenth International Conference on Machine Learning, Stanford University, California (2000)
Ling, C.X., Sheng, S., Yang, Q.: Test Strategies for Cost-Sensitive Decision Trees. TKDE 18(8), 1055–1067 (2006)
Yang, Q., Ling, C.X., Chai, X., Pan, R.: Test-Cost Sensitive Classification on Data with Missing Values. TKDE 18(5), 626–638 (2006)
Sheng, V.S., Ling, C.X.: Feature Value Acquisition in Testing: A Sequential Batch Test Algorithm. In: ICML’2006, pp. 809–816 (2006)
Sheng, V.S., Ling, C.X., Ni, A., Zhang, S.: Cost-Sensitive Test Strategies. In: AAAI 2006. Proceedings of the Twenty-first National Conference on Artificial Intelligence (2006)
Koopmanschap, M.A., et al.: Influence of waiting time on cost-effectiveness. Social Science & Medicine 60, 2501–2504 (2005)
Cromwell, D.A.: Waiting time information services: An evaluation of how well clearance time statistics can forecast a patient’s wait. Social Science & Medicine 59, 1937–1948 (2004)
Turney, P.D.: Cost-Sensitive Classification: Empirical Evaluation of a Hybrid Genetic Decision Tree Induction Algorithm. Journal of Artificial Intelligence Research 2, 369–409 (1995)
Qin, Y.S., et al.: Semi-parametric Optimization for Missing Data Imputation. Applied Intelligence (2007)
Zhang, C.Q., et al.: Efficient Imputation Method for Missing Values. In: PAKDD 2007. LNCS (LNAI), vol. 4426, pp. 1080–1087. Springer, Heidelberg (2007)
Zubek, V.B.: Learning Cost-Sensitive Diagnostic Policies from Data, A Ph.D Dissertation submitted to Oregon State University (2003)
Weiss, S.M., Galen, R.S., Tadepalli, P.V.: Maximizing the predictive value production rules. Artificial Intelligence 45(1-2), 47–71 (1990)
Nunez, M.: The use of background knowledge in decision tree induction. Machine Learning 6, 231–250 (1991)
Tan, M.: Cost-sensitive learning of classification knowledge and its applications in robotics. Machine. Learning Journal 13, 7–33 (1993)
Domingos, P.: MetaCost: A General Method for Making Classifiers Cost-Sensitive. Knowledge Discovery and Data Mining, 155–164 (1999)
Greiner, R., Grove, A., Roth, D.: Learning Cost-Sensitive Active Classifiers. Artificial Intelligence Journal 139(2), 137–174 (2002)
Ling, C., et al.: Decision Trees with Minimal Costs. In: Proceedings of 21st International Conference on Machine Learning, Banff, Alberta, Canada, July 4-8 (2004)
Qin, Z., Zhang, C., Zhang, S.: Cost-sensitive Decision Trees with Multiple Cost Scales. In: Webb, G.I., Yu, X. (eds.) AI 2004. LNCS (LNAI), vol. 3339, Springer, Heidelberg (2004)
Ni, A.L., Zhu, X.F., Zhang, C.Q.: Any-Cost Discovery: Learning Optimal Classification Rules? In: Zhang, S., Jarvis, R. (eds.) AI 2005. LNCS (LNAI), vol. 3809, pp. 123–132. Springer, Heidelberg (2005)
Zhang, S., Qin, Z., Ling, C., Sheng, S.: ”Missing is Useful”: Missing Values in Cost-sensitive Decision Trees. IEEE Transactions on Knowledge and Data Engineering 17(12), 1689–1693 (2005)
Blake, C.L., Merz, C.J.: UCI Repository of machine learning databases (1998), http://www.ics.uci.edu/~mlearn/MLRepository.html
Fayyad, U.M., Irani, K.B.: Multi-interval discretization of continuous-valued attributes for classification learning. In: Proceedings of the 13th International Joint Conference on Artificial Intelligence, pp. 1022–1027. Morgan Kaufmann, San Francisco (1993)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Zhang, S., Zhu, X., Zhang, J., Zhang, C. (2007). Cost-Time Sensitive Decision Tree with Missing Values. In: Zhang, Z., Siekmann, J. (eds) Knowledge Science, Engineering and Management. KSEM 2007. Lecture Notes in Computer Science(), vol 4798. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-76719-0_44
Download citation
DOI: https://doi.org/10.1007/978-3-540-76719-0_44
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-76718-3
Online ISBN: 978-3-540-76719-0
eBook Packages: Computer ScienceComputer Science (R0)