Abstract
Integration of data mining in database systems is an open topic of research. The DBMS’s power of dealing with lots of data and maintaining data integrity adds to the motivation of integrating it with data mining. We propose a method to integrate decision tree classification to do the required pre-computations and store it in database objects for later use. These pre-computed values get updated with the introduction of new data or change in the existing data for classification. Decision tree classification can readily make use of these pre-computed values to build classification models. Our approach is based on the column database to use it effectively for feature oriented calculations. This comparatively improves performance if classification is deemed to be performed on a high dimensional data.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Codd, E., Codd, S., Salley, C.: Providing OLAP (on-line analytical processing) to user-analysts: An IT mandate. Technical report, EF Codd and Associates (1993)
IBM. DB2 intelligent miner scoring (2001), http://www4.ibm.com/software/data/iminer/scoring
Oracle, Oracle 9i data mining. White Paper (2001)
Soni, S., Tang, Z., Yang, J.: Performance Study of Microsoft Data-Mining Algorithms. Microsoft White Paper pages (October 2000)
Liu, H., Motoda, H.: Feature Selection for Knowledge Discovery and Data Mining, p. 214 (1998)
Chauchat, J., Rakotomalala, R.: A new sampling strategy for building decision trees from large databases (2000)
Oracle, Oracle 9i Data Warehousing Guide (2002)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Rehman, N.U., Scholl, M.H. (2012). Enabling Decision Tree Classification in Database Systems through Pre-computation. In: MacKinnon, L.M. (eds) Data Security and Security Data. BNCOD 2010. Lecture Notes in Computer Science, vol 6121. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-25704-9_13
Download citation
DOI: https://doi.org/10.1007/978-3-642-25704-9_13
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-25703-2
Online ISBN: 978-3-642-25704-9
eBook Packages: Computer ScienceComputer Science (R0)