Skip to main content

Bitmap Index-Based Decision Trees

  • Conference paper
Book cover Foundations of Intelligent Systems (ISMIS 2005)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 3488))

Included in the following conference series:

Abstract

In this paper we propose an original approach to apply data mining algorithms, namely decision tree-based methods, taking into account not only the size of processed databases but also the processing time. The key idea consists in constructing a decision tree, within the DBMS, using bitmap indices. Indeed bitmap indices have many useful properties such as the count and bit-wise operations. We will show that these operations are efficient to build decision trees. In addition, by using bitmap indices, we don’t need to access raw data. This implies clear improvements in terms of processing time.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Agrawal, R., Mannila, H., Srikant, R., et al.: Fast discovery of association rules. Advances in Kowledge Discovery and Data Mining, 307–328 (1996)

    Google Scholar 

  2. Bentayeb, F., Darmont, J.: Decision tree modeling with relational views. In: Hacid, M.-S., Raś, Z.W., Zighed, D.A., Kodratoff, Y. (eds.) ISMIS 2002. LNCS (LNAI), vol. 2366, pp. 423–431. Springer, Heidelberg (2002)

    Chapter  Google Scholar 

  3. Bentayeb, F., Darmont, J., Udréa, C.: Efficient integration of data mining techniques in dbmss. In: 8th International Database Engineering and Applications Symposium (IDEAS 2004), Portugal (2004)

    Google Scholar 

  4. Chauchat, J.H., Rakotomalala, R.: A new sampling strategy for building decision trees from large databases. In: 7th Conference of the International Federation of Classification Societies (IFCS 2000), Belgium (2000)

    Google Scholar 

  5. Chaudhuri, S.: Data mining and database systems: Where is the intersection? Data Engineering Bulletin 21(1), 4–8 (1998)

    Google Scholar 

  6. Dunkel, B., Soparkar, N.: Data organization and access for efficient data mining. In: ICDE, pp. 522–529 (1999)

    Google Scholar 

  7. Gehrke, J., Ramakrishnan, R., Ganti, V.: Rainforest - a framework for fast decision tree construction of large datasets. In: 24th International Conference on Very Large Data Bases (VLDB 1998), USA (1998)

    Google Scholar 

  8. Geist, I., Sattler, K.U.: Towards data mining operators in database systems: Algebra and implementation. In: 2nd International Workshop on Databases, Documents, and Information Fusion, DBFusion 2002 (2002)

    Google Scholar 

  9. Han, J., Fu, Y., Wang, W., Koperski, K., Zaiane, O.: DMQL: A data mining query language for relational databases. In: SIGMOD 1996 Workshop on Research Issues in Data Mining and Knowledge Discovery (DMKD 1996), Canada (1996)

    Google Scholar 

  10. Imielinski, T., Virmani, A.: Msql: A query language for database mining. DataMining and Knowledge Discovery: An International Journal 3, 373–408 (1999)

    Article  Google Scholar 

  11. Liu, H., Motoda, H.: Feature Selection for knowledge discovery and data mining. Kluwer Academic Publishers, Dordrecht (1998)

    MATH  Google Scholar 

  12. Meo, R., Psaila, G., Ceri, S.: A new SQL-like operator for mining association rules. The VLDB Journal, 122–133 (1996)

    Google Scholar 

  13. Oracle. Oracle 9i data mining. White paper (June 2001)

    Google Scholar 

  14. Quinlan, J.R.: Induction of decision trees. Machine Learning 1, 81–106 (1986)

    Google Scholar 

  15. Ramesh, G., Maniatty, W., Zaki, M.: Indexing and data access methods for database mining (2001)

    Google Scholar 

  16. Sarawagi, S., Thomas, S., Agrawal, R.: Integrating mining with relational database systems: Alternatives and implications. In: ACM SIGMOD International Conference on Management of Data (SIGMOD 1998), USA (1998)

    Google Scholar 

  17. Soni, S., Tang, Z., Yang, J.: Performance study microsoft data mining algorithms. Technical report, Microsoft Corp. (2001)

    Google Scholar 

  18. Toivonen, H.: Sampling large databases for association rules. In: Proc. 1996 Int. Conf. Very Large Data Bases (1996)

    Google Scholar 

  19. Wang, H., Zaniolo, C., Luo, C.R.: Atlas: a small but complete sql extension for data mining and data streams. In: 29th VLDB Conference, Germany (2003)

    Google Scholar 

  20. Zighed, D.A., Rakotomalala, R.: Sipina-w(c) for windows: User’s guide. Technical report, ERIC laboratory, University of Lyon 2, France (1996)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Favre, C., Bentayeb, F. (2005). Bitmap Index-Based Decision Trees. In: Hacid, MS., Murray, N.V., Raś, Z.W., Tsumoto, S. (eds) Foundations of Intelligent Systems. ISMIS 2005. Lecture Notes in Computer Science(), vol 3488. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11425274_7

Download citation

  • DOI: https://doi.org/10.1007/11425274_7

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-25878-0

  • Online ISBN: 978-3-540-31949-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics