Skip to main content

Concurrent discretization of multiple attributes

  • Induction (Decision Tree Pruning, Feature Selection, Feature Discretization)
  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 1531))

Abstract

Better decision trees can be learnt by merging continuous values into intervals. Merging of values, however, could introduce inconsistencies to the data, or information loss. When it is desired to maintain a certain consistency, interval mergings in one attribute could disable those in another attribute. This interaction raises the issue of determining the order of mergings. We consider a globally greedy heuristic that selects the “best” merging from all continuous attributes at each step. We present an implementation of the heuristic in which the best merging is determined in a time independent of the number of possible mergings. Experiments show that intervals produced by the heuristic lead to improved decision trees.

This is a preview of subscription content, log in via an institution.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. J. Dougherty, R. Kohavi, M. Sahami. Supervised and Unsupervised Discretization of Continuous Features. In the 12th International Conference on Machine Learning, 1995.

    Google Scholar 

  2. R. Kerber. ChiMerge: Discretization of Numeric Attributes. In Ninth National Conference on Artificial Intelligence, 1992, 123–128.

    Google Scholar 

  3. C.J. Merz, P.M. Murphy. UCI Repository of machine learnign databases [http://www.ics.uci.edu/ mlearn/MLRepository.html].

    Google Scholar 

  4. J.R. Quinlan, C4.5: Programs for Machine Learning. Los Altos, CA: Morgan Kaufmann, 1993.

    Google Scholar 

  5. J.R. Quinlan. Improved Use of Continuous Attributes in C4.5. In Journal of Artificial Intelligence Research 4, 1996, 77–90

    Google Scholar 

  6. M. Richeldi and M. Rossotto. Class-driven Statistical Discretization of Continuous Attributes. In Proc. of European Conference on Machine Learning 1995, Springer Verlag, 335–338.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Hing-Yan Lee Hiroshi Motoda

Rights and permissions

Reprints and permissions

Copyright information

© 1998 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Wang, K., Liu, B. (1998). Concurrent discretization of multiple attributes. In: Lee, HY., Motoda, H. (eds) PRICAI’98: Topics in Artificial Intelligence. PRICAI 1998. Lecture Notes in Computer Science, vol 1531. Springer, Berlin, Heidelberg. https://doi.org/10.1007/BFb0095274

Download citation

  • DOI: https://doi.org/10.1007/BFb0095274

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-65271-7

  • Online ISBN: 978-3-540-49461-4

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics