Abstract
We introduce a new way of describing the diversity of an ensemble of classifiers, the Percentage Correct Diversity Measure, and compare it against existing methods. We then introduce two new methods for removing classifiers from an ensemble based on diversity calculations. Empirical results for twelve datasets from the UC Irvine repository show that diversity is generally modeled by our measure and ensembles can be made smaller without loss in accuracy.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Dietterich T., Ensemble methods in machine learning, Proceedings of the First International Workshop on Multiple Classifier Systems, pp. 1–15, 2000.
Breiman L., Bagging predictors, Machine Learning, Volume 24, No. 2, pp. 123–140, 1996.
Breiman L., Pasting small votes for classification in large databases and on-line, Machine Learning, Volume 36, No. 1–2, pp. 85–103, 1999.
Dietterich T., An experimental comparison of three methods for constructing ensembles of decision trees: Bagging, boosting, and randomization, Machine Learning, Volume 40, No. 2, pp. 139–158, 2000.
Breiman L., Random Forests, Machine Learning, Volume 45, No. 1, pp. 5–32, 2001.
Freund Y. and Schapire R.E., Experiments with a new boosting algorithm, Proc. 13th International Conference on Machine Learning, pp 148–156, 1996.
Kuncheva L.I., Whitaker C.J., Measures of diversity in classifier ensembles, Machine Learning, Volume 51, pp. 181–207, 2003.
Hansen L. and Salamon P., Neural network ensembles, IEEE Transactions on PAMI, 1990, V. 12, No. 10, pp. 993–1001.
Giacinto G. and Roli F., An approach to automatic design of multiple classifier systems, Pattern Recognition Letters, v. 22, pp. 25–33, 2001.
Latinne P., Debeir O., and Decaestecker C., Limiting the number of trees in random forests, Second International Workshop on Multiple Classifier Systems, pp. 178–187, 2001.
Kuncheva L., Whitaker C., Shipp C., and Duin R., Is independence good for combining classifiers?, 15th International Conference on Pattern Recognition, pp. 168–171, 2000.
Merz C.J. and Murphy P.M., UCI Repository of Machine Learning Databases, Univ. of CA., Dept. of CIS, Irvine, CA., http://www.ics.uci.edu/~mlearn/MLRepository.html
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Banfield, R.E., Hall, L.O., Bowyer, K.W., Kegelmeyer, W.P. (2003). A New Ensemble Diversity Measure Applied to Thinning Ensembles. In: Windeatt, T., Roli, F. (eds) Multiple Classifier Systems. MCS 2003. Lecture Notes in Computer Science, vol 2709. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-44938-8_31
Download citation
DOI: https://doi.org/10.1007/3-540-44938-8_31
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-40369-2
Online ISBN: 978-3-540-44938-6
eBook Packages: Springer Book Archive