Abstract
A calibrated classifier provides reliable estimates of the true probability that each test sample is a member of the class of interest, which is crucial in decision-making tasks. Calibration procedures have been studied in weather forecasting, in game theory, and more recently in machine learning, where empirical work has shown that calibrating a classifier helps not only in decision making but also improves classification accuracy. In this paper we extend the theoretical foundation of these empirical observations. We prove that (1) a well-calibrated classifier provides bounds on the Bayes error, (2) calibrating a classifier is guaranteed not to decrease classification accuracy, and (3) the calibration procedure yields the threshold or thresholds on the decision rule that minimize the classification error. We also draw parallels and contrasts between methods based on receiver operating characteristic (ROC) curves and calibration-based procedures for finding a minimum-error threshold. In particular, calibration leads to improved performance when multiple error-minimizing thresholds exist.
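As an illustration of points (1) and (3), the sketch below calibrates held-out classifier scores with isotonic regression (one standard calibration method; the paper's results apply to any well-calibrated classifier) and then decides with the 1/2 threshold on the calibrated probability, the error-minimizing rule under 0-1 loss. The synthetic data, variable names, and use of scikit-learn are illustrative assumptions, not taken from the paper.

import numpy as np
from sklearn.isotonic import IsotonicRegression

# Illustrative sketch only: names and data are placeholders, not the
# authors' code. A real held-out set would replace the synthetic scores.
rng = np.random.default_rng(0)
scores_val = rng.uniform(0, 1, 1000)
# Labels drawn so that P(y=1 | score s) = s^2, i.e., the raw score is
# miscalibrated and needs a calibration map.
y_val = (rng.uniform(0, 1, 1000) < scores_val ** 2).astype(int)

# Fit the calibration map s -> estimated P(y = 1 | score = s).
calibrator = IsotonicRegression(y_min=0.0, y_max=1.0, out_of_bounds="clip")
calibrator.fit(scores_val, y_val)

scores_test = rng.uniform(0, 1, 500)
p_test = calibrator.predict(scores_test)  # calibrated probabilities

# (3) Under 0-1 loss, the error-minimizing rule thresholds the calibrated
# probability at 1/2; equivalently, it places one or more thresholds on the
# raw score wherever the calibration map crosses 1/2.
y_pred = (p_test >= 0.5).astype(int)

# (1) For a well-calibrated classifier, the error of this rule is
# E[min(p, 1 - p)], estimated below; it upper-bounds the Bayes error.
bayes_error_bound = np.mean(np.minimum(p_test, 1.0 - p_test))
print(f"estimated upper bound on the Bayes error: {bayes_error_bound:.3f}")

The printed quantity estimates the error of the thresholded rule on the calibrated probabilities; since the Bayes error is the minimum achievable error, this estimate bounds it from above, in the spirit of result (1).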
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
Cite this paper
Cohen, I., Goldszmidt, M. (2004). Properties and Benefits of Calibrated Classifiers. In: Boulicaut, J.-F., Esposito, F., Giannotti, F., Pedreschi, D. (eds) Knowledge Discovery in Databases: PKDD 2004. Lecture Notes in Computer Science, vol. 3202. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30116-5_14
DOI: https://doi.org/10.1007/978-3-540-30116-5_14
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-23108-0
Online ISBN: 978-3-540-30116-5