Validation of Image Segmentation by Estimating Rater Bias and Variance

Warfield, Simon K.; Zou, Kelly H.; Wells, William M.

doi:10.1007/11866763_103

Part of the book series: Lecture Notes in Computer Science (LNIP, volume 4191)

Included in the following conference series:

International Conference on Medical Image Computing and Computer-Assisted Intervention

Abstract

The accuracy and precision of segmentations of medical images have been difficult to quantify in the absence of a “ground truth” or reference standard segmentation for clinical data. Although physical or digital phantoms can help by providing a reference standard, they do not reproduce the full range of imaging and anatomical characteristics observed in clinical data.

An alternative assessment approach is to compare a segmentation against those generated by domain experts. Segmentations may be generated by raters who are trained experts or by automated image analysis algorithms. Typically these segmentations differ due to intra-rater and inter-rater variability, and the most appropriate way to compare them has been unclear.

We present here a new algorithm to enable the estimation of performance characteristics, and a true labeling, from observations of segmentations of imaging data where segmentation labels may be ordered or continuous measures. This approach may be used with, amongst others, surface, distance transform or level set representations of segmentations, and can be used to assess whether or not a rater consistently over-estimates or under-estimates the position of a boundary.
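To make the idea of estimating rater bias and variance concrete, the following is a minimal sketch and not the authors' algorithm. It assumes a simple model that the abstract does not state: each rater j reports a continuous label (for example a signed distance to a boundary) equal to a hidden true value plus a rater-specific bias plus Gaussian noise with a rater-specific variance, and the hidden true value has a flat prior. Under those assumptions an EM-style iteration alternates between estimating the true value and re-estimating each rater's bias and variance. The function names `simulate` and `estimate_bias_variance`, the zero-mean constraint on the biases, and the variance floor are all illustrative choices.

```python
import random


def simulate(n_voxels, biases, sigmas, seed=0):
    """Draw hidden true values and noisy, biased rater observations."""
    rng = random.Random(seed)
    truth = [rng.uniform(-5.0, 5.0) for _ in range(n_voxels)]
    obs = [[t + b + rng.gauss(0.0, s) for b, s in zip(biases, sigmas)]
           for t in truth]
    return truth, obs


def estimate_bias_variance(obs, n_iter=50, var_floor=1e-6):
    """EM-style estimate of per-rater bias and variance.

    obs[i][j] is the continuous label rater j assigned to voxel i.
    Assumed model (hypothetical): obs = truth + bias_j + N(0, var_j).
    Biases are only identifiable up to a common offset, so they are
    constrained to average to zero across raters.
    """
    n, r = len(obs), len(obs[0])
    bias = [0.0] * r
    var = [1.0] * r
    truth = [sum(row) / r for row in obs]
    for _ in range(n_iter):
        # E-step: posterior mean and variance of the hidden true value,
        # a precision-weighted average of the bias-corrected observations.
        w = [1.0 / v for v in var]
        w_sum = sum(w)
        post_var = 1.0 / w_sum
        truth = [sum(w[j] * (row[j] - bias[j]) for j in range(r)) / w_sum
                 for row in obs]
        # M-step: bias is the mean residual of each rater.
        bias = [sum(obs[i][j] - truth[i] for i in range(n)) / n
                for j in range(r)]
        # Move any common offset from the biases into the truth estimate.
        shift = sum(bias) / r
        bias = [b - shift for b in bias]
        truth = [t + shift for t in truth]
        # M-step: variance is the mean squared residual plus the
        # posterior uncertainty of the true value.
        var = [max(var_floor,
                   sum((obs[i][j] - truth[i] - bias[j]) ** 2
                       for i in range(n)) / n + post_var)
               for j in range(r)]
    return truth, bias, var


if __name__ == "__main__":
    _, demo_obs = simulate(2000, [1.0, 0.0, -1.0], [0.5, 1.0, 2.0])
    _, demo_bias, demo_var = estimate_bias_variance(demo_obs)
    print("bias:", [round(b, 2) for b in demo_bias])
    print("variance:", [round(v, 2) for v in demo_var])
```

In this sketch a positive estimated bias means the rater's values sit consistently above the consensus, which under a signed-distance convention would correspond to consistently over-estimating the boundary position. A large estimated variance indicates an imprecise rater. At least three raters are needed for the variances to be separable under this model.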

Author information

Authors and Affiliations

Computational Radiology Laboratory, Dept. Radiology, Children’s Hospital
Simon K. Warfield
Dept. Radiology, Brigham and Women’s Hospital, Harvard Medical School, 75 Francis St., Boston, MA, 02115, USA
Simon K. Warfield, Kelly H. Zou & William M. Wells

Editor information

Editors and Affiliations

Department of Informatics and Mathematical Modelling, Technical University of Denmark, Denmark
Rasmus Larsen
Nordic Bioscience, Herlev, Denmark
Mads Nielsen
Department of Computer Science, University of Copenhagen, Denmark
Jon Sporring

About this paper

Cite this paper

Warfield, S.K., Zou, K.H., Wells, W.M. (2006). Validation of Image Segmentation by Estimating Rater Bias and Variance. In: Larsen, R., Nielsen, M., Sporring, J. (eds) Medical Image Computing and Computer-Assisted Intervention – MICCAI 2006. MICCAI 2006. Lecture Notes in Computer Science, vol 4191. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11866763_103

DOI: https://doi.org/10.1007/11866763_103
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-44727-6
Online ISBN: 978-3-540-44728-3
eBook Packages: Computer Science, Computer Science (R0)
