
Overfitting of Neural Nets Under Class Imbalance: Analysis and Improvements for Segmentation

  • Conference paper

Medical Image Computing and Computer Assisted Intervention – MICCAI 2019 (MICCAI 2019)

Part of the book series: Lecture Notes in Computer Science (LNIP, volume 11766)

Abstract

Overfitting in deep learning has been the focus of a number of recent works, yet its exact impact on the behavior of neural networks is not well understood. This study analyzes overfitting by examining how the distribution of logits changes as the model overfits. Specifically, we find that when training with few data samples, the distribution of logit activations when processing unseen test samples of an under-represented class tends to shift towards, and even across, the decision boundary, while the over-represented class seems unaffected. In image segmentation, foreground samples are often heavily under-represented. We observe that the sensitivity of the model drops as a result of overfitting, while precision remains mostly stable. Based on our analysis, we derive asymmetric modifications of existing loss functions and regularizers, including a large margin loss, focal loss, adversarial training and mixup, which specifically aim at reducing the shift observed when embedding unseen samples of the under-represented class. We study the case of binary segmentation of brain tumor core and show that our proposed simple modifications lead to significantly improved segmentation performance over the symmetric variants.
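To make the idea of an asymmetric loss concrete, below is a minimal PyTorch-style sketch of an asymmetric focal loss for binary segmentation. It assumes, as one plausible reading of the abstract rather than the authors' exact formulation, that the focal modulating factor is applied only to the over-represented background class, so that uncertain foreground (under-represented) samples are not down-weighted. The function name and the default gamma are illustrative.

```python
# A sketch of an asymmetric focal loss, assuming the modulating factor is
# dropped for the foreground (under-represented) class. Illustrative only;
# not the authors' exact implementation.
import torch
import torch.nn.functional as F


def asymmetric_focal_loss(logits: torch.Tensor,
                          targets: torch.Tensor,
                          gamma: float = 2.0) -> torch.Tensor:
    """logits and targets have the same shape; targets are float values in {0, 1}."""
    # Per-voxel binary cross-entropy computed from raw logits.
    bce = F.binary_cross_entropy_with_logits(logits, targets, reduction="none")
    # Probability assigned to the true class of each voxel.
    p = torch.sigmoid(logits)
    p_t = torch.where(targets > 0.5, p, 1.0 - p)
    # Down-weight easy background voxels only; foreground terms stay unmodulated,
    # so hard samples of the under-represented class keep their full gradient.
    modulator = torch.where(targets > 0.5,
                            torch.ones_like(p_t),
                            (1.0 - p_t) ** gamma)
    return (modulator * bce).mean()
```

The same asymmetric treatment could, in principle, be applied to the other techniques named in the abstract, for example adding a margin, an adversarial perturbation, or mixup only on the foreground side.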


Change history

  • 10 October 2019

    For chapter 45:

    The original version of this chapter was revised. The value of the last column of the fifth row in Table 1 was corrected from “0.93” to “0.83.”

    For chapter 51:

    The original version of this chapter was revised. The given name and family name of an author were mixed up. The given name is Antonio, and the family name is García-Uceda Juárez.

Notes

  1. The dot product between a filter and a signal is highest when these match perfectly.
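A brief supporting note (an addition, not part of the original footnote): for a filter w and a signal patch x of fixed norms, this follows from the Cauchy–Schwarz inequality,

```latex
% Cauchy--Schwarz: the inner product of a filter w and a patch x is maximal
% when x is a positive multiple of w, i.e. when the patch "matches" the filter.
\langle w, x \rangle \le \lVert w \rVert \, \lVert x \rVert,
\quad \text{with equality iff } x = \alpha w \text{ for some } \alpha > 0.
```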


Acknowledgements

ZL is grateful for a China Scholarship Council (CSC) Imperial Scholarship. This project has received funding from the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation programme (grant No 757173, project MIRA, ERC-2017-STG) and EPSRC (EP/R511547/1).

Author information


Correspondence to Zeju Li.


Copyright information

© 2019 Springer Nature Switzerland AG

About this paper


Cite this paper

Li, Z., Kamnitsas, K., Glocker, B. (2019). Overfitting of Neural Nets Under Class Imbalance: Analysis and Improvements for Segmentation. In: Shen, D., et al. Medical Image Computing and Computer Assisted Intervention – MICCAI 2019. MICCAI 2019. Lecture Notes in Computer Science, vol. 11766. Springer, Cham. https://doi.org/10.1007/978-3-030-32248-9_45


  • DOI: https://doi.org/10.1007/978-3-030-32248-9_45

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-32247-2

  • Online ISBN: 978-3-030-32248-9

  • eBook Packages: Computer Science, Computer Science (R0)
