Temporal Consistency Objectives Regularize the Learning of Disentangled Representations

Valvano, Gabriele; Chartsias, Agisilaos; Leo, Andrea; Tsaftaris, Sotirios A.

doi:10.1007/978-3-030-33391-1_2

Gabriele Valvano^22,23,
Agisilaos Chartsias²³,
Andrea Leo²² &
…
Sotirios A. Tsaftaris²³

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 11795))

Included in the following conference series:

2821 Accesses
6 Citations
2 Altmetric

Abstract

There has been an increasing focus in learning interpretable feature representations, particularly in applications such as medical image analysis that require explainability, whilst relying less on annotated data (since annotations can be tedious and costly). Here we build on recent innovations in style-content representations to learn anatomy, imaging characteristics (appearance) and temporal correlations. By introducing a self-supervised objective of predicting future cardiac phases we improve disentanglement. We propose a temporal transformer architecture that given an image conditioned on phase difference, it predicts a future frame. This forces the anatomical decomposition to be consistent with the temporal cardiac contraction in cine MRI and to have semantic meaning with less need for annotations. We demonstrate that using this regularization, we achieve competitive results and improve semi-supervised segmentation, especially when very few labelled data are available. Specifically, we show Dice increase of up to 19% and 7% compared to supervised and semi-supervised approaches respectively on the ACDC dataset. Code is available at: https://github.com/gvalvano/sdtnet.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Bai, W., et al.: Recurrent neural networks for aortic image sequence segmentation with sparse annotations. In: Frangi, A.F., Schnabel, J.A., Davatzikos, C., Alberola-López, C., Fichtinger, G. (eds.) MICCAI 2018. LNCS, vol. 11073, pp. 586–594. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-00937-3_67
Chapter Google Scholar
Bengio, Y., Courville, A., Vincent, P.: Representation learning: a review and new perspectives. IEEE PAMI 35(8), 1798–1828 (2013)
Article Google Scholar
Bengio, Y., et al.: Learning deep architectures for AI. Found. Trends Mach. Learn. 2(1), 1–127 (2009)
Article MathSciNet Google Scholar
Bernard, O., et al.: Deep learning techniques for automatic MRI cardiac multi-structures segmentation and diagnosis: is the problem solved? IEEE TMI 37(11), 2514–2525 (2018)
Google Scholar
Chartsias, A., et al.: Disentangled representation learning in cardiac image analysis. Med. Image Anal. 58, 101535 (2019)
Article Google Scholar
Chen, X., Duan, Y., Houthooft, R., Schulman, J., Sutskever, I., Abbeel, P.: InfoGAN: Interpretable representation learning by information maximizing generative adversarial nets. In: NeurIPS, pp. 2172–2180 (2016)
Google Scholar
Hsieh, J.T., Liu, B., Huang, D.A., Fei-Fei, L.F., Niebles, J.C.: Learning to decompose and disentangle representations for video prediction. In: NeurIPS, pp. 517–526 (2018)
Google Scholar
Kingma, D.P., Ba, J.: Adam: a method for stochastic optimization. In: ICLR (2015)
Google Scholar
Kingma, D.P., Welling, M.: Auto-encoding variational bayes. In: ICLR (2014)
Google Scholar
Lee, H.Y., Tseng, H.Y., Huang, J.B., Singh, M., Yang, M.H.: Diverse image-to-image translation via disentangled representations. In: ECCV, pp. 35–51 (2018)
Google Scholar
Mao, X., Li, Q., Xie, H., Lau, R.Y.K., Wang, Z., Smolley, S.P.: On the effectiveness of least squares generative adversarial networks. IEEE PAMI PP(99), 1–13 (2018)
Article Google Scholar
Qin, C., et al.: Joint Learning of motion estimation and segmentation for cardiac MR image sequences. In: Frangi, A.F., Schnabel, J.A., Davatzikos, C., Alberola-López, C., Fichtinger, G. (eds.) MICCAI 2018. LNCS, vol. 11071, pp. 472–480. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-00934-2_53
Chapter Google Scholar
Qin, C., Shi, B., Liao, R., Mansi, T., Rueckert, D., Kamen, A.: Unsupervised deformable registration for multi-modal images via disentangled representations. arXiv preprint arXiv:1903.09331 (2019)
Ronneberger, O., Fischer, P., Brox, T.: U-Net: convolutional networks for biomedical image segmentation. In: Navab, N., Hornegger, J., Wells, W.M., Frangi, A.F. (eds.) MICCAI 2015. LNCS, vol. 9351, pp. 234–241. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-24574-4_28
Chapter Google Scholar
Smith, L.N.: Cyclical learning rates for training neural networks. In: 2017 IEEE WACV, pp. 464–472. IEEE (2017)
Google Scholar
Van Steenkiste, S., Locatello, F., Schmidhuber, J., Bachem, O.: Are disentangled representations helpful for abstract visual reasoning? arXiv preprint arXiv:1905.12506 (2019)
Wood, J.N.: A smoothness constraint on the development of object recognition. Cognition 153, 140–145 (2016)
Article Google Scholar

Download references

Acknowledgements

This work was supported by the Erasmus+ programme of the European Union, during an exchange between IMT School for Advanced Studies Lucca and the School of Engineering, University of Edinburgh. S.A. Tsaftaris acknowledges the support of the Royal Academy of Engineering and the Research Chairs and Senior Research Fellowships scheme. We thank NVIDIA Corporation for donating the Titan Xp GPU used for this research.

Author information

Authors and Affiliations

IMT School for Advanced Studies Lucca, Piazza S. Francesco, 55100, Lucca, LU, Italy
Gabriele Valvano & Andrea Leo
School of Engineering, University of Edinburgh, West Mains Rd, Edinburgh, EH9 3FB, UK
Gabriele Valvano, Agisilaos Chartsias & Sotirios A. Tsaftaris

Authors

Gabriele Valvano
View author publications
You can also search for this author in PubMed Google Scholar
Agisilaos Chartsias
View author publications
You can also search for this author in PubMed Google Scholar
Andrea Leo
View author publications
You can also search for this author in PubMed Google Scholar
Sotirios A. Tsaftaris
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Gabriele Valvano .

Editor information

Editors and Affiliations

Shanghai Jiaotong University, Shanghai, China
Qian Wang
NVIDIA GmbH, Munich, Germany
Fausto Milletari
University of Houston, Houston, TX, USA
Hien V. Nguyen
Technical University Munich, Munich, Germany
Shadi Albarqouni
King's College London, London, UK
M. Jorge Cardoso
NVIDIA GmbH, Munich, Germany
Nicola Rieke
NVIDIA, Santa Clara, CA, USA
Ziyue Xu
Imperial College London, London, UK
Konstantinos Kamnitsas
Johns Hopkins University, Baltimore, MD, USA
Vishal Patel
University of Houston, Houston, TX, USA
Badri Roysam
UT Southwestern Medical Center, Dallas, TX, USA
Steve Jiang
Chinese Academy of Sciences, Beijing, China
Kevin Zhou
University of Arkansas, Fayetteville, AR, USA
Khoa Luu
University of Arkansas, Fayetteville, AR, USA
Ngan Le

1 Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary material 1 (pdf 689 KB)

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Valvano, G., Chartsias, A., Leo, A., Tsaftaris, S.A. (2019). Temporal Consistency Objectives Regularize the Learning of Disentangled Representations. In: Wang, Q., et al. Domain Adaptation and Representation Transfer and Medical Image Learning with Less Labels and Imperfect Data. DART MIL3ID 2019 2019. Lecture Notes in Computer Science(), vol 11795. Springer, Cham. https://doi.org/10.1007/978-3-030-33391-1_2

Download citation

DOI: https://doi.org/10.1007/978-3-030-33391-1_2
Published: 13 October 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-33390-4
Online ISBN: 978-3-030-33391-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Societies and partnerships

The Medical Image Computing and Computer Assisted Intervention Society (opens in a new tab)