Learning Interpretable Disentangled Representations Using Adversarial VAEs

Sarhan, Mhd Hasan; Eslami, Abouzar; Navab, Nassir; Albarqouni, Shadi

doi:10.1007/978-3-030-33391-1_5

Learning Interpretable Disentangled Representations Using Adversarial VAEs

Conference paper
First Online: 13 October 2019

3049 Accesses
6 Citations

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 11795))

Abstract

Learning Interpretable representation in medical applications is becoming essential for adopting data-driven models into clinical practice. It has been recently shown that learning a disentangled feature representation is important for a more compact and explainable representation of the data. In this paper, we introduce a novel adversarial variational autoencoder with a total correlation constraint to enforce independence on the latent representation while preserving the reconstruction fidelity. Our proposed method is validated on a publicly available dataset showing that the learned disentangled representation is not only interpretable, but also superior to the state-of-the-art methods. We report a relative improvement of \(81.50\%\) in terms of disentanglement, \(11.60\%\) in clustering, and \(2\%\) in supervised classification with a few amount of labeled data.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

Bengio, Y., Courville, A., Vincent, P.: Representation learning: a review and new perspectives. IEEE Trans. Pattern Anal. Mach. Intell. 35(8), 1798–1828 (2013)
Article Google Scholar
Chen, T.Q., Li, X., Grosse, R.B., Duvenaud, D.K.: Isolating sources of disentanglement in variational autoencoders. In: Advances in Neural Information Processing Systems, pp. 2615–2625 (2018)
Google Scholar
Chen, X., Duan, Y., Houthooft, R., Schulman, J., Sutskever, I., Abbeel, P.: InfoGAN: Interpretable representation learning by information maximizing generative adversarial nets. In: Advances in Neural Information Processing Systems, pp. 2172–2180 (2016)
Google Scholar
Gatys, L.A., Ecker, A.S., Bethge, M.: A neural algorithm of artistic style. arXiv preprint arXiv:1508.06576 (2015)
He, K., Zhang, X., Ren, S., Sun, J.: Identity mappings in deep residual networks. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9908, pp. 630–645. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46493-0_38
Chapter Google Scholar
Higgins, I., et al.: beta-VAE: Learning basic visual concepts with a constrained variational framework. In: International Conference on Learning Representations (2017)
Google Scholar
Hinton, G.E., Krizhevsky, A., Wang, S.D.: Transforming auto-encoders. In: Honkela, T., Duch, W., Girolami, M., Kaski, S. (eds.) ICANN 2011. LNCS, vol. 6791, pp. 44–51. Springer, Heidelberg (2011). https://doi.org/10.1007/978-3-642-21735-7_6
Chapter Google Scholar
Kim, H., Mnih, A.: Disentangling by factorising. arXiv preprint arXiv:1802.05983 (2018)
Kingma, D.P., Welling, M.: Auto-encoding variational Bayes. arXiv preprint arXiv:1312.6114 (2013)
Kulkarni, T.D., Whitney, W.F., Kohli, P., Tenenbaum, J.: Deep convolutional inverse graphics network. In: Advances in Neural Information Processing Systems, pp. 2539–2547 (2015)
Google Scholar
Larsen, A.B.L., Sønderby, S.K., Larochelle, H., Winther, O.: Autoencoding beyond pixels using a learned similarity metric. arXiv preprint arXiv:1512.09300 (2015)
Locatello, F., Bauer, S., Lucic, M., Gelly, S., Schölkopf, B., Bachem, O.: Challenging common assumptions in the unsupervised learning of disentangled representations. arXiv preprint arXiv:1811.12359 (2018)
Locatello, F., Tschannen, M., Bauer, S., Rätsch, G., Schölkopf, B., Bachem, O.: Disentangling factors of variation using few labels. arXiv preprint arXiv:1905.01258 (2019)
Miotto, R., Wang, F., Wang, S., Jiang, X., Dudley, J.T.: Deep learning for healthcare: review, opportunities and challenges. Briefings Bioinform. 19(6), 1236–1246 (2017)
Article Google Scholar
Selvaraju, R.R., Cogswell, M., Das, A., Vedantam, R., Parikh, D., Batra, D.: Grad-CAM: Visual explanations from deep networks via gradient-based localization. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 618–626 (2017)
Google Scholar
Tschandl, P., Rosendahl, C., Kittler, H.: The HAM10000 dataset, a large collection of multi-source dermatoscopic images of common pigmented skin lesions. Sci. Data 5, 180161 (2018)
Article Google Scholar
Zhang, H., Goodfellow, I., Metaxas, D., Odena, A.: Self-attention generative adversarial networks. arXiv preprint arXiv:1805.08318 (2018)

Download references

Author information

Authors and Affiliations

Computer Aided Medical Procedures, Technical University of Munich, Munich, Germany
Mhd Hasan Sarhan, Nassir Navab & Shadi Albarqouni
Carl Zeiss Meditec AG, Munich, Germany
Mhd Hasan Sarhan & Abouzar Eslami
Computer Aided Medical Procedures, Johns Hopkins University, Baltimore, USA
Nassir Navab

Authors

Mhd Hasan Sarhan
View author publications
You can also search for this author in PubMed Google Scholar
Abouzar Eslami
View author publications
You can also search for this author in PubMed Google Scholar
Nassir Navab
View author publications
You can also search for this author in PubMed Google Scholar
Shadi Albarqouni
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Mhd Hasan Sarhan .

Editor information

Editors and Affiliations

Shanghai Jiaotong University, Shanghai, China
Qian Wang
NVIDIA GmbH, Munich, Germany
Fausto Milletari
University of Houston, Houston, TX, USA
Hien V. Nguyen
Technical University Munich, Munich, Germany
Shadi Albarqouni
King's College London, London, UK
M. Jorge Cardoso
NVIDIA GmbH, Munich, Germany
Nicola Rieke
NVIDIA, Santa Clara, CA, USA
Ziyue Xu
Imperial College London, London, UK
Konstantinos Kamnitsas
Johns Hopkins University, Baltimore, MD, USA
Vishal Patel
University of Houston, Houston, TX, USA
Badri Roysam
UT Southwestern Medical Center, Dallas, TX, USA
Steve Jiang
Chinese Academy of Sciences, Beijing, China
Kevin Zhou
University of Arkansas, Fayetteville, AR, USA
Khoa Luu
University of Arkansas, Fayetteville, AR, USA
Ngan Le

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Sarhan, M.H., Eslami, A., Navab, N., Albarqouni, S. (2019). Learning Interpretable Disentangled Representations Using Adversarial VAEs. In: Wang, Q., et al. Domain Adaptation and Representation Transfer and Medical Image Learning with Less Labels and Imperfect Data. DART MIL3ID 2019 2019. Lecture Notes in Computer Science(), vol 11795. Springer, Cham. https://doi.org/10.1007/978-3-030-33391-1_5

Download citation

DOI: https://doi.org/10.1007/978-3-030-33391-1_5
Published: 13 October 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-33390-4
Online ISBN: 978-3-030-33391-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Societies and partnerships

The Medical Image Computing and Computer Assisted Intervention Society (opens in a new tab)