Abstract
We train multi-task (variational) autoencoders on linguistic tasks and analyze the learned hidden sentence representations. The representations change significantly when translation and part-of-speech decoders are added. The more decoders are attached, the better the models cluster sentences according to their syntactic similarity, as the representation space becomes less entangled. We compare standard unconstrained autoencoders to variational autoencoders and find significant differences. We achieve better disentanglement with the standard autoencoder, which goes against recent work on variational autoencoders in the visual domain.
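The core architectural idea in the abstract is a single shared encoder whose latent code feeds several task-specific decoders (reconstruction, translation, part-of-speech tagging). The following is a minimal NumPy sketch of that shared-encoder/multiple-decoder wiring; all layer sizes, weight initializations, and names are illustrative assumptions, and the toy linear layers stand in for the recurrent sequence-to-sequence models a real implementation would use.

```python
import numpy as np

rng = np.random.default_rng(0)

def linear(in_dim, out_dim):
    """Random weight matrix for a toy linear layer (illustration only)."""
    return rng.normal(scale=0.1, size=(in_dim, out_dim))

# Toy dimensions, not the paper's actual hyperparameters.
INPUT_DIM, HIDDEN_DIM, POS_TAGS = 16, 4, 8

# One shared encoder produces the latent sentence representation.
W_enc = linear(INPUT_DIM, HIDDEN_DIM)

# Several task-specific decoders all read from the same latent code:
W_rec = linear(HIDDEN_DIM, INPUT_DIM)    # autoencoding (reconstruction) head
W_pos = linear(HIDDEN_DIM, POS_TAGS)     # part-of-speech tagging head
W_trans = linear(HIDDEN_DIM, INPUT_DIM)  # translation head

def encode(x):
    """Map an input batch to the shared latent representation."""
    return np.tanh(x @ W_enc)

def multitask_forward(x):
    """Forward pass: one latent code, one output per attached decoder."""
    z = encode(x)
    return {
        "reconstruction": z @ W_rec,
        "pos": z @ W_pos,
        "translation": z @ W_trans,
    }

x = rng.normal(size=(2, INPUT_DIM))  # batch of 2 toy "sentence" vectors
outputs = multitask_forward(x)
```

Because every decoder backpropagates through the same latent code during training, attaching more decoders constrains that code to carry the information all tasks need, which is the mechanism the abstract credits for the improved syntactic clustering.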
Copyright information
© 2019 Springer Nature Switzerland AG
Cite this paper
Brunner, G., Wang, Y., Wattenhofer, R., Weigelt, M. (2019). Disentangling the Latent Space of (Variational) Autoencoders for NLP. In: Lotfi, A., Bouchachia, H., Gegov, A., Langensiepen, C., McGinnity, M. (eds) Advances in Computational Intelligence Systems. UKCI 2018. Advances in Intelligent Systems and Computing, vol 840. Springer, Cham. https://doi.org/10.1007/978-3-319-97982-3_13
Print ISBN: 978-3-319-97981-6
Online ISBN: 978-3-319-97982-3
eBook Packages: Intelligent Technologies and Robotics (R0)