Generating Large Labeled Data Sets for Laparoscopic Image Processing Tasks Using Unpaired Image-to-Image Translation

Pfeiffer, Micha; Funke, Isabel; Robu, Maria R.; Bodenstedt, Sebastian; Strenger, Leon; Engelhardt, Sandy; Roß, Tobias; Clarkson, Matthew J.; Gurusamy, Kurinchi; Davidson, Brian R.; Maier-Hein, Lena; Riediger, Carina; Welsch, Thilo; Weitz, Jürgen; Speidel, Stefanie

doi:10.1007/978-3-030-32254-0_14

Micha Pfeiffer¹⁶,
Isabel Funke¹⁶,
Maria R. Robu^17,18,
Sebastian Bodenstedt¹⁶,
Leon Strenger¹⁶,
Sandy Engelhardt¹⁹,
Tobias Roß²⁰,
Matthew J. Clarkson^17,18,
Kurinchi Gurusamy²¹,
Brian R. Davidson²¹,
Lena Maier-Hein²⁰,
Carina Riediger²²,
Thilo Welsch²²,
Jürgen Weitz^22,23 &
…
Stefanie Speidel^16,23

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 11768))

Included in the following conference series:

International Conference on Medical Image Computing and Computer-Assisted Intervention

8655 Accesses
44 Citations
4 Altmetric

Abstract

In the medical domain, the lack of large training data sets and benchmarks is often a limiting factor for training deep neural networks. In contrast to expensive manual labeling, computer simulations can generate large and fully labeled data sets with a minimum of manual effort. However, models that are trained on simulated data usually do not translate well to real scenarios. To bridge the domain gap between simulated and real laparoscopic images, we exploit recent advances in unpaired image-to-image translation. We extend an image-to-image translation method to generate a diverse multitude of realistically looking synthetic images based on images from a simple laparoscopy simulation. By incorporating means to ensure that the image content is preserved during the translation process, we ensure that the labels given for the simulated images remain valid for their realistically looking translations. This lets us generate a large, fully labeled synthetic data set. We show that this data set can be used to train models for the task of liver segmentation in laparoscopic images. We achieve median dice scores of up to 0.89 in some patients without manually labeling a single laparoscopic image and show that using our synthetic data to pre-train models can greatly improve their performance. The synthetic data set is made publicly available, fully labeled with segmentation maps, depth maps, normal maps, and positions of tools and camera (http://opencas.dkfz.de/image2image).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
Data set and code available at: http://opencas.dkfz.de/image2image/.

References

Bujwid, S., Martí, M., Azizpour, H., Pieropan, A.: GANtruth - an unpaired image-to-image translation method for driving scenarios (2018)
Google Scholar
Chu, C., Zhmoginov, A., Sandler, M.: CycleGAN, a Master of Steganography. ArXiv abs/1712.02950 (2017)
Google Scholar
Deng, J., Dong, W., Socher, R., Li, L.J., Li, K., Fei-Fei, L.: ImageNet: a large-scale hierarchical image database. In: CVPR 2009 (2009)
Google Scholar
Gibson, E., et al.: Deep residual networks for automatic segmentation of laparoscopic videos of the liver (2017)
Google Scholar
Huang, S.-W., Lin, C.-T., Chen, S.-P., Wu, Y.-Y., Hsu, P.-H., Lai, S.-H.: AugGAN: cross domain adaptation with GAN-based data augmentation. In: Ferrari, V., Hebert, M., Sminchisescu, C., Weiss, Y. (eds.) ECCV 2018. LNCS, vol. 11213, pp. 731–744. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-01240-3_44
Chapter Google Scholar
Huang, X., Liu, M.Y., Belongie, S., Kautz, J.: Multimodal unsupervised image-to-image translation. In: The European Conference on Computer Vision (ECCV) (2018)
Chapter Google Scholar
Iglovikov, V.I., Shvets, A.A.: TernausNet: U-Net with VGG11 Encoder Pre-Trained on ImageNet for Image Segmentation. CoRR abs/1801.05746 (2018)
Google Scholar
Lee, H.Y., Tseng, H.Y., Huang, J.B., Singh, M., Yang, M.H.: Diverse image-to-image translation via disentangled representations. In: The European Conference on Computer Vision (ECCV) (2018)
Google Scholar
Lee, K.H., Ros, G., Li, J., Gaidon, A.: SPIGAN: privileged adversarial learning from simulation. In: International Conference on Learning Representations (2019)
Google Scholar
Maier-Hein, L., et al.: Surgical data science for next-generation interventions. Nat. Biomed. Eng. 1(9), 691 (2017)
Article Google Scholar
Twinanda, A., Shehata, S., Mutter, D., Marescaux, J., De Mathelin, M., Padoy, N.: EndoNet: a deep architecture for recognition tasks on laparoscopic videos. IEEE Trans. Med. Imaging 36, 86–97 (2016)
Article Google Scholar
Wang, Z., Simoncelli, E.P., Bovik, A.C.: Multiscale structural similarity for image quality assessment. In: The Thrity-Seventh Asilomar Conference on Signals, Systems Computers, vol. 2, pp. 1398–1402 (2003)
Google Scholar
Zhu, J.Y., Park, T., Isola, P., Efros, A.A.: Unpaired image-to-image translation using cycle-consistent adversarial networks. In: 2017 IEEE International Conference on Computer Vision (ICCV) (2017)
Google Scholar

Download references

Author information

Authors and Affiliations

Division of Translational Surgical Oncology, National Center for Tumor Diseases, Dresden, Germany
Micha Pfeiffer, Isabel Funke, Sebastian Bodenstedt, Leon Strenger & Stefanie Speidel
Wellcome/EPSRC Centre for Interventional and Surgical Sciences, University College London, London, UK
Maria R. Robu & Matthew J. Clarkson
Centre for Medical Image Computing, University College London, London, UK
Maria R. Robu & Matthew J. Clarkson
Faculty of Computer Science, Mannheim University of Applied Sciences, Mannheim, Germany
Sandy Engelhardt
Division of Computer Assisted Medical Interventions (CAMI), German Cancer Research Center (DKFZ), Heidelberg, Germany
Tobias Roß & Lena Maier-Hein
Division of Surgery and Interventional Science, University College London, London, UK
Kurinchi Gurusamy & Brian R. Davidson
Department for Visceral, Thoracic and Vascular Surgery, University Hospital Dresden, Dresden, Germany
Carina Riediger, Thilo Welsch & Jürgen Weitz
Centre for Tactile Internet with Human-in-the-Loop (CeTI), TU Dresden, Dresden, Germany
Jürgen Weitz & Stefanie Speidel

Authors

Micha Pfeiffer
View author publications
You can also search for this author in PubMed Google Scholar
Isabel Funke
View author publications
You can also search for this author in PubMed Google Scholar
Maria R. Robu
View author publications
You can also search for this author in PubMed Google Scholar
Sebastian Bodenstedt
View author publications
You can also search for this author in PubMed Google Scholar
Leon Strenger
View author publications
You can also search for this author in PubMed Google Scholar
Sandy Engelhardt
View author publications
You can also search for this author in PubMed Google Scholar
Tobias Roß
View author publications
You can also search for this author in PubMed Google Scholar
Matthew J. Clarkson
View author publications
You can also search for this author in PubMed Google Scholar
Kurinchi Gurusamy
View author publications
You can also search for this author in PubMed Google Scholar
Brian R. Davidson
View author publications
You can also search for this author in PubMed Google Scholar
Lena Maier-Hein
View author publications
You can also search for this author in PubMed Google Scholar
Carina Riediger
View author publications
You can also search for this author in PubMed Google Scholar
Thilo Welsch
View author publications
You can also search for this author in PubMed Google Scholar
Jürgen Weitz
View author publications
You can also search for this author in PubMed Google Scholar
Stefanie Speidel
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Micha Pfeiffer .

Editor information

Editors and Affiliations

University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
Dinggang Shen
University of Georgia, Athens, GA, USA
Tianming Liu
Western University, London, ON, Canada
Terry M. Peters
Yale University, New Haven, CT, USA
Lawrence H. Staib
University of Strasbourg, Illkirch, France
Caroline Essert
United Imaging Intelligence, Shanghai, China
Sean Zhou
University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
Pew-Thian Yap
Western University, London, ON, Canada
Ali Khan

1 Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary material 1 (pdf 6129 KB)

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Pfeiffer, M. et al. (2019). Generating Large Labeled Data Sets for Laparoscopic Image Processing Tasks Using Unpaired Image-to-Image Translation. In: Shen, D., et al. Medical Image Computing and Computer Assisted Intervention – MICCAI 2019. MICCAI 2019. Lecture Notes in Computer Science(), vol 11768. Springer, Cham. https://doi.org/10.1007/978-3-030-32254-0_14

Download citation

DOI: https://doi.org/10.1007/978-3-030-32254-0_14
Published: 10 October 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-32253-3
Online ISBN: 978-3-030-32254-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Societies and partnerships

The Medical Image Computing and Computer Assisted Intervention Society (opens in a new tab)