
1 Introduction

Image registration is the process of aligning two or more sources of data to the same coordinate system. Among the many registration methods used in medical applications, deformable registration is the most commonly used due to its richness of description [15]. The goal of deformable registration is to calculate the optimal non-linear dense transformation G that best aligns a source (moving) image S to a reference (target) image R [2, 6]. The existing literature typically considers the mapping only after a linear alignment has been performed and is therefore often biased towards the linear component. Furthermore, state-of-the-art methods are sensitive to the application setting, involve multiple hyper-parameters (optimization strategy, smoothness term, deformation model, similarity metric), and are computationally expensive.

Recently, deep learning methods have gained a lot of attention due to their state-of-the-art performance on a variety of problems and applications [4, 12]. In computer vision, optical flow estimation, a problem highly similar to deformable registration, has been successfully addressed with numerous deep neural network architectures [9]. In medical imaging, several methods in the literature propose convolutional neural networks (CNNs) as robust tools for image registration [5, 14]. More recently, adversarial losses have been introduced with impressive performance [16]. The majority of these methods share two limitations: (i) dependency on the linear component of the transformation and (ii) dependency on ground-truth displacements used for supervised training.

In this paper, we address these limitations of traditional deformable registration methods and propose an unsupervised method for efficient and accurate registration of 3D medical volumes that determines the linear and deformable parts in a single forward pass. The proposed solution outperforms conventional multi-metric deformable registration methods and shows clinical relevance: the transformation between the extreme moments of the respiratory cycle can be used to classify patients with interstitial lung disease (ILD).

The main contributions of the study are fourfold: (i) coupling linear and deformable registration within a single optimization step/architecture, (ii) creating a modular, parameter-free implementation that is independent of the choice of similarity metric, (iii) considerably reducing the computational time needed for registration, allowing real-time applications, and (iv) associating deformations with clinical information.

2 Methodology

In this study, we propose the use of an unsupervised CNN for the registration of pairs of medical images. A source image S and a reference image R are presented as inputs to the CNN, while the output is the deformation G along with the registered source image D. This section presents details of the proposed architecture as well as the dataset used for our experiments. Note that henceforth we use the terms deformation, grid, and transformation interchangeably.

2.1 Linear and Deformable 3D Transformer

One of the main components of the proposed CNN is the 3D transformer layer, which warps its input under a deformation G. The forward pass for this layer is given by

$$\begin{aligned} D = \mathcal {W}(S, G), \end{aligned}$$
(1)

where \(\mathcal {W}(\cdot , G)\) indicates a sampling operation \(\mathcal {W}\) under the deformation G. G is a dense deformation which can be thought of as an image of the same size as D, constructed by assigning, to every output voxel in D, a sampling coordinate in the input S.

In order to allow gradients to flow backwards through this warping operation and facilitate back-propagation training, the gradients with respect to the input image as well as the deformation should be defined. Similar to [10], such gradients can be calculated for backward trilinear interpolation sampling. The deformation is hence fed to the transformer layer as sampling coordinates for backward warping. The sampling process is illustrated by

$$\begin{aligned} D(\mathbf {p}) = \mathcal {W}(S, G)(\mathbf {p}) = \sum _{\mathbf {q}} S(\mathbf {q}) \prod _{d}\max \left( 0, 1 - |{[G(\mathbf {p})]_d - \mathbf {q}_d}|\right) , \end{aligned}$$
(2)

where \(\mathbf {p}\) and \(\mathbf {q}\) denote pixel locations, \(d\in \{x,y,z\}\) denotes an axis, and \([G(\mathbf {p})]_d\) denotes the d-component of \(G(\mathbf {p})\).
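To make the sampling operation concrete, the following is a minimal PyTorch sketch of Eq. (2); the framework, the function names, and the voxel-coordinate convention are our illustration rather than details of the original implementation. `grid_sample` provides exactly the differentiable backward trilinear sampling described above.

```python
import torch
import torch.nn.functional as F

def warp(source, grid):
    """Backward trilinear warping W(S, G) from Eq. (2).

    source: (B, C, D, H, W) input volume S.
    grid:   (B, D, H, W, 3) sampling coordinates G(p) in voxel units,
            with the last dimension ordered (x, y, z).
    """
    B, C, D, H, W = source.shape
    # grid_sample expects coordinates normalized to [-1, 1].
    extent = torch.tensor([W - 1, H - 1, D - 1],
                          dtype=grid.dtype, device=grid.device)
    norm_grid = 2.0 * grid / extent - 1.0
    # For 5-D inputs, mode='bilinear' performs trilinear interpolation;
    # gradients flow to both the image and the grid, as required for
    # end-to-end training.
    return F.grid_sample(source, norm_grid, mode='bilinear', align_corners=True)
```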

Our modeling of the deformation G offers a choice of the type of deformation we wish to use—linear, deformable, or both. The linear (or affine) part of the deformation requires the prediction of a \(3 \times 4\) affine transformation matrix A according to the relation \([\hat{x}, \hat{y}, \hat{z}]^T = A[x, y, z, 1]^T \), where \([x, y, z, 1]^T\) represents the augmented points to be deformed, whereas \([\hat{x}, \hat{y}, \hat{z}]^T\) represents their locations in the deformed image. The matrix A can then be used to build a grid, \(G_A\), which is the affine component of the deformation G.
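Assuming the same framework, the affine grid \(G_A\) can be built directly from the predicted matrix A with `affine_grid`; note that this sketch expresses A in the normalized coordinate system that the function expects.

```python
import torch.nn.functional as F

def affine_to_grid(A, size):
    """Build the affine grid G_A from a predicted 3x4 matrix A.

    A:    (B, 3, 4) affine parameters, expressed in the normalized
          [-1, 1] coordinate system used by affine_grid.
    size: target output shape (B, C, D, H, W).
    """
    # Each output voxel receives the sampling coordinate A @ [x, y, z, 1]^T,
    # returned as a (B, D, H, W, 3) grid.
    return F.affine_grid(A, size, align_corners=True)
```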

To model the deformable part \(G_N\), a simple and straightforward approach is to let the network generate the sampling coordinates for each output voxel (\(G_N(\mathbf {p})\)) directly. Such a choice would, however, require the network to produce feature maps with large value ranges, which complicates training. Moreover, without appropriate regularization, non-smooth and even disconnected deformations could be produced. To circumvent this problem, we adopt the approach proposed in [13] and predict the spatial gradients \(\varPhi \) of the deformation along each dimension instead of the deformation itself. This quantity measures the displacement between consecutive pixels. By enforcing these values to be positive and subsequently applying an integration operation along each dimension, the spatial sampling coordinates can be retrieved. This integration can be approximated by a cumulative sum along each dimension of the input (i.e., an integral image). In such a case, when \(\varPhi _{\mathbf {p}_d} = 1\) there is no change in the distance between the pixels \(\mathbf {p}\) and \(\mathbf {p}+1\) in the deformed image along the axis d; when \(\varPhi _{\mathbf {p}_d} < 1\), the distance between these consecutive pixels along d decreases; and when \(\varPhi _{\mathbf {p}_d} > 1\), it increases. Such an approach ensures the generation of smooth deformations that avoid self-crossings, while allowing control over the maximum displacement between consecutive pixels. A minimal sketch of this integration step follows.
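The sketch below implements the cumulative-sum integration under the same framework assumption as above; the channel ordering of \(\varPhi \) and the identity-offset convention are our choices.

```python
import torch

def integrate_gradients(phi):
    """Recover the deformable grid G_N from spatial gradients Phi.

    phi: (B, 3, D, H, W) predicted gradients in [0, 2]; the three channels
         hold the gradients along z, y and x (an assumed ordering).
         phi == 1 everywhere corresponds to the identity deformation.
    """
    # A cumulative sum along each axis approximates the integration
    # (an integral image per axis); subtracting 1 makes an all-ones phi
    # map every voxel to itself.
    gz = torch.cumsum(phi[:, 0], dim=1) - 1.0   # along depth  (z)
    gy = torch.cumsum(phi[:, 1], dim=2) - 1.0   # along height (y)
    gx = torch.cumsum(phi[:, 2], dim=3) - 1.0   # along width  (x)
    # Stack as (x, y, z) to match the warping layer sketched above.
    return torch.stack([gx, gy, gz], dim=-1)    # (B, D, H, W, 3)
```

With an all-ones \(\varPhi \), the cumulative sums reproduce the identity grid, matching the behavior described above.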

Finally, to compose the two parts, we first apply the deformable component to the moving image and then the linear component. For a source image S, this composition can be written as

$$\begin{aligned} \mathcal {W}(S, G) = \mathcal {W}\left( \mathcal {W}(S, G_N), G_A\right) . \end{aligned}$$
(3)
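Putting the pieces together, a hedged sketch of the composition in Eq. (3), reusing the `warp`, `integrate_gradients`, and `affine_to_grid` helpers sketched earlier:

```python
import torch.nn.functional as F

def register(source, phi, A):
    """Eq. (3): apply the deformable component first, then the affine one."""
    deformed = warp(source, integrate_gradients(phi))   # W(S, G_N)
    grid_a = affine_to_grid(A, source.shape)            # G_A
    # grid_a is already expressed in grid_sample's normalized coordinates,
    # so it bypasses the voxel-to-normalized rescaling inside warp().
    return F.grid_sample(deformed, grid_a,
                         mode='bilinear', align_corners=True)  # W(W(S, G_N), G_A)
```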

During training, the decoders of A and \(G_N\) are optimized jointly, as the network is trained end-to-end. We also impose regularization constraints on both components. We elaborate on the importance of this regularization for the joint training in Sect. 2.3.

Fig. 1. The overall CNN architecture. The network takes a pair of 3D images and calculates the optimal deformation from one image to the other.

2.2 Architecture

The architecture of the CNN is based on the encoder-decoder framework presented in [1] (Fig. 1). The encoder adopts dilated convolutional kernels along with multi-resolution feature merging, while the decoder employs non-dilated convolutional layers and up-sampling operations. Specifically, a kernel size of \(3 \times 3 \times 3\) was set for the convolutional layers, and LeakyReLU activation was employed for all convolutional layers except the last two. Instance normalization was included before most of the activation functions. In total, five layers are used in the encoder and their outputs are merged together with the input pair of images to form a feature map of 290 features with a total receptive field of \(25 \times 25 \times 25\). In the decoder, two branches were implemented: one for the spatial deformation gradients and one for the affine matrix. For the former, a squeeze-excitation block [8] was added in order to weigh the features most important for the spatial gradient calculation, while for the latter a simple global average operation was used to reduce the spatial dimensions to one. A linear layer and a sigmoid activation were used for the affine parameters and the spatial deformation gradients, respectively. Finally, to retrieve \(\varPhi \), the output of the sigmoid is scaled by a factor of 2 so that it falls in the range [0, 2], thus allowing consecutive pixels to end up farther apart than they initially were. A simplified sketch of the two decoder branches follows.
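In the sketch below, only the kernel size, the 290-feature input, the global average pooling, the linear layer, and the scaled sigmoid come from the description above; everything else (names, exact wiring, the omission of the squeeze-excitation block) is illustrative.

```python
import torch
import torch.nn as nn

class DecoderHeads(nn.Module):
    """Simplified sketch of the two decoder branches."""

    def __init__(self, feat_channels=290):
        super().__init__()
        self.phi_conv = nn.Conv3d(feat_channels, 3, kernel_size=3, padding=1)
        self.affine_fc = nn.Linear(feat_channels, 12)   # 3x4 matrix A

    def forward(self, feats):                # feats: (B, 290, D, H, W)
        # Spatial gradients branch: sigmoid scaled to [0, 2], so consecutive
        # voxels may move closer (phi < 1) or farther apart (phi > 1).
        phi = 2.0 * torch.sigmoid(self.phi_conv(feats))
        # Affine branch: global average pooling to a vector, then linear.
        A = self.affine_fc(feats.mean(dim=(2, 3, 4))).view(-1, 3, 4)
        return phi, A
```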

2.3 Training

The network was trained by minimizing the mean squared error (MSE) between the intensities of R and D, together with regularization terms on the affine transformation parameters and the spatial deformation gradients, using the Adam optimizer [11]. Our loss is defined as

$$\begin{aligned} \mathrm {Loss} = \left\| R - \mathcal {W}(S, G) \right\| ^{2} + \alpha \left\| A - A_I\right\| _{1} + \beta \left\| \varPhi - \varPhi _{I} \right\| _{1}, \end{aligned}$$
(4)

where \(A_I\) represents the identity affine transformation matrix, \(\varPhi _{I}\) is the spatial gradient of the identity deformation, and \(\alpha \) and \(\beta \) are regularization weights. As mentioned before, regularization is essential to the joint optimization. To elaborate, without the L1 regularization on A, the network may get stuck in a local minimum where it aligns only high-level features using the affine transformation, resulting in a high reconstruction error. On the other hand, without the smoothness regularizer on \(\varPhi \), the spatial gradient decoder can predict highly non-smooth grids, which again makes the optimization prone to local minima. Having both linear and deformable components helps the network because the two components share the work. This hypothesis aligns with [13] and is also evaluated in Sect. 3. A sketch of the loss is given below.
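A hedged sketch of Eq. (4); the reduction applied inside each norm (mean for the MSE, sum for the L1 terms) is an assumption.

```python
import torch

def loss_fn(R, D, A, phi, alpha=1e-6, beta=1e-6):
    """Eq. (4): reconstruction MSE plus L1 regularizers toward identity."""
    A_I = torch.eye(3, 4, device=A.device)        # identity affine matrix
    recon = ((R - D) ** 2).mean()                 # || R - W(S, G) ||^2
    reg_affine = alpha * (A - A_I).abs().sum()    # || A - A_I ||_1
    reg_smooth = beta * (phi - 1.0).abs().sum()   # || phi - phi_I ||_1, phi_I = 1
    return recon + reg_affine + reg_smooth
```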

The initial learning rate is \(10^{-3}\) and is reduced by a factor of 10 if the performance on the validation set does not improve for 50 epochs, while training stops when there is no improvement for 100 epochs. The regularization weights \(\alpha \) and \(\beta \) were set to \(10^{-6}\) so that neither of the two components has an unreasonably large contribution to the final loss. As training samples, random pairs among all cases were selected, with a batch size of 2 due to limited GPU memory. The performance of the network was evaluated every 100 batches, and both proposed models, with and without the affine component, converged after nearly 300 epochs. The overall training time was \({\sim }16\) h.
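This schedule maps naturally onto a standard plateau scheduler; a sketch, assuming PyTorch and a `model` built as in Sect. 2.2:

```python
import torch

# `model` stands for the registration network described in Sect. 2.2.
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)
# Drop the learning rate by a factor of 10 when the validation loss has not
# improved for 50 epochs, mirroring the schedule described above; call
# scheduler.step(val_loss) after each validation pass.
scheduler = torch.optim.lr_scheduler.ReduceLROnPlateau(
    optimizer, mode='min', factor=0.1, patience=50)
```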

2.4 Dataset

MRI exams were acquired as part of a prospective study aiming to evaluate the feasibility of pulmonary fibrosis detection in systemic sclerosis patients by using magnetic resonance imaging (MRI) and an elastic registration-driven biomarker. This study received institutional review board approval and all patients gave their written consent. The study population consisted of 41 subjects (29 patients with systemic sclerosis and 12 healthy volunteers). Experienced radiologists annotated the lung field for all 82 images and provided information about the pathology of each subject (healthy or not). Additionally, eleven characteristic landmarks inside the lung area were provided by two experienced radiologists.

All MRI examinations were acquired on a 3T MRI unit (MAGNETOM Skyra, Siemens Healthineers) using an 18-channel phased-array body coil. All subjects were positioned supine with their arms along the body. Inspiratory and expiratory MRI images were acquired using an ultrashort echo time (UTE) sequence, the spiral VIBE sequence, with the same acquisition parameters (repetition time 2.73 ms, echo time 0.05 ms, flip angle \(5^{\circ }\), field-of-view \(620 \times 620\) mm, slice thickness 2.5 mm, matrix \(188 \times 188\), with an in-plane resolution of \(2.14 \times 2.14\) mm).

As a pre-processing step, the image intensity values were clipped to the window [0, 1300] and mapped to [0, 1]. Moreover, all images were scaled down along all dimensions by a factor of 2/3 with cubic interpolation, resulting in an image size of \(64\times 192\times 192\), to accommodate GPU memory constraints. A random split was performed: 28 patients (56 images) were selected for the training set, resulting in 3136 training pairs, while the remaining 13 patients were used for validation.
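A short sketch of this pre-processing, assuming NumPy and SciPy (the library choice is ours, not stated in the study):

```python
import numpy as np
from scipy.ndimage import zoom

def preprocess(volume):
    """Intensity windowing and downscaling as described above."""
    vol = np.clip(volume, 0, 1300) / 1300.0   # window [0, 1300] -> [0, 1]
    return zoom(vol, 2.0 / 3.0, order=3)      # cubic downscale by 2/3 per axis
```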

3 Experimental Setup and Results

3.1 Evaluation

We evaluated the performance of our method against two different state-of-the-art methods, namely Symmetric Normalization (SyN) [2], using its implementation in the ANTs package [3], and the deformable method presented in [6, 7] with a variety of similarity metrics (normalized cross correlation (NCC), mutual information (MI), the discrete wavelet metric (DWM), and their combination). For the evaluation, we calculated the Dice coefficient on the lung masks, after applying the calculated deformation to the lung mask of the moving image. Moreover, we evaluated our method using the provided landmark locations. For comparison, we also report the approximate computational time each method needed to register a pair of images. All implementations ran on a GeForce GTX 1080 GPU, except for SyN, for which we used a CPU implementation running on 4 cores of an i7-4700HQ CPU.
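For reference, the Dice coefficient used here can be computed on binary masks as follows (a standard formulation, not code from the study):

```python
import numpy as np

def dice(mask_a, mask_b):
    """Dice coefficient between two binary lung masks."""
    intersection = np.logical_and(mask_a, mask_b).sum()
    return 2.0 * intersection / (mask_a.sum() + mask_b.sum())
```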

3.2 Results and Discussion

Starting with the quantitative evaluation, Table 1 presents the mean Dice coefficient values along with their standard deviations for the different methods. We performed two different types of tests. In the first set of experiments (Table 1: Inhale-Exhale), we tested the performance of the different methods on the registration between the inhale and exhale MRI images of the 13 validation patients. The SyN implementation reports the lowest Dice scores while also being computationally quite expensive due to its CPU implementation. Moreover, we tested three different similarity metrics along with their combinations using the method proposed in [6], as described earlier. In this specific setup, the MI metric seems to report the best Dice scores. However, the scores reported by the proposed architecture are superior by at least \({\sim }2.5\)% to those reported by the other methods. For the proposed method, the addition of a linear component to the transformation layer does not change the performance of the network significantly in this experiment. Finally, we calculated the errors over all axes in the predicted locations of eleven manually annotated landmark points on the inhale volumes, after deforming them with the decoded deformation for each patient. In Table 2, we compare the performance of our method against the inter-observer distance (between two medical experts) and the method presented in [6]. We observe that both methods perform very well considering the inter-observer variability, with the proposed one reporting slightly better average Euclidean distances.

Table 1. Dice coefficient scores (%) calculated between the deformed lung masks and the ground truth.

For the second set of experiments (Table 1: All combinations), we report the Dice scores for all combinations of the 13 different patients, resulting in 169 validation pairs. Due to the large number of combinations, this problem is more challenging, since the size of the lungs at the extreme moments of the respiratory cycle can vary significantly. Again, the performance of the proposed architecture is superior to the tested baselines, highlighting its very promising results. In this experimental setup, the linear component plays a more important part, boosting the performance by \({\sim }0.5\)%.

Concerning computation time, both [6] and the proposed method report very low inference times due to their GPU implementations, with the proposed method reaching \({\sim }0.5\) s per subject. On the other hand, [2] is computationally quite expensive, making it difficult to test on all possible combinations of the validation set.

Table 2. Errors measured as average Euclidean distances between estimated landmark locations and the ground truth marked by two medical experts. We also report, as inter-observer, the average Euclidean distance between the same landmark locations marked by the two experts. dx, dy, and dz denote distances along the x-, y-, and z-axes, respectively, while ds denotes the average error over all axes.

Finally, in Fig. 2, we present the deformed image produced by the proposed method in coronal view for a single patient at the two extreme moments of the respiratory cycle. The grids are superimposed on the images, indicating the displacements calculated by the network. The last column shows the difference between the reference and the deformed image. One can observe that the majority of the errors occur at the boundaries, as the network fails to capture large local displacements.

Fig. 2. A visualized registration of a pair of images, generated by the proposed architecture. The initial and deformed grids are superimposed on the images.

3.3 Evaluation of the Clinical Relevance of the Deformation

To assess the relevance of the decoded transformations in a clinical setting, we trained a small classifier on top of the obtained residual deformations to classify patients as healthy or unhealthy. The residual deformation associated with a pair of images captures the voxel displacements and is written as \(G_\delta = G - G_I\), where G is the deduced deformation between the two images and \(G_I\) is the identity deformation.

We trained a downsampling convolutional kernel followed by a multi-layer perceptron (MLP) to predict whether a case is healthy or not. The network architecture is shown in Fig. 3. The model includes batch normalization layers to limit overfitting, as few training examples are available, and a Tanh activation function in the MLP. The downsampling kernel is of size \(3 \times 3 \times 3\), with a stride of 2 and a padding of 1. The number of units in the hidden layer of the MLP was set to 100. We trained with a binary cross-entropy loss and an initial learning rate of \(10^{-4}\), halved every fifty epochs. Training five models in parallel took about 2 h on two GeForce GTX 1080 GPUs. A sketch of such a classifier is given below.
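The sketch uses the stated kernel size, stride, padding, hidden width, and activations; the output channel count of the downsampling convolution, the input shape, and the use of logits are assumptions.

```python
import torch
import torch.nn as nn

class DeformationClassifier(nn.Module):
    """Sketch of the classifier in Fig. 3, operating on G_delta = G - G_I."""

    def __init__(self, in_shape=(3, 64, 192, 192), hidden=100):
        super().__init__()
        c, d, h, w = in_shape
        # Downsampling kernel: 3x3x3, stride 2, padding 1, as stated above.
        self.down = nn.Conv3d(c, 1, kernel_size=3, stride=2, padding=1)
        self.bn = nn.BatchNorm3d(1)
        self.mlp = nn.Sequential(
            nn.Linear((d // 2) * (h // 2) * (w // 2), hidden),
            nn.Tanh(),
            nn.Linear(hidden, 1),        # logit for healthy vs. unhealthy
        )

    def forward(self, g_delta):          # g_delta: (B,) + in_shape
        x = self.bn(self.down(g_delta))
        return self.mlp(x.flatten(1))    # pair with BCEWithLogitsLoss
```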

Fig. 3. The neural network trained as a classifier on top of the transformations.

We cross-validated five models on the training set of 28 patients and report the average response of these models on the remaining 13 patients. We conducted the same experiment for deformations obtained using [6] with all similarity measures (NCC, DWM, MI). The results on the test set, using a threshold of 0.5 on the predicted probability, are reported in Table 3, suggesting that the deformations between inhale and exhale indeed carry information about lung disease.

Table 3. Results on disease prediction using deformations on the test set. The reported accuracy is in percentage points.

4 Conclusion

In this paper, we propose a novel method that exploits 3D CNNs to calculate the optimal transformation, combining a linear and a deformable component within a coupled framework, between pairs of images, while remaining modular with respect to the similarity function and the nature of the transformation. The proposed method generates deformations with no self-crossings due to the way the deformation layer is defined, is efficient thanks to its GPU inference, and reports highly promising results compared to other unsupervised registration methods. So far, the network has been tested on the challenging problem of lung registration; its evaluation on other modalities and organs is a promising direction for future work.