1 Introduction

Diffusion weighted imaging (DWI) enables the in vivo study of the brain white matter (WM) microstructure. For this, the diffusion tensor (DT) has long been the model of choice for its simplicity and ability to provide quantitative scalar parameters (fractional anisotropy - FA, mean diffusivity - MD...) that correlate with pathology stages. However, the DT suffers from some limits when dealing with crossing fibers or since FA and MD encompass multiple microstructure properties that cannot be disentangled. In contrast, diffusion multi-compartment models (MCM) [10], i.e. a linear sum of diffusion compartments each modeling a fiber bundle, provide a more specific microstructure characterization.

Population studies enable the comprehension at the group level of a disease progression and severity. Typical population studies require the construction of atlases or the registration of all images to an atlas, on which the comparison is performed. For this task, using as much information as possible from the DWI, e.g. using the full MCM, enables a finer disease characterization [12]. With this objective, several methods have been explored for diffusion images registration.

Most approaches have focused on DT or DT-derived scalar images registration [7, 13]. However, they do not use the whole diffusion information especially in crossing regions where the DT is limited. On the other end, some approaches [8] consider DWI registration. However, these algorithms are yet limited to single-shell DWI while the complete study of WM microstructure requires multiple b-values [11]. Moreover, adapted re-orientation in q-space is needed [6] and requires an underlying diffusion model making the registration model dependent.

MCM image registration has also been studied in [12] using a multi-tensor correlation coefficient showing better alignment of WM structures and improved abnormality detection compared to DT registration. However, their algorithm is centered on multi-tensors, while many other diffusion models are available [10]. Furthermore, their measure requires the one-to-one mapping of the compared MCMs tensors. This task is time-consuming and may lead to discontinuities of the similarity measure and its derivative as illustrated in Sect. 3.2. This mapping is also harder to define when models with different number of compartments are compared, as it happens if model selection [11] is part of MCM estimation. As an alternative, Cheng et al. [4] proposed to use the \(\ell _2\) norm; their approach is however limited to multi-tensors and to a sum of squared differences (SSD).

We propose in Sect. 2 new similarity measures for MCM images registration, considering distances between MCM characteristic functions (CF), i.e. the Fourier transforms of the MCM probability density functions (PDF). The first measure extends the SSD to MCM; the second measure proposes a surrogate on MCM to the scalar correlation coefficient (CC) to account for variability across patients and acquisition parameters. Both measures are agnostic to individual compartment types, i.e. we assume that their CF describe the microstructure well for registration, and only require to derive the generic expression of the inner product of two compartments to get an analytical form. For complex models, we propose a discrete spherical approximation to both measures. We evaluate the proposed measures in Sect. 3 using simulated and in-vivo data showing their smoother evolution and improved performance for matching WM structures.

2 Methods

2.1 Diffusion Multi-Compartment Models

We define a diffusion MCM M as a weighted sum of N compartments, each depicting water diffusion in a specific environment. For example, an MCM may contain an isotropic free water compartment and several directional compartments, each describing diffusion in a fiber bundle with a given orientation. The MCM admits a PDF \(p_M(x) = \sum _{i=1}^N w_{M,i} p_{M,i}(x)\), where \(p_{M,i}\) is the PDF of the i-th compartment (\(M_i\)) of the model and \(w_{M,i}\) is the weight of compartment \(M_i\) (\(\sum _i w_{M,i} = 1\)). \(p_{M,i}\) can take one of many different forms [10]. M also admits a unique CF \(\varphi _M (t) = \sum _{i=1}^N w_{M,i} \varphi _{M,i}(t)\), where \(\varphi _{M,i}\) is the CF of \(M_i\). As compartments are usually antipodally symmetric, \(\varphi _{M}\) takes its values in \(\mathbb {R}\).

2.2 \(\ell _2\) Space of Square Integrable Functions

We consider the Hilbert space \(\mathcal {L}\) of real-valued square integrable functions f such that \(f: \mathbb {R}^3 \rightarrow \mathbb {R}\) and \(\int _{\mathbb {R}^3} f^2(x) dx < \infty \). The inner product on \(\mathcal {L}\) is defined as \(\langle f,g\rangle _{\mathcal {L}} = \int _{\mathbb {R}^3} f(x) g(x) dx\) and is associated to a distance \(d_{\mathcal {L}}\). \(d_{\mathcal {L}}\) can be used to compare MCM PDFs since most fascicle model PDFs are square integrable. However we define our similarity measures on MCM CFs, i.e. the Fourier transforms of the PDFs, since: 1- the CF is usually simpler than the PDF and is directly related to DWI signal formation, 2- distances on CFs or PDFs are equivalent since \(d_{\mathcal {L}}\) computed on CFs or PDFs are equal (Plancherel theorem). The distance between two MCM CFs \(\varphi _X\) and \(\varphi _Y\) is written as follows:

$$\begin{aligned} d_{\mathcal {L}}^2 (\varphi _X,\varphi _Y) = \mathbf {w_X}^T \mathbf {A_{X,X}} \mathbf {w_X} + \mathbf {w_Y}^T \mathbf {A_{Y,Y}} \mathbf {w_Y} - 2 \mathbf {w_X}^T \mathbf {A_{X,Y}} \mathbf {w_Y} \end{aligned}$$
(1)

where \(\mathbf {w_X}\), respectively \(\mathbf {w_Y}\), represents the vector of compartment weights for model X, respectively for model Y. \(\mathbf {A_{X,Y}}\) denotes the \(N_X\times N_Y\) matrix of all pairwise inner products between the individual compartments of X and Y. As a consequence, \(d_{\mathcal {L}}\) can be computed analytically if \(\langle .,. \rangle _{\mathcal {L}}\) can be.

2.3 MCM Similarity Measures

We consider measures computed over a set of voxels that have been paired between two images, respectively \(\mathbf {R} = \{ R_1, \cdots , R_N \}\) and \(\mathbf {F} = \{ F_1, \cdots , F_N \}\), where \(R_k\) and \(F_k\) denote the k-th paired MCMs.

MCM Sum of Squared Differences. The first measure is the direct extension of SSD to MCMs relying on the direct comparison of the MCM paired between \(\mathbf {R}\) and \(\mathbf {F}\) using the distance defined in Eq. (1):

$$\begin{aligned} \mathrm {SSD}_{\mathcal {L}} (\mathbf {R},\mathbf {F}) = \sum _{k=1}^N d_{\mathcal {L}}^2 (\varphi _{R_k}, \varphi _{F_k}) \end{aligned}$$
(2)

where \(\varphi _{R_k}\), respectively \(\varphi _{F_k}\) is the CF of the k-th MCM in \(\mathbf {R}\), respectively in \(\mathbf {F}\). Assuming all compartments are tensors, this measure admits a closed-form solution based on an inner product of the form: \(\langle \varphi _{i}, \varphi _{j} \rangle _{\mathcal {L}} = (2\pi )^{3/2} | \Sigma _{i} + \Sigma _{j} |^{-1/2}\).

MCM Correlation Surrogate. Similarly to the scalar SSD, \(\mathrm {SSD}_{\mathcal {L}}\) may suffer from changes due to inter-patient brain variability or DWI acquisition differences. We therefore present a second measure aimed at being an “equivalent”, in the \(\mathcal {L}\) space, to the scalar squared CC. Going back to the scalar case, the squared CC can be seen as a measure of the linear relationship between two sets of scalars. Another way of measuring this relationship has been proposed by Alpert et al. [2] as the squared residual of the linear regression between these sets. We propose an MCM correlation surrogate in \(\mathcal {L}\) following a similar idea:

$$\begin{aligned} C_{\mathcal {L}} = \min _{\varvec{\theta }} \sum _{k=1}^N d_{\mathcal {L}}^2 (\varphi _{R_k}, H_{\varvec{\theta }} \varphi _{F_k}) \end{aligned}$$
(3)

where \(H_{\varvec{\theta }}: \mathbb {R}^3 \rightarrow \mathbb {R}\) denotes a kernel function with parameters \(\varvec{\theta }\), such that it preserves the square integrability of \(\varphi _{F_k}\). \(C_{\mathcal {L}}\) is the residual of the minimization of Eq. (3) with respect to \(\varvec{\theta }\). Interestingly, if \(H_{\varvec{\theta }}\) is a CF, Eq. (3) is equivalent to the convolution of \(p_{F_k}\) with \(H_{\varvec{\theta }}\) PDF. Following this, we choose \(H_{\varvec{\theta }}\) as the CF of the centered Gaussian PDF with parameter \(\sigma \): \(H_{\varvec{\theta }}(t) = \exp \left( - \sigma t^T t / 2 \right) \), which results in a Gaussian smoothing of the MCMs PDF.

Developing further Eq. (3) leads to the computation of three different inner products of the form: \(\langle \varphi _{i},H_{\varvec{\theta }}^{\alpha } \varphi _{j} \rangle _{\mathcal {L}}\) for \(\alpha \in \{ 0,1,2 \}\). Similarly to \(\mathrm {SSD}_{\mathcal {L}}\), when considering tensor MCM compartments, these inner products admit an analytical form \(\langle \varphi _{i},H_{\varvec{\theta }}^{\alpha } \varphi _{j} \rangle _{\mathcal {L}} = (2\pi )^{3/2} | \Sigma _{i} + \Sigma _{j} + \alpha \sigma Id |^{-1/2}\) (where Id denotes the \(3\times 3\) identity matrix). In this specific case, optimization over \(\varvec{\theta }\) is carried out using gradient-based optimization algorithm as analytical derivatives are available.

Similarity Metrics Spherical Approximation. For some non Gaussian compartment types, the inner products cannot be computed analytically. We therefore propose a discrete approximation \(\langle .,. \rangle _{a\mathcal {L}}\) to the inner product \(\langle .,. \rangle _{\mathcal {L}}\) using the fact that CFs decrease fast to infinity and are linked to DWI signal formation as: \(S_k = S_0 \varphi \left( \sqrt{2 b_k} g_k\right) \) where \(S_0\) is the nominal signal without diffusion weighting and \(S_k\) the diffusion weighted signal with b-value \(b_k\) along gradient \(g_k\). CFs are used to estimate MCM parameters through a least squares minimization. We construct \(\langle \varphi _i,\varphi _j \rangle _{a\mathcal {L}}\) by sampling values of \(\varphi _i\) and \(\varphi _j\) on DWI acquisition spheres, on a reasonable number of points (in our experiments we sampled a total of 169 gradients on 5 spheres ranging from \(b=500\) to \(3000\,\mathrm{s.mm}^{-2}\)):

$$\begin{aligned} \langle \varphi _i,\varphi _j \rangle _{a\mathcal {L}} = \sum _k \varDelta _k \varphi _i \left( \sqrt{2 b_k} g_k\right) \varphi _j \left( \sqrt{2 b_k} g_k\right) \end{aligned}$$
(4)

where \(\varDelta _k\) is the volume of a subpart of a sphere around point \(\sqrt{2 b_k} g_k\). Plugging this approximation in Eqs. (2) and (3) defines two measures: \(\mathrm {SSD}_{a\mathcal {L}}\) and \(C_{a\mathcal {L}}\).

2.4 Pairing-Based MCM Similarity Measure

As a comparison point to the proposed measures, we also recall the definition of a pairing-based SSD, denoted \(\mathrm {SSD}_P\). The distance between two MCM \(R_k\) and \(F_k\) is defined through pairings between the MCM compartments \(R_{k,i}\) and \(F_{k,j}\). Then, assuming that \(N_{R_k} = N_{F_k}\), the pairing-based distance \(d_P\) is defined as:

$$\begin{aligned} d_P^2 (R_k,F_k) = \mathop {\min }\limits _{a_1\ldots a_{N_{R_k}}} \sum _{i=1}^{N_{R_k}} w_{R_{k,i}} w_{F_{k,a_i}} d_I^2(p_{R_{k,i}},p_{F_{k,a_i}}) \end{aligned}$$
(5)

where \(a_i\) are elements of an association vector \(\mathbf {a}\) mapping each element of \(R_k\) to an element of \(F_k\), \(d_I\) is a distance between MCM compartment PDFs: for multi-tensors the log-Euclidean distance [3]. As \(\mathbf {a}\) is unknown, all permutations are tested and \(d_P\) is the minimum over those associations. The sum of \(d_P^2\) over k defines \(\mathrm {SSD}_P\). An additional difficulty appears when \(N_{R_k} < N_{F_k}\) when model selection was performed, making \(\mathbf {a}\) more complicated to define. We chose to pair each element of \(R_k\) to a single element of \(F_k\), extraneous elements being left out.

3 Experiments and Results

3.1 Image Database and Registration Algorithm

We have used in vivo data from 10 pre-processed healthy subjects from the Human Connectome Project (HCP)Footnote 1. Each DWI acquisition includes 270 gradient directions on three different b-values (from \(b=1000\) to \(3000\,\mathrm{s.mm}^{-2}\)). The image size is 145 \(\times \) 174 \(\times \) 145 and voxel size 1.25 \(\times \) 1.25 \(\times \) 1.25 mm\(^3\). From the DWI, multi-tensor models were estimated so that the approximate and analytical measures can be compared. Estimation was performed using Scherrer et al. method with model selection [11]. Each multi-tensor model was composed of 2 isotropic compartments (free water and restricted water) and a maximum of 3 tensors.

As a fair evaluation framework, we have integrated all measures (\(\mathrm {SSD}_P\), \(\mathrm {SSD}_\mathcal {L}\), \(\mathrm {SSD}_{a\mathcal {L}}\), \(C_\mathcal {L}\), \(C_{a\mathcal {L}}\)) in a generic MCM non linear registration algorithm. This algorithm is an extension of a block-matching scalar image registration [5]. Two essential components were specified for MCM registration. First interpolation for image resampling was performed using state-of-the-art MCM interpolation techniques [12]. The second major brick is the re-orientation technique for resampling. We used the preservation of principal direction technique [1] on each compartment of the interpolated MCM. We used this framework in our experiments with the same parameters, changing only the similarity measure.

3.2 Similarity Measures Qualitative Evaluation

We evaluated the behavior of each similarity measure when translating or rotating a block (close to the ventricles, see Fig. 1.a) where the number of compartments is known to change. The selected block had a resolution of 5 \(\times \) 5 \(\times \) 5 voxels and was either translated along the Y-axis (vertical axis in Fig. 1.a) or rotated around the Z-axis (transverse axis in Fig. 1.a). For each case, we computed the similarity measure between the transformed block and the original one. We report in Fig. 1 the five evaluated similarity measures, as a function of the transformation parameters, scaled so that their maximum is equal.

Fig. 1.
figure 1

Similarity measures evolution around the corpus callosum for (b): rotation and (c): translation. Legend: red: \(\mathrm {SSD}_P\), green: \(\mathrm {SSD}_\mathcal {L}\), blue: \(\mathrm {SSD}_{a\mathcal {L}}\), cyan: \(C_\mathcal {L}\), black: \(C_{a\mathcal {L}}\).

When rotating the block around its center (Fig. 1.b), all measures perform well and indicate one clear global minimum although \(\mathrm {SSD}_P\) suffers from some irregularities. The main variations appear in the translation case due to the change of explored region with the increasing translation. In Fig. 1.c, \(\mathrm {SSD}_P\) appears more sensitive to interpolation than the proposed measures. The value of \(\mathrm {SSD}_P\) may vary abruptly when the MCM compartment pairings change, leading to non differentiability. The mapping function \(\mathbf {a}\) also maps one compartment to one and only one other compartment. However, the number of compartments is different when translation occurs. Some compartments are thus left out, leading to part of one MCM not being considered in the measure.

Measures based on the \(\mathcal {L}\) space compare MCMs as a whole and therefore adopt much smoother and regular shapes. Moreover, the shapes of the approximations are very similar (the green and blue curves are overlapping in Fig. 1 as well as the cyan and black curves in Fig. 1.b), suggesting the discrete approximation is well fit for MCM registration. \(C_\mathcal {L}\) and \(C_{a\mathcal {L}}\) adopt a different shape from the SSD measures, allowing for more possibilities in matching one block with other regions while keeping the true matching as the global optimum.

3.3 Quantitative Evaluation on HCP Data

Simulated Deformations. We simulated deformations by generating for each HCP subject a set of 5 poly-rigid transformations [5] by randomly seeding 15 local rigid transformations (rotations between \([-\pi /6;\pi /6]\) radians around each axis and translations between \([-10;10]\) mm in each direction) inside the brain and extrapolating a dense deformation. These transformations were then applied to their corresponding subject, generating two images to register: \(I \circ T^{-1/2}\) and \(I \circ T^{1/2}\). The ground truth transformation is then T. We present in Fig. 2 the voxelwise transformation errors for the registration with each measure.

Fig. 2.
figure 2

Transformation errors after registration on a representative HCP subject: fixed MCM image (a), error using \(\mathrm {SSD}_P\) (d), \(\mathrm {SSD}_\mathcal {L}\) (e), \(\mathrm {SSD}_{a\mathcal {L}}\) (b), \(C_\mathcal {L}\) (f), \(C_{a\mathcal {L}}\) (c).

All \(\mathcal {L}\) space measures outperform \(\mathrm {SSD}_P\). This is particularly visible around the ventricles (arrows on Fig. 2). This confirms previous results where \(\mathrm {SSD}_P\) shape was irregular and thus more difficult to optimize in this region. The results obtained by the spherical approximations of the measures are close to their analytical counterparts. All \(\mathcal {L}\) space measures appear to perform equally well, only displaying subtle error differences. We also computed the average transformation error over the whole image and averaged those across all experiments. All measures improve over the initial error (2.704). The \(\mathrm {SSD}_P\) error (0.587) is however significantly higher (paired t-test, \(p < 0.05\)) than the \(\mathcal {L}\) space measures: \(\mathrm {SSD}_{\mathcal {L}}\), \(\mathrm {SSD}_{a\mathcal {L}}\) and \(C_{\mathcal {L}}\) reach an average error of 0.577 and \(C_{a\mathcal {L}}\) reaches 0.578. All proposed measures are not significantly different from each other (paired t-test, \(p > 0.05\)). The absence of difference between MCM correlation and SSD may be explained by the fact that each couple of images comes from the same subject and \(\mathrm {SSD}_{\mathcal {L}}\) is enough in that case to recover the deformation.

Inter-subject Registration. We quantitatively evaluated inter-subject registration by registering all couples of HCP images (total of 90 registrations) and applying the deformation to WM segmentations. These parcellations were obtained by the HCP pipelines using FreeSurfer and contain a total of 182 structures. We report in Fig. 3 the averaged total overlaps (TO) [9] between the transformed parcellations and the true ones for each method.

Fig. 3.
figure 3

White matter parcellation total overlaps after registration of MCM images from HCP data, using either global affine registration, or non linear registration with respectively \(\mathrm {SSD}_P\), \(\mathrm {SSD}_{\mathcal {L}}\), \(\mathrm {SSD}_{a\mathcal {L}}\), MCM correlation surrogate \(C_{\mathcal {L}}\) and \(C_{a\mathcal {L}}\).

The average TO are relatively low for all methods. This comes from inter-subject variability of the reference parcellations. The overlap scores however confirm previous findings: the \(\mathcal {L}\) space measures all outperform \(\mathrm {SSD}_P\) (average TO: \(47.23\%\)), itself being better than global affine registration on B0 images (all results significant: paired t-test, \(p<0.05\)). This confirms that the pairing based measure, although good overall, encounters more difficulties. \(C_{\mathcal {L}}\) performs the best over all other measures (average TO: \(50.73\%\), paired t-test, \(p<0.05\)), while both \(\mathrm {SSD}_{\mathcal {L}}\) and \(\mathrm {SSD}_{a\mathcal {L}}\) perform equally well (average TO: \(50.53\%\)). Introducing a more flexible similarity measure than SSD may thus help MCM image registration. \(C_{a\mathcal {L}}\) performs a little lower than other \(\mathcal {L}\) space measures (average TO: \(49.73\%\)), especially compared to \(C_{\mathcal {L}}\). This suggests that the selected approximation points may have an influence on the approximation quality.

4 Conclusion

We have proposed a new framework to define MCM similarity measures based on the fact that most MCM CF belong to the Hilbert space of square integrable functions \(\mathcal {L}\). Defining distances between MCM in \(\mathcal {L}\) has two main advantages: 1- no pairing is needed between the individual MCM compartments thereby alleviating the complex computation of a mapping function; 2- the measures are generic as they are agnostic to the compartments models. We defined two similarity measures (and their analytical expressions for multi-tensors): an MCM SSD and an MCM surrogate to the scalar CC. In addition, we defined a numerical approximation applicable to all \(\mathcal {L}\) space based measures for complex models. We have evaluated these new similarity measures on both simulated transformations and real data from HCP, showing the better behavior of \(\mathcal {L}\) space measures and the potential of the MCM correlation surrogate.