1 Introduction

Tracking fetal growth and developmental progression is paramount in obstetric care. The fetal brain undergoes a predictable sequence of structural changes across gestation: from a smooth surface, to progressively bearing more folds [1]. This process follows a precise schedule, and delays are indicative of impaired brain maturation. Thus, the presence of a cerebral abnormality may be manifested by structural deviations from the norm. In order to detect such developmental deviations, an individual’s image can be compared against an atlas that is representative of the healthy population. Atlases of the developing brain have been developed from magnetic resonance (MR) image data collected from infant and fetal subjects (reviewed in [2]). These atlases have provided a representation of brain anatomy in the womb, and have facilitated tissue segmentation, thereby enabling studies of structural growth and aiding the detection (or characterization) of fetal pathologies [2]. However, given that ultrasound (US) imaging forms one of the first steps in perinatal monitoring, there is still a need to create an ultrasound-specific atlas for use in routine care. This work presents a tool to automatically generate an atlas from 3D US images of the fetal brain.

The standard approach to construct an anatomical atlas is to perform one of state-of-the-art pairwise deformable registration algorithms [3] between the chosen (reference) volume and the remaining volumes from a data set. Such an approach is simple and easily scalable to large data sets, however it introduces a bias to registration results due to the selection of reference volume. That is, if the selected reference volume is an outlier then all registrations will estimate implausible transformations. Additionally, the transformation estimated using a pairwise approach accumulates inverse consistency and transitivity errors [4], which could be propagated to any subsequent analysis. Different approaches have been proposed to reduce transformation errors when building atlases, including statistical deformation models [5], linear [6] or geodesic [7, 8] averaging of the transformations and intensity to produce the atlases. Approaches with simultaneous registration (i.e. groupwise registration) of all volumes in a dataset have been shown to reduce the bias introduced by selection of a fixed reference volume, and errors in the estimated displacement fields.

Developing intensity-based methods for registration of ultrasound images is challenging due to strong intensity inhomogeneities within tissues, and the presence of shadows, which cause partial, low-contrast boundaries. Local phase [9] and feature asymmetry (derived from the monogenic signal [10]) extract contrast-invariant structural information, and have been shown to improve analysis of US images in several tasks. Specifically, feature asymmetry (FA) has the potential to enhance tissue boundaries, and as such, has been extensively used to process fetal ultrasound data, where there are large structural changes (e.g. [11, 12]). A hybrid intensity and local phase representation of US images has been applied to tumour tracking in 2D liver US [13], showing overall improved registration accuracy. Realizing the potential of FA to enhance and sharpen the sonographic landmarks necessary for accurate registration, this work explores its inclusion as additional image channels for US atlas construction.

In this paper, we present a framework to construct the first 3D atlas of the fetal brain using non-rigid groupwise registration of US images. Our framework extends a standard groupwise registration [4] to its multi-channel counterpart using a composite image representation to improve the registration of tissue interfaces in US data. The proposed multi-channel image representation comprises of ultrasound intensities and features extracted using different FA scales, thereby representing boundaries of different sizes in a multiscale manner. The presented evaluation shows that the presented method is capable of constructing an atlas from fetal brain US data, and at the same time, the performed quantitative analysis shows that our method outperforms standard intensity-based, and single channel image registration methods.

2 Materials and Methods

2.1 Fetal Dataset and Preprocessing

The fetal US images used in this work comprised of 39 volumes (\(247 \times 190 \times 179\) voxels) with known age of 23 gestational weeks (GW). The sonographic volumes of the fetal head were obtained from the INTERGROWTH-21\(^{\text {st}}\) study database [14], which were collected using a Philips HD9 curvilinear probe at a 2–5 MHz wave frequency. After alignment [15], all volumes were resampled to an isotropic voxel size (\(0.6\times 0.6 \times 0.6\) mm) and resized to \(160\times 160\times 160\) voxels.

2.2 Atlas Construction

Given a set of M images, the goal of atlas construction is to find a set of transformations \(\mathcal {T}\), each of which maps its corresponding image \(\mathbf {I}_m\) to a common reference space: \(\mathcal {T}: \lbrace \varvec{T}_{mR} : \mathbf {I}_m \mapsto \mathbf {I}_R, m=1,\ldots ,M\rbrace \). This typically comprises of two steps: a global transformation to correct for size and growth differences, followed by a non-rigid registration to account for local morphological differences.

Sonographic scans of the fetal head were first rigidly aligned using the method proposed in [15]. Briefly, a slice-wise classifier segmented the skull boundaries and predicted the relative position of the slice in the brain volume. This information was then combined to estimate a similarity transformation modelling 9 degrees of freedom (namely, rotation, translation, and isotropic scaling) to linearly register all volumes to a standard (atlas) space. One of the challenges of processing fetal brain US is that the ultrasound signal is attenuated by the cranial bones in its path, and the concave shape also refracts it and creates reverberation artifacts. This affects the visibility of anatomical boundaries, particularly in the cerebral hemisphere proximal to the US probe. Since only one of the hemispheres has clearly visible structures, this hemisphere is mirrored across the midsagittal plane. This generates a complete representation of the brain, thereby making an assumption of brain symmetry, for simplicity [16].

The non-rigid image registration used in this work, is built on the groupwise deformable registration proposed in [4, 17]. The implicit reference groupwise registration reduces bias introduced by selection of a reference volume by jointly estimating the transformation between all volumes in the dataset to an unknown reference volume. This process is defined as the following optimization problem:

$$\begin{aligned} \arg \min _{\varvec{u}}\left( \sum ^{M}_m\sum ^{M}_{n, n\ne m}\int _{\varOmega }Sim\left( \mathbf {I}_m(\varvec{T}_{mR}(\varvec{x})), \mathbf {I}_n(\varvec{T}_{nR}(\varvec{x})) \right) d\varvec{x} ~+\right. \end{aligned}$$
(1)
$$\begin{aligned} \left. \alpha \sum _m^{M}\int _{\varOmega }Reg(\varvec{u}_{mR}(\varvec{x}))d\varvec{x} \right) \end{aligned}$$
(2)

where Sim and Reg denote the similarity measure and regularisation term, respectively, \(\alpha \) is the weighting parameter, \(\varvec{T}_{mR} = \varvec{x}+\varvec{u}_{mR}(\varvec{x})\) (or \(\varvec{T}_{nR} = \varvec{x}+\varvec{u}_{nR}(\varvec{x})\)) is the transformation from volume \(\mathbf {I}_m\) (or \(\mathbf {I}_n\)) to the implicit reference volume \(\mathbf {I}_R\) at spatial position \(\varvec{x}\). The \(\varvec{u}_{mR}\) (or \(\varvec{u}_{nR}\)) represents a subsequent displacement field, \(\varOmega \) is the volume domain, and M is the number of volumes to be registered. The implicit reference volume is iteratively updated based on all the volumes deformed during the displacement field estimation process. In this work, we choose the diffeomorphic Demons framework [18], where optimisation iteratively alternates between minimising the energy related to the similarity measure Sim and the regularization term Reg performed via Gaussian smoothing of the estimated displacement fields. In order to establish anatomically meaningful correspondences between brain US volumes, we replace state-of-the-art intensity differences used in the classic Demons by a multi-channel feature-based representation of the US volumes. The implementation details for efficient Demons-like implicit reference groupwise registration can be found in [19].

2.3 Feature Extraction

The monogenic signal uses the Riesz transform to generate a representation of an image in the frequency domain [20]. By applying an appropriately selected bandpass filter (\(f_{\lbrace o,e\rbrace }\)), the signal can be decomposed into local structural (phase and orientation) and energetic (amplitude) information. The phase component extracts contrast-invariant, structural information, which is particularly useful in recovering feature asymmetry (FA) [21]. FA is a measure of the extent to which a structure around an image voxel is locally asymmetric, thus representing a step-edge [10, 21]. The FA edge image \(\mathbf {\hat{I}}\) of an input image \(\mathbf {I}\) is recovered as:

$$\begin{aligned} \mathbf {\hat{I}} = \frac{\lfloor \vert f_{o,\lambda }( \mathbf {I}) \vert - \vert f_{e,\lambda }(\mathbf {I})\vert - t \rfloor }{\sqrt{f_{o,\lambda }(\mathbf {I})^2 + f_{e,\lambda }(\mathbf {I})^2} + \varepsilon } \end{aligned}$$
(3)

where \(\lambda \) represents the filter scale, \(f_o\) and \(f_e\) represent the odd and even parts of the signal, t is a threshold that controls the sensitivity of the response, \(\lfloor \cdot \rfloor \) sets negative values to zero, and \(\varepsilon \) is a filter regularization parameter which prevents division by zero.

FA allows edge features to be obtained at different centre-wavelengths, \(\lambda \). The centre frequency is equivalent to the scale of the bandpass filter \(f_{\lbrace o,e\rbrace ,\lambda }\) (i.e. size of structures of interest) used to the calculate the monogenic signal. In fetal brain US images, the anatomical boundaries appear as step-edges and ridge-like structures, which are best extracted with a log-Gabor filter [11]. Given an US image, \(\mathbf {I}\), a corresponding FA edge image \(\mathbf {\hat{I}}\) is defined to highlight structural boundaries [10, 21]. An FA image typically detects thick edges which are thinned by applying non-maximum suppression for improved boundary localization. Here, we explore the effect of supplementary structural information for atlas construction by varying the centre frequency at which the monogenic signal was recovered from the images, \(\lambda = \left[ 0.025, 0.425 \right] \) (Fig. 1). The other parameters were empirically set to \(t=0.5\) and \(\varepsilon =10^{-6}\).

Fig. 1.
figure 1

Schematic of a coronal view of the fetal brain at 23 GW, and a typical US scan. Feature asymmetry edge images are shown at varying centre-frequency wavelengths, \(\lambda =\lbrace 0.075, 0.125, 0.275 \rbrace \), overlaid in yellow. (Color figure online)

2.4 Evaluation Metrics

Ten brain volumes were linearly registered using [15] and anatomical regions were manually segmented and verified by an expert with 10 years’ experience and a senior sonographer. The segmented regions of interest (\(\mathbf {V}^k\)) included the brain stem (BS), cavum septum pellucidum (CSP), thalamus (TH), and white matter (WM) (Fig. 2c). In order to evaluate registration performance, we compute the average relative overlap (ARO) for each of the \(K=4\) regions as follows [4]:

$$\begin{aligned} ARO = \frac{ 1 }{ N(N-1)} \sum ^N_{\begin{array}{c} j=1,\\ j\ne i \end{array}} \sum ^N_{i=1} \frac{ \sum _K \mathbf {V}^k_i \cap \mathbf {V}^k_j}{ \sum _K \mathbf {V}^k_i \cup \mathbf {V}^k_j} \end{aligned}$$
(4)

where \(\mathbf {V}^k_i = \mathbf {V}^k_i\left( \mathcal {T}_{iR}(\mathbf {I})\right) \), K is the number structures, and N is the number of annotated volumes.

All experiments were performed on an Intel i7 2.80 GHz quad-core machine (32 GB RAM) with a C++ implementation of the diffeomorphic Demons algorithm.

3 Results and Discussion

3.1 Registration of Anatomical Structures

In the first experiment, we explored different formulations of the groupwise registration algorithm to construct an atlas that maximizes anatomical correspondence between the neurosonographic images. By varying the diffusion parameter of the Demons-like forces (\(\sigma _d=\left[ 0.25,5.0\right] \)), we find that the best performance is achieved with \(\sigma _d=1.0\) (ARO \(>0.868\) for WM, Fig. 2a), and gradually decreases as \(\sigma _d\) increases. This behaviour was observed regardless of input: single- or multi-channel.

Furthermore, we explored the effect of FA wavelength (scale) selection by varying \(\lambda \) from 0.025 to 0.425. Figure 2b shows the ARO averaged across all four structures. The groupwise approach outperformed the linearly aligned data, regardless of input. However, the multi-channel Demons (intensity + sFA) outperformed single-channel Demons (intensity) only for scales \(\lambda =\left\{ 0.075, 0.125, 0.175\right\} \). For further comparison, we explored the performance of a multi-channel Demons with multi-scale feature extraction by combining the features from the best FA scales (\(\lambda ={0.075, 0.125, 0.175}\)) into the second input channel. This yielded the highest structural overlap (\(\text {ARO}=0.8029 \pm 0.049\)) and was selected as the best method for atlas construction.

Figure 2c shows the result of applying the groupwise registration to the set of ten volumes for which corresponding segmentations were available. The atlases were constructed by averaging the images after affine registration, and then further transformed by the groupwise registration. There is high consensus between the structures, as observed in the probability maps, which is corroborated by the groupwise ARO of the resulting atlas (Table 1).

To further examine the anatomical agreement recovered by the groupwise registration algorithm, we compared the average segmentation maps obtained by transforming the annotated images, with a segmentation of the resulting groupwise atlas. High volumetric overlap was also observed for all four structures (mean \(\text {ARO}=0.8580 \pm 0.036\)).

Fig. 2.
figure 2

(a) Registration regularization parameter (\(\sigma _d\)) versus average relative overlap (ARO). (b) Different atlas construction methods plotted against ARO. (c) Resulting US atlas constructed using intensity and multi-scale FA from \(n=10\) volumes for which segmentations were available. Denser colour signifies higher overlap.

3.2 Construction of Population Brain Atlas

In the second experiment, we applied the proposed multi-channel groupwise registration algorithm to 39 brain volumes to construct an atlas. Figure 3 shows atlases constructed by averaging the images after the affine registration [15], and after non-rigid registration with either a single (intensity) channel, or multiple channels (i.e. intensity and feature asymmetry). It is evident that the atlases constructed with groupwise registration had higher anatomical definition, and more distinct boundaries. Structural clarity was even higher in the atlas constructed with multi-channel inputs.

Fig. 3.
figure 3

Visual comparison between fetal brain atlases constructed from \(n=39\) US volumes at 23 GW using affine (first column), intensity-based (second column), and multi-channel groupwise methods (third column). Comparison to MRI-based fetal atlas at 23 GW shows the presence of similar structures in both modalities [22] (yellow arrows), but some structures are better observed in US images (red arrows). (Color figure online)

Fig. 4.
figure 4

Volumetric US atlas with superimposed segmentation of four structures from a fetal MR atlas at 23 GW (obtained from Gholipour et al. [22]).

In order to visualize the structural variation within the healthy fetal cohort at 23 GW, we performed principal component analysis (PCA) of the deformation fields estimated from the 39 volumes. Figure 5 demonstrates the first four components, altogether explaining 65.6% of the variation at this gestational age. PC1, PC2, and PC4 display variations in anatomical shape and global eccentricity of the brain. Small changes in ventricular shape are particularly observed around the posterior lateral ventricles and the cortical plate in PC1. Nonetheless, all modes of variation demonstrate realistic representations of the brain at 23 GW.

Fig. 5.
figure 5

Principal component analysis (PCA) result display the mean brain ±3 standard deviations (i.e. \(\mu \pm 3\sigma \)) for the first four components. All modes show realistic representations of the brain.

Table 1. Average relative overlap for all four structures on 10 annotated volumes. Intensity: single-channel Demons. Intensity+sFA: multi-channel, single-scale demons (\(\lambda =0.075\)). Intensity+mFA: multi-channel, multi-scale Demons with FA scales \(\lambda =\lbrace 0.075,0.125, 0.175\rbrace \).

3.3 Comparison to Existing Fetal Atlas

In order to assess the quality of the resulting atlas, Fig. 4 compares our US-specific atlas with a MRI-based fetal template at 23 GW generated by Gholipour et al. [22]. The contours of the latter are superimposed on the US-based atlas constructed from \(n=39\) volumes. Here, we can visually determine that despite there not being a direct intensity mapping between the two, there is a good match between the shape and location of the structures in both modalities at this gestational week. This illustrates the complementarity between the modalities, and presents opportunities to transfer anatomical insights from one to the other, and the possibility to facilitate analysis of fetal brain US with information contained within MRI models of development (e.g. [22]). The fact that structures such as the basal ganglia are better visible in the US-based atlas (Fig. 3) also presents new opportunities for use of neurosonographic data to study structural development.

4 Conclusion

In this paper, we present the first fetal brain atlas constructed from US data. The proposed multi-channel Demons formulation takes as input an image with intensity and feature-enhanced channels. It was shown to outperform a single-channel Demons registration approach, generating high structural overlap in the resulting atlas. Comparison with an age-matched MR atlas demonstrated similarities in the shape and presence of key anatomies in both imaging modalities, but also revealed new structures that are better observed in the US atlas. Given the formulation of the proposed method, it is expected that it should extend to a broader gestational age range.