Keywords

1 Introduction

In recent years, the study of structural and functional brain connectivity has expanded rapidly. Following the rise of diffusion and functional MRI, connectomics has unlocked a wealth of knowledge to be explored. Almost synonymous with the connectome is the network-theory based representation of the brain. In much of the recent literature, the quantitative analysis of connectomes has focused on region-to-region connectivity. This paradigm equates physical brain regions with nodes in a graph, and uses observed structural measurements or functional correlations as a proxy for edge strengths between nodes.

Critical to this representation of connectivity is the delineation of brain regions, the parcellation. Multiple studies have shown that the choice of parcellation influences the graph statistics of both structural and functional networks [15, 17, 18]. It remains an open question which of the proposed parcellations is the optimal representation, or even if such a parcellation exists [14].

It is thus useful to construct a more general framework for cortical connectivity, one in which any particular parcellation of the cortex may be expressed and its connectivity matrix derived, and one in which the variability of connectivity measures can be modeled and assessed statistically. It is also important that this framework allow comparisons between parcellations, and representations in this framework must be both analytically and computationally tractable. Since several brain parcellations at the macroscopic scale are possible, a representation of connectivity that is independent of parcellation is particularly appealing.

In this paper, we develop such a general framework for a parcellation independent connectivity representation, building on the work of [8]. We describe a continuous point process model for the generation of an observed tractFootnote 1 (streamline) intersections with the cortical surface, from which we may recover a distribution of edge strengths for any pair of cortical regions, as measured by the inter-region tract count. Our model is an intensity function over the product space of the cortical surface with itself, assigning to every pair of points on the surface a point connectivity. We describe an efficient method to estimate the parameter of the model, as well as a method to recover the region-to-region edge strength. We then demonstrate the estimation of the model on a Test-Retest dataset. We provide reproducibility estimates for our method and the standard direct count methods [10] for comparison. We also compare the representational power of common cortical parcellations with respect to a variety of measures.

2 Continuous Connectivity Model

The key theoretical component of our work is the use of point process theory to describe estimated cortical tract projections. A point process is a random process where any realization consists of a collection of discrete points on a measurable space. The most basic of these processes is the Poisson process, in which events occur independently at a specific asymptotic intensity (rate) \(\lambda \) over the chosen domain [12]. \(\lambda \) completely characterizes each particular process, and is often defined as a function \(\lambda : \text {Domain} \rightarrow \mathbb {R}^+\), which allows the process to vary in intensity by location. The expected count of any sub-region (subset) of the domain is its total intensity, the integral of \(\lambda \) over the sub-region. In this paper, our domain is the connectivity space of the cortex, the set of all pairs of points on the surface, and the events are estimated tract intersections with the cortical surface.

2.1 Model Definition

Let \(\varOmega \) be union of two disjoint subspaces each diffeomorphic to the 2-sphere representing the white matter boundaries in each hemisphere. Further consider the space \(\varOmega \times \varOmega \), which here represents all possible end point pairs for tracts that reach the white matter boundary. We treat the observation of such tracts as events generated by an inhomogeneous (symmetric) Poisson process on \(\varOmega \times \varOmega \); in our case, for every event (xy) we have a symmetric event (yx).

Assuming that each event is independent of all other events except for its symmetric event (i.e., each tract is recovered independently), we model connectivity as a intensity function \(\lambda :\varOmega \times \varOmega \rightarrow \mathbb {R}^+\), such that for any regions \(E_1, E_2 \subset \varOmega \), the number of events is Poisson distributed with parameter

$$\begin{aligned} \mathcal {C}(E_1,E_2) = \iint _{E_1,E_2}{\lambda }(x,y)dxdy. \end{aligned}$$
(1)

Due to properties of the Poisson distribution, the expected number of tracts is exactly \(\mathcal {C}(E_1,E_2)\). For any collection of regions \(\{E_i\}_{i=1}^N = P\), we can compute a weighted graph \(\mathcal {G}(P,\lambda )\) by computing each \(\mathcal {C}(E_i,E_j)\) for pairs \((E_i,E_j) \in P \times P\). Each node in this graph represents a region, and each weighted edge represents the pairwise connectivity of the pair of nodes (regions) it connects. We call P a parcellation of \(\varOmega \) if \(\bigcup _i E_i = \varOmega \) and \(\bigcap _iE_i\) has measure 0 (\(\{E_i\}\) is almost disjoint).

2.2 Recovery of the Intensity Function

A sufficient statistic for Poisson process models is the intensity function \(\lambda (x,y)\). Estimation of the function is non-trivial, and has been the subject of much study in the spatial statistics community [3]. We choose to use a non-parametric Kernel Density Estimation (KDE) approach due to an efficient closed form for bandwidth estimation described below. This process is self-tuning up to a choice of desiderata for the bandwidth parameter.

We first inflate each surface to a sphere and register them using a spherical registration (See Sect. 3.1); however this entire method can be undertaken without group registration. We treat each hemisphere as disjoint from the other, allowing us to treat \(\varOmega \times \varOmega \) as the product of spheres \((S_1 \cup S_2) \times (S_1 \cup S_2)\). Throughout the rest of the paper D denotes a dataset containing observed tract endpoints \((x,y)_i\), and \(\hat{\lambda }\) denotes our estimation of \(\lambda \).

The unit normalized spherical heat kernel is a natural choice of kernel for \(\mathbb {S}^2\). We use its truncated spherical harmonic representation [1], defined as follows for any two unit vectors p and q on the 2-sphere:

$$\begin{aligned} K_\sigma (p,q) = \sum _h^H \frac{2h + 1}{4\pi }\exp \{-h(h+1)\sigma \}P^0_h(p\cdot q) \end{aligned}$$

Here, \(P_h^0\) is the \(h^{th}\) degree associated Legendre polynomial of order 0. Note that the non-zero order polynomials have coefficient zero due to the radial symmetry of the spherical heat kernel [1]. However, since we are estimating a function on \(\varOmega \times \varOmega \), we use the product of two heat kernels as our KDE kernel \(\kappa \). For any two points p and q, the kernel value associated to a end point pair (xy) is \(\kappa ((p,q)|(x,y)) = K_\sigma (x,p)K_\sigma (y,q)\). It is easy to show that \(\int _{\varOmega \times \varOmega } K_\sigma (x,p)K_\sigma (y,q)dpdq = 1\).

The spherical heat kernel has a single shape parameter \(\sigma \) which corresponds to its bandwidth. While in general tuning this parameter requires the re-estimation of \(\hat{\lambda }\) at every iteration, by rewriting our kernel we can memoize part of the computation so that we only need to store the sum of the outer products of the harmonics. Writing out \(\kappa ((p,q)|D) = \sum _{(x_i,y_i) \in D}K_\sigma (x_i,p)K_\sigma (y_i,q)\), we have the following:

Thus, evaluations of the kernel at any point (pq) can be done quickly for sequences of values of \(\sigma \). We then are left with the choice of loss function. Denoting the true intensity function \(\lambda \), the estimated intensity \(\hat{\lambda }\), and the leave-one-out estimate \(\hat{\lambda }_i\) (leaving out observation i), Integrated Squared Error (ISE) is defined:

$$\begin{aligned} ISE(\sigma | D) =&\int _{\varOmega \times \varOmega } (\hat{\lambda }(x,y|\sigma ) - \lambda (x,y))^2 dxdy\\ \approx&\int \hat{\lambda }(x,y|\sigma )^2 dxdy - \frac{2}{|D|} \sum _{(x_i,y_i) \in D} \hat{\lambda }_{i}(x_i,y_i) + Constant. \end{aligned}$$

Hall and Marron [9] suggest tuning bandwidth parameters using ISE. In practice, we find that replacing each leave-one-out estimate with its logarithm \(\log \hat{\lambda }_{i}(x_i,y_i)\) yields more consistent and stable results.

2.3 Selecting a Parcellation

Given an estimated intensity \(\hat{\lambda }\) and two or more parcellations \(P_1,P_2,\dots \), we would like to know which parcellation and associated graph \(\mathcal {G}(P,\hat{\lambda })\) best represents the underlying connectivity function. There are at least two perspectives to consider.

Approximation Error: Because each \(P_i\) covers \(\varOmega \) (and \(P_i \times P_i = \varOmega \times \varOmega \)), each \(\mathcal {G}(P_1,\hat{\lambda })\) can be viewed as a piece-wise function \(g:\varOmega \times \varOmega \rightarrow \mathbb {R}^+\), where \(g(x,y) = \frac{1}{|E_i||E_j|}\mathcal {C}(E_i,E_j)\) such that \(x\in E_i\) and \(y \in E_j\). In other words, g is the constant approximation to \(\lambda \) over every pair of regions. A natural measure of error is another form of Integrated Squared Error:

$$\begin{aligned} Err(\hat{\lambda },\mathcal {G}(P_1,\hat{\lambda })) = \iint _{\varOmega \times \varOmega } (g(x,y) - \lambda (x,y))^2 dxdy. \end{aligned}$$
(2)

This is analogous to squared loss (\(\ell _2\)-loss).

Likelihood: An alternative viewpoint leverages the point process model to measure a likelihood:

$$\begin{aligned} \log \mathcal {L}(P) = \sum _{E_i,E_j \in P} \log \text {Poisson}( |\{(x,y)_i \in D: x \in E_i, y \in E_j\}| ; \mathcal {C}(E_i,E_j)). \end{aligned}$$
(3)

Here, the independence assumption plays a critical role, allowing pairs of regions to be evaluated separately. Unfortunately this is biased toward parcellations with more, smaller regions, as the Poisson distribution has tied variance and mean in one parameter. A popular likelihood-based option that somewhat counterbalances this is Akaike’s Information Criterion (AIC),

$$\begin{aligned} AIC(P) = -2\log \mathcal {L}(P) + \left( \begin{array}{c}|P|\\ 2 \end{array}\right) \log |D|. \end{aligned}$$
(4)

AIC balances accuracy with parsimony, penalizing overly parameterized models - in our case, parcellations with too many regions.

3 Application to CoRR Test-Retest Data

We demonstrate the use of our framework on a test-retest dataset. We measure connectome reproducibility using Intraclass Correlation (ICC) [13], and compare three parcellations using multiple criteria (See Eqs. 2, 3, and 4).

3.1 Procedure, Connectome Generation, and Evaluation

Our data are comprised of 29 subjects from the Institute of Psychology, Chinese Academy of Sciences sub-dataset of the larger Consortium for Reliability and Reproducibility (CoRR) dataset [19]. T1-weighted (T1w) and diffusion weighted (DWI) images were obtained on 3T Siemens TrioTim using an 8-channel head coil and 60 directions. Each subject was scanned twice, roughly two weeks apart.

T1w images were processed with Freesufer’s [4] recon-all pipeline to obtain a triangle mesh of the grey-white matter boundary registered to a shared spherical space [5], as well as corresponding vertex labels per subject for three atlas-based cortical parcellations, the Destrieux atlas [6], the Desikan-Killiany (DK) atlas [2], and the Desikan-Killiany-Tourville (DKT31) atlas [11]. Probabilistic streamline tractography was conducted using the DWI in 2 mm isotropic MNI 152 space, using Dipy’s [7] implementation of constrained spherical deconvolution (CSD) [16] with a harmonic order of 6. As per Dipy’s ACT, we retained only tracts longer than 5 mm with endpoints in likely grey matter.

We provide the mean ICC score computed both with and without entries that are zero for all subjects. When estimating \(\hat{\lambda }\) the kernels are divided by the number of tracks, and we use a sphere with unit surface area instead of unit radius for ease of computation. We threshold each of the kernel integrated connectomes at \(10^{-5}\), which is approximately one half of one unit track density. We then compute three measures of parcellation representation accuracy, namely ISE, Negative Log Likelihood, and AIC scores.

3.2 Results and Discussion

Table 1 shows a surprisingly low mean ICC scores for regular count matrices. This may be because ICC normalizes each measure by its \(s^2\) statistic, meaning that entries in the adjacency matrices that should be zero but that are subject to a small amount of noise – a few erroneous tracks – have very low ICC. Our method in effect smooths tracts endpoints into a density; end points near the region boundaries are in effect shared with the adjacent regions. Thus, even without thresholding we dampen noise effects as measured by ICC. With thresholding, our method’s performance is further improved, handily beating the counting method with respect to ICC score. It is important to note that for many graph statistics, changing graph topology can greatly affect the measured value [18]. While it is important to have consistent non-zero measurements, the difference between zero and small but non-zero in the graph context is also non-trivial. The consistency of zero-valued measurements is thus very important in connectomics.

Table 1. This table shows mean ICC scores for each connectome generation method. The count method - the standard approach - defines edge strength by the fiber endpoint count. The integrated intensity method is our proposed method; in general it returns a dense matrix. However, many of the values are extremely low, and so we include results thresholding the matrix, with and without elements that are zero for all subjects. Highest ICC scores for each atlas are bolded.
Table 2. This table shows the means over all subjects of three measures of parcellation “goodness”. The retest versions are the mean of the measure using the parcellation’s regional connectivity matrix (or the count matrix) from one scan, and the estimated intensity function from the other scan.

Table 2 suggests that all three measures, while clearly different, are consistent in their selection at least with respect to these three parcellations. It is somewhat surprising that the Destrieux atlas has quite low likelihood criteria, but this may be due to the (quadratically) larger number of region pairs. Both likelihood based retest statistics also choose the DK parcellation, while ISE chooses the Destrieux parcellation by a small margin. It should be noted that these results must be conditioned on the use of a probabilistic CSD tractography model. Different models may lead to different intensity functions and resulting matrices. The biases and merits the different models and methods (e.g. gray matter dilation for fiber counting vs streamline projection) remain important open questions (Figs 1 and 2).

Fig. 1.
figure 1

A visualization of the ICC scores for connectivity to Brodmann Area 45 (Destrieux region 14) for the Count connectomes (left) and the proposed Integrated Intensity connectomes (right). denotes a higher score.

Fig. 2.
figure 2

A visualization of the marginal connectivity \(M(x) = \int _{E_i} \hat{\lambda }(x,y) dy\) for the Left Post-central Gyrus region of the DK atlas (Region 57). The region is shown in on the inset. denotes higher connectivity regions with the region.

4 Conclusion

We have presented a general framework for structural brain connectivity. This framework provides a representation for cortical connectivity that is independent of the choice of regions, and thus may be used to compare the accuracy of a given set of regions’ connectivity matrix. We provide one possible estimation method for this representation, leveraging spherical harmonics for fast parameter estimation. We have demonstrated this framework’s viability, as well as provided a preliminary comparison of regions using several measures of accuracy.

The results presented here lead us to conjecture that our connectome estimates are more reliable compared to standard fiber counting, though we stress that a much larger study is required for strong conclusions to be made. Further adaptations of our method are possible, such as using FA-weighted fiber counting. Our future work will explore these options, conduct tests on larger datasets, and investigate the relative differences between tracking methods and parcellations more rigorously.