1 Introduction

Automatic anatomical labeling approaches for tubular-like structures has been investigated for a couple of decades. Methods exist from 2D atlas registration [6], 3D branch matching [8], to the maximum likelihood estimation [7], etc. These methods are applicable to tree-like structures with short and straight branches, such as the airways and abdominal aorta. However, they are not feasible for structures with large variations in geometry or/and topology. For example, the Circle of Willis (CoW) is genus 1 in terms of topology, with total of 83 variations [4]. Majority of human have one or more missing arteries in their CoW. Additionally, vessels can be twisted and intertwined, which create complex geometries with large range in branch length, curvature, torsion and radius. These rise difficulties for atlas based labeling approaches [1]. Even for the same person, his/her vessel structure and morphology change over time. All these make the automatic CoW labeling a challenging problem.

In [2], a maximum a posteriori (MAP) classification is firstly used to identify five branches. It then combines the MAP with a graph matching method to label the CoW in the form of three separate trees. The MAP inferences in [9, 10] are formulated as a quadratic binary programming problem. This formulation can handle non tree-like vasculature with high efficiency. It simultaneously segments and labels CoW with an integer programming. However, their methods do not differentiate symmetrically located bifurcations.

This paper describes a novel and generic approach to anatomically label vascular structures. Our method first extracts bifurcations of interest from the centerlines of vessel structures, where a set of geometric features are calculated. Then the probability distribution of these bifurcations is generated using an XGBoost classifier. Finally a Hidden Markov Model (HMM) with a restricted transition strategy is constructed to find the most likely labeling configuration of the whole structure, while constraining all branches or bifurcations with a feasible topology. Our main contribution is threefold: (1) Coordinate independent geometrical features are used to describe the bifurcations, which alleviate the global alignment that is required in other approaches. (2) XGBoost method is used to learn the probability distribution of bifurcations of interest as well as those not of interest. (3) The topology identification is considered as a matching problem which is solved by an HMM.

2 Method

Problem Formulation. Each vascular structure is modeled as a centerline graph together with a set of radii corresponding to all centerline points. Based on this initial setting, although many features like centerline length, curvature, etc., could be calculated, these features alone are not sufficient to classify each anatomical structure. We propose to classify bifurcation points based on extracted geometrical features associated with them. That is, for each bifurcation point, we calculate features from the three branches, which are uniquely identified as the Head, Left and Right, connected to the bifurcation as well as a few combined features derived from them. Each vascular structure is modeled as an undirected weighted graph \(G=(V,E,A)\), where a sets of vertex \(V = \left\{ v_1,\ldots , v_n \right\} \) denote the bifurcation points of the structure, \(E\subset V\times V\) is a set of edges representing vessel connections with these bifurcations, and \(A:V \mapsto \mathbb {R}^f\) are features associated with V. The bifurcation \(c = \left\{ V_c, E_{c\_head}, E_{c\_left}, E_{c\_right}, A_c\right\} , c=1,\ldots ,f\), is generated by G. The vessel labeling problem can be considered as a bifurcation mapping problem, based on an available graph set \(\{\hat{G}\}\) as well as known bifurcation set \(\{\hat{C}\}\). All these form a knowledge base \(\mathcal {K}\). For a given target graph \(G^t\), the target labeling is defined as \(L: C^t \mapsto \{\hat{C}\}\,\cup \,\{\varnothing \}\), where \(C^t = \left\{ c_i^t: i = 1,\ldots ,m\right\} \)represents bifurcation sets on the \(G^t\), and the label \(\varnothing \) represents certain bifurcation point that is not of interest. Obviously, a surjective mapping to \(\varnothing \) is allowed. As for the remaining subset of \(C^t\), the inclusion map \(C^t \mapsto \{\hat{C}\}\) is injective but not necessarily surjective. It is possible that multiple solutions that fulfill these conditions exist. Let the joint probability distribution of every possible solution be \(P(C^t, L|\mathcal {K})\). We aim at finding the label \(L^*\) with the maximum a posteriori (MAP) probability:

$$\begin{aligned} L^{*}=\arg \max _{L\in \mathcal {L}} {P(C^t, L|\mathcal {K})}, \end{aligned}$$
(1)

where \(\mathcal {L}\) denotes the labeling solution space.

The Proposed Model. By observing the vascular anatomy, it seems that anatomical bifurcations can only appear in a certain order, starting from a reference point, e.g. internal carotid artery and vertebral basilar artery in the CoW. However, there always exists anatomical variabilities. For example, CoW exhibits large variability in its topology [2], including the absence of bifurcation points of interests. This also means that the absence of one bifurcation can imply that those further downstream cannot be presented either. Despite of those variations, it is still helpful to impose topology constraints during the inference.

We propose to formulate the labeling process as an HMM with a restricted transition strategy. In our model, a bifurcation’s label is assumed to depend only on those of its immediate neighbors while independent on the rest. Therefore, the bifurcation points with feature sets can be considered as a sequence of observations. Their labels are considered as a state sequence with state transition probabilities as topology constraints from the prior training population. Feature set \(A^t\) generated by \(G^t\) gives information about the sequence of states. Let \(S = \left\{ s_i\right\} ,s_i \,\in \, \left\{ l_1,\ldots ,l_k\right\} \,\cup \,\{\varnothing \}\) be the state space, containing labels of different anatomical bifurcation points and a null label \(\varnothing \) for those not of interest. Let \(O = \left\{ o_i\right\} ,o_i \,\in \,\left\{ c_1,\ldots ,c_m\right\} \) be the observation space of all anatomically possible bifurcations. For a given observed sequence of length L, \(Y = \left\{ y_1,\ldots ,y_L\right\} \), \(y_i \,\in \,O\), we can extract its corresponding state sequence \(X = \left\{ x_1,\ldots ,x_L\right\} , x_i \,\in \,S\) from \(\{\hat{G}\}\). Then, we can define the state transition probability matrix Q and the output probability matrix B with dimensions equal to that of the state space:

$$\begin{aligned}&Q = \begin{bmatrix} q_{ij}\end{bmatrix}_{k*k}, q_{ij} = P(x_{t+1} = s_j|x_t = s_i)\nonumber \\&B = \begin{bmatrix} b_{ij }\end{bmatrix}_{k*m}, b_{ij} = P(y_{t} = o_j|x_t = s_i) \end{aligned}$$
(2)

The initial probabilities on the state space is defined as \(\varPi = \left\{ \pi _i\right\} _k,\pi _{i} = P(x_{1} = s_i)\). Following the Markov property, we have:

$$\begin{aligned}&P(x_t|x_1,x_2,\ldots ,x_{t-1}) = P(x_t|x_{t-1}), \text { and} \nonumber \\&P(y_t|y_1,y_2,\ldots ,y_{t-1},x_1,x_2,\ldots ,x_t) = P(y_t|x_t). \end{aligned}$$
(3)

The problem in Eq. 1 can be formulated as: \(X^{*}=\arg \max _{X} {P(Y, X)}\), which can be solved by the Viterbi algorithm [5]. Let \(V_{i,k}\) be the maximum probability of a sequence with length i and end state \(s_k\). With dynamic programming, we have:

$$\begin{aligned} V_{1,k}&= P(y_1|s_k)\cdot \pi _k \nonumber \\ V_{i,k}&= P(y_i|s_k)\cdot \max _{x \in [1,L]}(q_{x,k} \cdot V_{i-1,x}). \end{aligned}$$
(4)

The solution can be found: \(x_L = \arg \max _{x \in [1,L]}V_{L,x}\). The critical task is to convert the graph \({G^t}\) into a chain, and to find the optimal implicit state sequence Y by the observed sequence X and model. It remains to calculate Q and B (Eq. 2).

Note that the topology of graph \({G^t}\) is always different, due to the fact that, some branches may be missing in certain cases, and some branches may not be what we are interested in. Such heterogeneity could be caused by either the patient anatomy or the process of segmentation and skeletonization. Bifurcations are “sorted” by the angles between each vector from E (Fig. 1(a)) to a bifurcation and a specific reference vector. For example, assuming that the CoW is complete, the order can normally be determined by traversing. For seemingly “multiple-tree” structures that are split by the gap of some missing blood vessels, and if a bifurcation is not connected with the others, the next one in the clock-wise order can be considered as the next bifurcation. With this rule, all bifurcations can be virtually connected even with missing branches. Additionally in this way, even if symmetric bifurcations have similar geometric features, they have different position in the chain. For any given sequence Y, except the \(\varnothing \), the probability of each elements \(s_i\) in state space appear in a particular location is different. The vascular structure of interest can be regarded as a Markov chain, including the CoW where multiple trees maybe found instead of a “circle”.

Then for the prediction step, we construct the chain model of the output/observed sequence, \(P(x_t = s_i)\) can be calculated from the occurrence rate of \(s_i\) in the training set, and \(P(x_t = s_i|y_{t} = o_j)\) can be obtained from a bifurcation label classifier trained on \(\hat{C}\).

Table 1. Geometrical features defined on vessel centerline models.

Bifurcation Label Classifier. In order to estimate the likelihood of a bifurcation \(o_j\) with label \(s_i\), \(P(s_i|o_j)\), we propose to use the XGBoost algorithm with probability estimation [3]. It is an improved gradient tree boosting based on classification and regression tree. This method is particularly suitable for our vascular datasets which spans in large space but also with larger sparsity. The connected three branches are uniquely identified, such that f features of each branch could be combined to 3f features for the corresponding bifurcation. Since the CoW ring lies approximated on a plane, the relative positions of the branches around any bifurcation are the same. Then we just consider a branch, whose one end is on the circle structure and the other not, as the Head one. With the bifurcation as the origin, from the Head branch anti-clockwise in the ring plane we can identify the children branches (Fig. 1(b)). Specifically, 27 coordinate invariant features are calculated such that the global alignment can be avoided (Table 1).

3 Experiments and Results

Data and Technical Details. Our proposed approach has been evaluated on the public dataset distributed by the MIDAS Data Server at Kitware Inc. It contains 50 MRA images of the cerebral vasculature from healthy volunteers together with theirs segmentations and centerlines [2], as well the ground truth manually indicted by an expert. We first prune the centerline model to a region around the CoW. This results in about 18 bifurcations in each case. In this step, we use features of Euclidean distance to three base points (i.e. where blood enters the whole CoW) and the smallest average radius of the bifurcation. Due to the heterogeneity of the different cases, according to the expert’s experience, we are mainly interested in 11 bifurcation points, which are labeled as A-K in Fig. 1. In the 27 features reported in Table 1, tor is removed by a Least Significant Difference metric, which results in 24 features are calculated. In the approximate plane of the CoW, the orthogonal vectors of tangent from the bifurcation point of the BA branch is considered as the reference vector. Branches are sorted by the angles between each vector from E to a bifurcation and the reference vector.

Fig. 1.
figure 1

(a) A sketch of CoW, together with the 11 labels (A-K) of the bifurcations of interest. (b) A sketch of the basis features (as in Table 1) of one bifurcation. Only the corresponding ones for one of the three branches are indicated, those of the other two can be similarly defined.

Evaluation. A leave-one-out cross-validation was performed to asses the performance of the proposed method. Exemplar labeled segmentation of a normal CoW (Fig. 2(a)) and a varied CoW (Fig. 2(c)) are shown, where branches of interest, i.e. those determined by the 11 bifurcations of interest, are color-coded, with grey-colored ones being not of interest. We could see that our approach is able to handle topological disturbances around the CoW (e.g. grey-colored small branches in the enlarged views).

Fig. 2.
figure 2

Color-coded example labeling results of: (a)(c) normal and (b)(d) varied CoW models (with grey-colored ones being not of interest).

Table 2. Evaluation comparison of the 11 bifurcations of interest (Fig. 1) with accuracy (A), precision (P), recall (R) and specificity (S) are reported for the proposed method with (Ours) and without (Ours w/o) the topological constraints, as well as for the best performing state-of-the-art methods [2, 10].

Table 2 shows the accuracy, precision, recall and specificity for each bifurcation of interest separately, using our method with and without the topological constraints. They are compared with two best performing methods [2, 10]. In general, our method offers higher accuracy and specificity, while those by [2] offers higher recall. These suggests that our method also tries to label both bifurcations of and not of interest correctly. This is probably more suitable when the data is more generic and not trimmed only for those of interests. But for datasets that contain only relevant branches, the method by [2] could perform better. With these said, for all of the bifurcations, the differences between the methods are not significant, as have also been reported in [10]. However, note that the method by [10] does not distinguish symmetrical bifurcation points, which are one of the main error sources in the CoW classification problem. Thus, we do not intend to compare and interpret their numbers directly. The error source could be seen in the plotted confusion matrices shown in Fig. 3, where the “confused” symmetrically located bifurcation points could be seen, e.g. J and K, D and F. From the confusion matrices, we could see that the bifurcation points J and K are the most wrongly classified. In addition to confusing each other, it is due to the difference in the length of the MCA-1 vessel and the presence of other bifurcations. The errors of the bifurcation A, I, D and F are mainly caused by missing vessels, e.g. AcoA and PCoA. More interestingly, due to the obvious features of ICA, the bifurcation C and G are hardly affected by the reason above.

Fig. 3.
figure 3

The confusion matrices with (left), without (right) topological constraints.

4 Conclusions

We have proposed a supervised learning method for anatomical labeling of vascular branches. It learns the probability distribution of every anatomical branch bifurcation with a XGBoost classifier, which is then used as inputs to a Hidden Markov Model so as to enforcing the topology consistency. Our method is able to solve a labeling problems of trees, loops and chains. Furthermore, it has demonstrated that it can handle a large anatomical variability such as those present in the topology of the CoW, and is able to map labels to cases containing different subsets of bifurcation points of interest. The leave-one-out cross validation performed on 50 cases has shown higher accuracy that the best performing state-of-the-art methods. Additionally, our method uses coordinate independent features, reducing the need for a global rigid registration step before the labeling process. Pending issues such as validations on larger population, on structures with different topologies, the parallelization of the bifurcation classifier and the optimization of topological constrained model fall into our future work.