1 Introduction

Variations in the geometry and topology of the retinal vasculature can provide information about the health of the eye. Junctions in the retinal vasculature are important landmarks for extracting retinal vessel topology, and can also facilitate image registration. The appearance of junctions in fundus images can vary markedly with branching angle, vessel width and junction size, which complicates their detection.

Several previous studies have used algorithms designed to work on binary vessel maps, usually on vessel skeletons [3, 4, 6, 10]. Generally, these methods relied on examining intersection numbers in the skeletons [4], possibly accompanied by vessel widths, lengths and branching angles [6]. However, skeletons can misrepresent vessel topology because segmented vessel maps may contain gaps in vessels, as well as missing or false vessels.

Recently, deep learning methods have been widely used for medical image analysis. However, they have rarely been applied to junction detection in retinal vasculature. As far as we are aware, the only such study is Pratt et al.'s [10], where a popular network was trained as a classifier to identify image patches containing junctions. The image patches contained vessel skeletons generated from binary vessel maps. The major factor restricting the application of deep architectures to this task seems to be the shortage of training data: to our knowledge, only two image datasets with junction annotations are publicly accessible [1]. The task is also extremely skewed, with the fraction of junction pixels in a fundus image below \(10^{-4}\) [1]. These factors – and primarily the imbalanced nature of junction presence – complicate the training of deep networks with large numbers of parameters.

In this study, we introduce a multi-task network for junction detection in fundus images, which can deal with the shortage of labeled data and the highly skewed nature of the task. We also present a junction probability map, which significantly facilitates finding junctions by removing possible variations in vessel thickness. Experiments on DRIVE and IOSTAR show that our method outperforms Pratt et al.'s method [10], despite operating directly on fundus images.

2 Method

Our method has three stages: (i) training our multi-task network, (ii) initial junction search and (iii) refined junction search.

2.1 Learning Junction Patterns with Multi-task Network

The amount of labeled data for junction locations in retinal vasculature is not sufficient to successfully train a deep network with many parameters. In contrast, vessel segmentation has been realized with deep networks in an increasing number of studies [12]. To deal with the shortage of labeled data for junction detection, we propose a multi-task network that simultaneously generates label patches for vessel interior, centerline, edge locations and junction patterns.

The proposed network for this task is a fully connected network, initialized with weights learned by training a stack of Restricted Boltzmann Machines (RBMs) [8] on five types of image patches: fundus image patches and label patches for vessel interior, centerline, edges and junction patterns. Label patches can be combined, as suggested by Li et al. [9], to generate a likelihood map for the whole fundus image (see Fig. 1 and the sketch after it). After initialization, the network is trained patch-wise with the \(l_{2}\) loss function in Eq. (1):

$$\mathcal{L}=\|\mathcal{V}_{i}^{p}-\tilde{\mathcal{V}}_{i}^{p}\|_{2} + \|\mathcal{V}_{c}^{p}-\tilde{\mathcal{V}}_{c}^{p}\|_{2}+\|\mathcal{V}_{e}^{p}-\tilde{\mathcal{V}}_{e}^{p}\|_{2}+\|\mathcal{V}_{j}^{p}-\tilde{\mathcal{V}}_{j}^{p}\|_{2} \tag{1}$$

where \(\tilde{\mathcal {V}}_{i}^{p}\), \(\tilde{\mathcal {V}}_{c}^{p}\), \(\tilde{\mathcal {V}}_{e}^{p}\) and \(\tilde{\mathcal {V}}_{j}^{p}\) respectively represent estimates of ground truth label patches for vessel interior (\(\mathcal {V}_{i}^{p}\)), centerline (\(\mathcal {V}_{c}^{p}\)), edges (\(\mathcal {V}_{e}^{p}\)) and junction patterns (\(\mathcal {V}_{j}^{p}\)) by the network.
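As a concrete sketch, Eq. (1) can be written in PyTorch as below (the framework is our choice, not the paper's); the ordering of the four patches inside the 1024-unit output vector is an assumption.

```python
import torch

def multitask_l2_loss(pred, target):
    """Eq. (1): sum of l2 norms of the four label-patch residuals.
    pred/target: (batch, 1024) tensors holding the concatenated interior,
    centerline, edge and junction 16x16 patches (ordering assumed)."""
    loss = pred.new_zeros(())
    for k in range(4):  # four 256-unit patch slots
        d = pred[:, 256 * k:256 * (k + 1)] - target[:, 256 * k:256 * (k + 1)]
        loss = loss + d.pow(2).sum(dim=1).sqrt().mean()
    return loss
```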

Fig. 1. The generation of junction pattern label patches with a fully connected network with 5 hidden layers.
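To make the patch-combination step of [9] concrete, here is a minimal sketch that averages overlapping label-patch predictions into a whole-image likelihood map; the averaging rule and function names are our assumptions, as the exact merging operator is not spelled out here.

```python
import numpy as np

def patches_to_map(patch_preds, coords, image_shape, patch=16):
    """Average overlapping 16x16 label-patch predictions into a
    whole-image likelihood map. coords holds the top-left (row, col)
    of each predicted patch."""
    acc = np.zeros(image_shape, dtype=np.float64)
    cnt = np.zeros(image_shape, dtype=np.float64)
    for pred, (r, c) in zip(patch_preds, coords):
        acc[r:r + patch, c:c + patch] += pred
        cnt[r:r + patch, c:c + patch] += 1.0
    cnt[cnt == 0] = 1.0  # avoid division by zero outside covered area
    return acc / cnt
```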

One key advantage of this network is that, despite performing more than one task, it has relatively few parameters, slightly over one million, compared with many state-of-the-art networks. For example, Res16 [7], which Pratt et al. used to classify image patches with junctions [10], has over eleven million parameters. Also, pretraining provides a good initialization of the network parameters and facilitates training in the face of scarce labeled data.

Fig. 2. (a) A centerline probability map generated by the proposed method for the \(19^{th}\) image in the DRIVE test set. (b) Its junction probability map. (c) The map generated after multiplying eigenvalues calculated on the junction probability map in (b). (d)–(f) 3D visualization of the regions framed in red in (a)–(c).

2.2 Initial Junction Search

The junction probability maps produced by our method, shown in Figs. 2(b) and (e), exhibit significantly larger probabilities in the presence of junctions, but can also yield occasional weak responses along the centerlines of larger vessels. The central regions of junction patterns in these maps appear blob-like and can be detected by eigen-analysis of the Hessian [5] of the junction probability maps. Bright blob-like structures have two negative eigenvalues of similar magnitude, \(|\lambda_{1}| \simeq |\lambda_{2}|\) [5].
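A minimal sketch of this eigen-analysis follows, assuming a Gaussian-derivative Hessian at a single scale `sigma` (the scale used with [5] is not reported, so it is a free parameter here).

```python
import numpy as np
from scipy.ndimage import gaussian_filter

def hessian_eigenvalues(img, sigma=1.0):
    """Per-pixel eigenvalues of the Gaussian-smoothed Hessian of a 2D
    map, ordered so that |l1| >= |l2|."""
    Hrr = gaussian_filter(img, sigma, order=(2, 0))
    Hcc = gaussian_filter(img, sigma, order=(0, 2))
    Hrc = gaussian_filter(img, sigma, order=(1, 1))
    # closed-form eigenvalues of a symmetric 2x2 matrix
    root = np.sqrt(((Hrr - Hcc) / 2.0) ** 2 + Hrc ** 2)
    l1 = (Hrr + Hcc) / 2.0 + root
    l2 = (Hrr + Hcc) / 2.0 - root
    swap = np.abs(l2) > np.abs(l1)
    l1[swap], l2[swap] = l2[swap], l1[swap]
    return l1, l2  # bright blobs: both negative, similar magnitude
```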

Because the magnitude of junction probabilities can vary with the network's ability to recognize these structures, we normalize the eigenvalues locally to the range [0, 1], making the detection of junction centers more immune to variations in junction probability. Simply multiplying the normalized eigenvalues at each pixel is observed to produce larger values at junctions than along vessel centerlines, as demonstrated in Figs. 2(c) and (f). This map is thresholded to locate junctions, but yields some false positives; the following section describes how we eliminate them.
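Continuing the sketch above (it reuses `hessian_eigenvalues`), the local normalization and pixel-wise product might look as follows; the normalization window size is an assumption, while the 0.3 threshold is the value reported in Sect. 3.

```python
import numpy as np
from scipy.ndimage import minimum_filter, maximum_filter

def junction_candidates(jprob, win=15, thr=0.3):
    """Initial search: normalize both eigenvalue magnitudes of the
    junction probability map to [0, 1] within a local window, multiply
    them, and threshold. Returns a boolean map of candidate pixels."""
    l1, l2 = hessian_eigenvalues(jprob)
    score = np.ones_like(jprob)
    for lam in (np.abs(l1), np.abs(l2)):
        lo = minimum_filter(lam, size=win)
        hi = maximum_filter(lam, size=win)
        score *= (lam - lo) / np.maximum(hi - lo, 1e-12)
    score[(l1 > 0) | (l2 > 0)] = 0.0  # keep only bright blob responses
    return score > thr
```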

2.3 Refined Junction Search

We count branch numbers on our vessel centerline probability maps, shown in Figs. 2(a) and (d), to remove false positives from the initial search. In contrast to previous studies that count intersection numbers in vessel skeletons to identify junction locations [4], we use centerline probability maps to avoid the errors that further processing of segmented vessel maps can introduce; for example, skeletonization may produce false branches [4].

We place four circles centered at each potential junction obtained in the previous step, and determine the branch number as the most frequent intersection count over the circles, taking the largest count in case of ties (see Fig. 3 and the sketch below). Using multiple circles reduces the chance of detecting false branches in the centerline probability maps.
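A sketch of this branch counting follows, using the radii (3 to 6 pixels) and the 0.1 centerline threshold from Sect. 3; the sampling density and nearest-neighbour sampling are our assumptions.

```python
import numpy as np

def branch_count(cmap, cy, cx, radii=(3, 4, 5, 6), thr=0.1, n=64):
    """Count centerline crossings on circles around a candidate junction
    and return the most frequent count (largest count on ties)."""
    counts = []
    theta = np.linspace(0.0, 2.0 * np.pi, n, endpoint=False)
    for r in radii:
        ys = np.clip(np.round(cy + r * np.sin(theta)).astype(int),
                     0, cmap.shape[0] - 1)
        xs = np.clip(np.round(cx + r * np.cos(theta)).astype(int),
                     0, cmap.shape[1] - 1)
        on = cmap[ys, xs] > thr
        # number of off->on transitions along the closed circle
        counts.append(int(np.sum(on & ~np.roll(on, 1))))
    vals, freq = np.unique(counts, return_counts=True)
    return int(vals[freq == freq.max()].max())
```

A candidate would then be kept only if the returned count is at least three (a bifurcation has three branches, a crossing four), which is how the two-branch location in Fig. 3(a) is eliminated.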

Fig. 3. Two examples from the refined junction search: the falsely detected location with two branches in (a) is eliminated, while the location with three branches in (b) is kept despite lower centerline probabilities near the junction center.

3 Experimental Setup and Material

Parameter Settings: The proposed network consists of an input layer with 256 units, 5 hidden layers with 400 units each, and an output layer with 1024 units. Both the input image patch and each label patch are \(16 \times 16\) pixels, so the output layer holds the four concatenated label patches (\(4 \times 256 = 1024\) units).
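Under these settings the architecture can be written down directly; the sigmoid activations are our assumption (suggested by the RBM pretraining), but the layer sizes are as stated, and the resulting parameter count matches the "slightly over one million" figure from Sect. 2.1.

```python
import torch.nn as nn

# 256 inputs (16x16 image patch), five hidden layers of 400 units,
# 1024 outputs (four 16x16 label patches).
layers, sizes = [], [256, 400, 400, 400, 400, 400, 1024]
for i in range(len(sizes) - 1):
    layers += [nn.Linear(sizes[i], sizes[i + 1]), nn.Sigmoid()]
net = nn.Sequential(*layers)

n_params = sum(p.numel() for p in net.parameters())
print(n_params)  # 1,155,024 -- "slightly over one million"
```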

Network weights for pretraining were initialized by sampling from a normal distribution \(\mathcal{N}(0, 0.001)\). We pretrained for 50 epochs with a learning rate of 0.005, using a momentum of 0.5 for the first 5 epochs and 0.9 for the rest of pretraining, as recommended in [8]; we then fine-tuned the weights for 120 epochs with a learning rate of 0.08.
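For illustration, the fine-tuning stage might look as follows under the reported hyperparameters, reusing `net` and `multitask_l2_loss` from the sketches above; SGD and the hypothetical `loader` (which would yield the 100-sample mini-batches described below) are our assumptions.

```python
import torch

opt = torch.optim.SGD(net.parameters(), lr=0.08)  # optimizer choice assumed
for epoch in range(120):
    for x, y in loader:  # (batch, 256) image patches, (batch, 1024) labels
        opt.zero_grad()
        loss = multitask_l2_loss(net(x), y)
        loss.backward()
        opt.step()
```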

The parameters for junction detection (initial and refined search) were as follows: circle radii of 3 to 6 pixels, a threshold of 0.3 on the junction probability map, and a threshold of 0.1 on the centerline probability maps for counting branches.

Performance Evaluation: We used precision, recall and \(F_{1}\) score for performance evaluation, similar to [1, 3, 14], and accepted an estimated junction as correct if it lay within 5 pixels of an actual junction. These metrics are calculated over the entire set of junctions in a dataset as \(Recall= \frac{TP}{TP+FN}\), \(Precision= \frac{TP}{TP+FP}\) and \(F_{1}\,score= \frac{2\cdot Recall \cdot Precision}{Recall+ Precision}\), where TP, FN and FP respectively denote the numbers of correctly labeled, missed and mistakenly labeled junctions.
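A sketch of this evaluation under the 5-pixel tolerance follows; the greedy one-to-one matching of predictions to ground truth is our assumption, as only the tolerance is stated.

```python
import numpy as np
from scipy.spatial.distance import cdist

def junction_prf(pred_pts, gt_pts, tol=5.0):
    """Precision, recall and F1 for predicted vs. ground truth junction
    coordinates, matching pairs greedily by increasing distance."""
    pred = np.asarray(pred_pts, float).reshape(-1, 2)
    gt = np.asarray(gt_pts, float).reshape(-1, 2)
    d = cdist(pred, gt)
    matched_p, matched_g, tp = set(), set(), 0
    for flat in np.argsort(d, axis=None):  # all pairs, nearest first
        i, j = np.unravel_index(flat, d.shape)
        if d[i, j] > tol:
            break
        if i not in matched_p and j not in matched_g:
            matched_p.add(i); matched_g.add(j); tp += 1
    fp, fn = len(pred) - tp, len(gt) - tp
    rec = tp / (tp + fn) if tp + fn else 0.0
    prec = tp / (tp + fp) if tp + fp else 0.0
    f1 = 2 * rec * prec / (rec + prec) if rec + prec else 0.0
    return prec, rec, f1
```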

Material: We evaluated the performance of our method on two fundus image datasets captured with different imaging modalities. DRIVE [13] is the most popular dataset for evaluating retinal vessel segmentation methods; it contains 40 images with a resolution of \(768 \times 584\) pixels, acquired with a nonmydriatic CCD camera. IOSTAR [1] consists of 24 scanning laser ophthalmoscope (SLO) images. Both datasets are publicly available and have ground truth vessel maps and junction locations [1]. The number of junctions per image, including both bifurcations and crossings, is roughly 130 for DRIVE and 78 for IOSTAR.

Training Data Preparation: We obtained ground truth labels for vessel centerlines and edges by applying simple thinning and edge detection, respectively, to the ground truth vessel maps. We then generated junction pattern labels from the ground truth centerline maps: we placed \(5 \times 5\)-pixel masks centered at the ground truth junction locations and removed centerline labels outside these masks. The training set contains the last 20 images of DRIVE and the test set the first 20 images, following the split defined by the authors of [13]. We prepared 18,000 mini-batches, each containing 100 training samples. Because the network was trained only on DRIVE, no training set was needed for IOSTAR.
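A sketch of this label generation follows; using the morphological boundary as the "simple edge detection" is our assumption, while the \(5 \times 5\) masks follow the description above.

```python
import numpy as np
from skimage.morphology import skeletonize, binary_erosion

def make_labels(vessel_map, junctions, half=2):
    """Derive the four label maps from a binary ground truth vessel map.
    `junctions` holds (row, col) ground truth junction locations."""
    vm = vessel_map.astype(bool)
    interior = vm.astype(np.float32)
    centerline = skeletonize(vm).astype(np.float32)        # simple thinning
    edges = (vm & ~binary_erosion(vm)).astype(np.float32)  # boundary pixels
    junction = np.zeros_like(centerline)
    for r, c in junctions:  # keep centerline labels only inside 5x5 masks
        r0, c0 = max(r - half, 0), max(c - half, 0)
        junction[r0:r + half + 1, c0:c + half + 1] = \
            centerline[r0:r + half + 1, c0:c + half + 1]
    return interior, centerline, edges, junction
```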

4 Results

We evaluated the performance of the proposed method on the DRIVE test set and the complete IOSTAR set. Similar to Pratt et al. [10], we trained our network with only the DRIVE training set; the reason is that the number of ground truth junctions per image in IOSTAR is almost half that of DRIVE. We also assessed the performance of our method on ground truth vessel maps, to allow a fair comparison with previous methods that utilize such maps. Table 1 presents our findings and compares them with previously reported methods.

Table 1. The performance comparison for junction detection.

According to the table, recall at the initial search exceeds 0.84 for both datasets regardless of input type (ground truth or fundus image), indicating that the junction probability maps can capture almost \(90\%\) of the junctions in the ground truth data. However, precision at this stage is low because the eigen-analysis generates a large number of false positives. The refined search eliminates many of these, with precision increasing from 0.26 to 0.74 on IOSTAR when ground truth images are input, and from 0.29 to 0.65 on DRIVE when fundus images are input. Because this stage also eliminates some true positives, recall can drop for both input types; the drop is less noticeable when ground truth images are input.

Considering input types, the network performs better on ground truth vessel maps than on fundus images, particularly for IOSTAR, where the \(F_{1}\) score is 0.19 higher. However, we did not observe a significant performance difference between DRIVE and IOSTAR, which indicates that our method generalizes to unseen datasets even when they are captured with different imaging modalities.

Regarding the use of a deep network, the study most relevant to ours is Pratt et al.'s [10], where a deep network was trained on vessel skeletons produced from ground truth vessel maps as a classifier to identify image patches containing any type of junction; local maxima in the selected patches were then labeled as junction locations. Although their tolerance distance to actual junctions, 10 pixels, is twice ours, our recall and \(F_{1}\) score on DRIVE and all three metrics on IOSTAR are significantly larger than their reported values. Moreover, the performance we obtained on fundus images for the same metrics, particularly recall, remains better than their performance on both datasets.

Fig. 4. Example results of our junction detection on a fundus image from DRIVE; each image pair shows a fundus image patch (left) and its centerline probability map (right). Pink discs, green stars and yellow rectangles respectively mark correctly detected, mistakenly detected and missed junctions. See the text for an explanation of the red dashed, blue dotted and green solid rectangles.

BICROS [1] combined two approaches: one relying on orientation scores obtained from fundus images and the other finding branches on skeletons generated from segmented vessel maps. Because it uses different segmentation methods, its performance varies with how well the segmentation method suits the imaging modality [1]. Our approach, however, yielded better recall regardless of imaging modality and input image type.

The precision rates reported by Pratt et al. [10] and obtained with BICROS [1] appear larger than ours, particularly for DRIVE. However, our method is not designed to differentiate crossings from bifurcations, and errors can occur when crossing vessels are wide [4]. This situation can be observed inside the red dashed rectangle in Fig. 4(a), where a crossing is represented by two separate junctions. Unlabeled junctions in the ground truth can also reduce precision: the false positives inside the blue dotted rectangle in Fig. 4(a) appear to be good junction candidates. On the other hand, our method can miss a few junctions when they are not represented with sufficient probability in the centerline probability maps. The green solid square in Fig. 4(b) shows a junction missed by our method, though it is hard to see even with the naked eye.

5 Conclusion

Although deep networks have recently become the method of choice for medical image segmentation, their application is limited to tasks with large amounts of labeled data. To deal with the scarcity of labeled data for junction detection in fundus images, as a step prior to bifurcation and crossing classification, we propose a multi-task deep network that produces probability maps of vessel interior, centerline, edges and junctions. Because it learns several descriptions of the retinal vasculature, the network was found to indicate junction locations confidently. Potential junction locations suggested by the junction probability maps were then reassessed by simply counting branch numbers on the centerline probability maps.

We evaluated the performance of our network on the DRIVE test set and on IOSTAR after training it once on the DRIVE training set. The proposed approach outperformed previous approaches on the junction detection task; in particular, it performed better than the closely related method of Pratt et al. [10], which used a popular deep network to identify junctions. Our findings suggest that the proposed method can be applied to unseen retinal datasets, even with slightly different characteristics, without retraining.