Automated Diagnosis of Neural Foraminal Stenosis Using Synchronized Superpixels Representation

He, Xiaoxu; Yin, Yilong; Sharma, Manas; Brahm, Gary; Mercado, Ashley; Li, Shuo

doi:10.1007/978-3-319-46723-8_39

Xiaoxu He^18,19,
Yilong Yin²⁰,
Manas Sharma^18,19,
Gary Brahm^18,19,
Ashley Mercado^18,19 &
…
Shuo Li^18,19

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 9901))

Included in the following conference series:

International Conference on Medical Image Computing and Computer-Assisted Intervention

13k Accesses
2 Citations

Abstract

Neural foramina stenosis (NFS), as a common spine disease, affects $80\,\%$ of people. Clinical diagnosis by physicians’ manual segmentation is inefficient and laborious. Automated diagnosis is highly desirable but faces the class overlapping problem derived from the diverse shape and size. In this paper, a fully automated diagnosis approach is proposed for NFS. It is based on a newly proposed synchronized superpixels representation (SSR) model where a highly discriminative feature space is obtained for accurately and easily classifying neural foramina into normal and stenosed classes. To achieve it, class labels (0:normal,1:stenosed) are integrated to guide manifold alignment which correlates images from the same class, so that intra-class difference is reduced and the inter-class margin are maximized. The overall result reaches a high accuracy ($98.52\,\%$) in 110 mid-sagittal MR spine images collected from 110 subjects. Hence, with our approach, an efficient and accurate clinical tool is provided to greatly reduce the burden of physicians and ensure the timely treatment of NFS.

You have full access to this open access chapter, Download conference paper PDF

A neural network model for detection and classification of lumbar spinal stenosis on MRI

Article Open access 27 December 2023

A Supervised Approach Towards Segmentation of Clinical MRI for Automatic Lumbar Diagnosis

A Modified BiSeNet for Spinal Segmentation

Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

1 Introduction

Neural foramina stenosis (NFS) is known as a common result of disc degeneration due to age. For example, about $80\,\%$ of people suffer lower back pain caused by NFS [1, 2]. Existing clinical diagnosis by physicians’ manual segmentation is very inefficient and tedious. Automated diagnosis, which predicts class label (0:normal,1:stenosed) for a given neural foramina image, is highly desirable.

However, automated diagnosis is still challenging due to the difficulty in extracting very discriminative feature representation from extremely diverse neural foramina images [1]. This diversity leads to severe inter-class overlapping problem when classifying neural foramina images into normal or stenosed class(see Fig. 1(a)). Class overlapping problem is regarded as one of the toughest pervasive problems in classification [3–6], and severely affects the diagnosis accuracy of neural foramina. To solve it, a discriminant feature space maximizing the inter-class margin between normal and stenosed class is needed.

In this paper, a fully automated and reliable diagnosis framework is proposed for NFS. For reliable classification, it construct a new discriminative feature space (as shown in Fig. 1(d)) using a new synchronized superpixels representation (SSR) model (as shown in Fig. 1(b) and (c)). SSR model integrates class label into manifold approximation and alignment to obtain the joint decomposing, synchronizing, and clustering the spectral representation for neural foramina images from the same class. The obtained normal SSR and stenosed SSR are new superpixel representation synchronized for image from the same class. As the synchronization of SSR is merely performed for images from the same class so that images from different classes have unsynchronized superpixel representations which enlarge the inter-class difference. Hence, the constructed SSR space is highly discriminative due to the enlarged inter-class margin and reduced intra-class margin (as shown in Fig. 1(d)). With this discriminative space, any classifier, even the simple knn, could achieve superior performance in automated diagnosis of NFS. With our diagnosis framework, an automated and accurate clinical diagnosis tool is provided for NFS.

2 Spectral Graph, Spectral Bases, and Superpixels

There are three key concepts used in our framework:

Spectral graph $\mathcal {G} = (\mathcal {V},\mathcal {E})$ is a graph structure for the pairwise similarities among all pixels within an image [7, 8]. For an image I with total N pixels, we construct $\mathcal {G} = (\mathcal {V},\mathcal {E})$ where $\mathcal {V}(N = |\mathcal {V}|)$ is the pixel set and each edge $e \in \mathcal {E}$ connects two arbitrary pixels i, j in the image. Each e for i, j is weighted by W(i, j) determined by intensity, spatial location, and the contour interventions between two pixels:

$$\begin{aligned} W(i,j)=exp(-||x_i-x_j||^2/\delta _x-||I_i-I_j||^2/\delta _I-\underset{x \in line(i,j)}{\text {max}}||Edge(x)||^2/\delta _E) \end{aligned}$$

(1)

where $x_i, x_j$ are the location of the pixels i, j and the $I_i, I_j$ are their intensities respectively. Edge(x) represents an edge detector (i.e., Canny detector) in location x. $\delta _x, \delta _I, \delta _E$ are constants that will be assigned empirically. In practice, spectral matrix W will only be computed in k-nearest neighbors, thus W is a sparse matrix.

Spectral bases $U = [\xi _1(\mathcal {G}),..., \xi _N(\mathcal {G})]$ are the eigenvectors of spectral matrix W [8, 9]. In practice, they are decomposed from the graph Laplacian L instead of W:

$$\begin{aligned} L=Id-D^{-\frac{1}{2}}WD^{-\frac{1}{2}} \end{aligned}$$

(2)

where $Id=diag(1,1,...,1)$ is the identify matrix, D is the diagonal matrix whose elements are the row summations of W.

Superpixels are the clusters obtained from grouping images pixels based on spectral bases which approximate manifold of an image. They correspond to high level representation of an image, such as smooth and non-overlapping regions in the image.

3 Methodology

The overview of our diagnosis framework includes two phases (as shown in Fig. 2): (1) in training (Sect. 3.1), SSR space is constructed by label-supervised synchronization of spectral bases’ decomposition and clustering; (2) in testing (Sect. 3.2), the class label of an unlabeled localized neural foramina is predicted by searching its nearest neighbors in SSR space.

3.1 Training Phase

Given training set $\{\mathcal {I},\mathcal {C}\}$ include $M_1$ normal neural foramina images $\mathcal {I}_{nor}=\{I_m|C_m=0, I_m\in \mathcal {I}, C_m\in \mathcal {C}, m=1,...,M_1\}$, $M_2$ stenosed neural foramina images $\mathcal {I}_{ste}=\{I_m|C_m=1, I_m\in \mathcal {I}, C_m\in \mathcal {C}, m=1,...,M_2\}$, and the corresponding Laplacians set $\mathcal {L}=\{L_m, m=1,...,M\}$, where $M=M_1+M_2$, the construction of SSR space includes the following two steps:

Spectral Bases Synchronization: Synchronized spectral bases for normal and stenosed images are simultaneously obtained by the integration of class labels $\{C_m=C_l\}$ into Joint Laplacian Diagonalization with Fourier coupling [7]. For looking a set of synchronized bases $\{Y_i:Y_m^TY_m=Id\}_{m=1}^M$, $Y_m^TL_mY_m$ are approximately diagonal for $m=1,...,M$. To ensure that the bases from the same class behave consistently, the label-supervised coupling constraints [7] are introduced: given a vector $f^m$ on manifold of image $I_m$, and a corresponding vector $f^l$ on manifold of image $I_l$, if $C_m=C_l$, we require that their Fourier coefficients in the respective bases coincide, $Y_mf^m=Y_lf^l$. So, the label-supervised coupled diagonalization problem can be rewritten as

$$\begin{aligned} \underset{Y_1,...,Y_M}{\text {min}}\sum _{m\in I}||Y_m^TL_mY_m-\varLambda _m||_F^2+\mu \sum _{m,l\in \mathcal {I}, C_m=C_l}||F_m^TY_m-F_l^TY_l||_F^2 \end{aligned}$$

(3)

where $\varLambda _m=diag(\lambda _1,...,\lambda _K)$ denotes the diagonal matrix containing the first smallest eigenvalues of $L_m$. F is an arbitrary feature mapping that maps a spectral map to a fixed dimension feature vector. The optimal results $Y_1^*,...,Y_M^*$ can be classified as normal and stenosed synchronized spectral bases according to their class labels.

In practice, to resolve the ambiguity of $Y_m$ and simplify the optimization, the first K vectors of the synchronized spectral bases are approximated as a linear combination of the first smallest $K'\ge K$ eigenvectors of $L_m$, denoted by $U_m = [\xi _1(\mathcal {G}_m),..., \xi _K(\mathcal {G}_m)]$. We parameterize the synchronized spectral base $Y_m$ as $Y_m=U_mA_m$, where $A_m$ is the $K'\times K$ matrix of linear combination coefficients. From the orthogonality of $Y_m$, it follows that $A_m^TA_m=Id$. Plugging this subspace parametrization into Eq. (3), where $\tilde{\varLambda _m}$ is the diagonal matrix containing the first $K'$ eigenvalues of $L_m$:

$$\begin{aligned} \begin{aligned}&\underset{A_1,...,A_M}{\text {min}}\quad \sum _{m\in \mathcal {I}}||A_m^T\varLambda _mA_m-\tilde{\varLambda _m}||_F^2+\mu \sum _{m,l\in \mathcal {I} , C_m=C_l}||F_m^TU_mA_m-F_l^TU_lA_l||_F^2\\&\text {s.t} \quad A_m^TA_m=Id, (m=1,...,M) \end{aligned} \end{aligned}$$

(4)

The solution of problem Eq. (4) can be carried out using standard constrained optimization techniques. As the label-supervised Coupled Diagonalization of Laplacians enables the approximation and alignment of manifolds for images from the same class, the obtained synchronized spectral bases approximate manifold of each image and align them for images from the same class.

Superpixels Synchronization: Normal SSR and stenosed SSR are respectively achieved by grouping all images pixels with the corresponding synchronized spectral bases. As the obtained spectral bases are automatically synchronized for images from the same class, so the obtained normal SSR and stenosed SSR simultaneously minimize the intra-class difference. Correspondingly, the unsynchronized spectral bases for images from different classes enables the obviously different superpixel representations to maximize the inter-class margin. Hence, the obtained SSR space provides a new discriminate ability for reliable diagnosis even using a simple classifier.

3.2 Testing Phase

In testing, unlabeled neural foramina $I_w$ is first localized by a trained SVM subwindow localization classifier implemented by method introduced in [10], then its class label is predicted by finding the nearest neighbors in SSR space.

Incremental Spectral Bases Synchronization: For unlabeled neural foramina $I_w$, incremental synchronization is proposed to obtain its mapping point $Y_w$ in SSR space:

$$\begin{aligned} cost_{ste}=\underset{Y_w}{\text {min}}\quad ||Y_w^TL_wY_w-\varLambda _w||_F^2+\mu \sum _{C_m=1}||F_m^TY_m-F_w^TY_w||_F^2 \end{aligned}$$

(5)

$$\begin{aligned} cost_{nor}=\underset{Y_w}{\text {min}}\quad ||Y_w^TL_wY_w-\varLambda _w||_F^2+\mu \sum _{C_m=0}||F_m^TY_m-F_w^TY_w||_F^2 \end{aligned}$$

(6)

where $L_w$ is the Laplacian matrix of $I_w$, $\{Y_m|C_m=1, m=1,...,M_2\}$ are the learned stenosed synchronized spectral bases, $\{Y_m|C_m=0, m=1,...,M_1\}$ are the learned normal synchronized spectral bases, $cost_{ste}$ and $cost_{nor}$ denote the mapping cost loss. Incremental spectral bases synchronization maps $I_w$ into SSR space.

Diagnosis: The class label of $I_w$ is predicted by comparing the computed cost loss $cost_{ste}$ and $cost_{nor}$. For example, if $cost_{nor}$ is smaller, its approximated manifold $Y_w$ is more similar to images from normal class and the mapping point of $I_w$ in SSR space is in normal class. Hence, the class label of $I_w$ is naturally predicted by the minimal mapping cost loss:

$$\begin{aligned} f(I_w)={\left\{ \begin{array}{ll} 1(stenosed), &{} \text {if }cost_{ste}<cost_{nor},\\ 0(normal), &{} \text {otherwise}. \end{array}\right. } \end{aligned}$$

(7)

Table 1. Performance of the proposed framework

Full size table

4 Experiments and Results

4.1 Experiment Setup

Following the clinical standard, our experiments are tested on 110 mid-sagittal MR lumbar spine images collected from 110 subjects including healthy cases and patients with NFS. These collected MR scans are scanned using a sagittal T1 weight MRI with repetition time (TR) of 533 ms and echo time (TE) of 17 ms under a magnetic field of 1.5 T. The training sets includes two types: (1) NF images and non-NF images used to train SVM-classifier localization; (2) normal NF images and stenosed NF images used to train SSR model. These training images were manually cropped and labeled by physician according to the clinical commonly diagnosis criterion [1]. The classification accuracy, specificity, and sensitivity are reported in the average from ten runs of leave-one-subject-out cross-validation.

Table 2. The highest accuracy is achieved by SSR in three typical classifier comparing other existing five popular features.

Full size table

4.2 Results

The higher accuracy achieved by the proposed framework both in localization ($99.27\,\%$) and classification ($98.52\,\%$) are shown in Table 1. Besides, its robustness in localizing and diagnosing neural foramina with different appearance, shape, and orientation is qualitatively displayed in Fig. 3. This high accuracy and robustness is derived from the intrinsic class separation captured by our framework. Hence, an accurate and efficient diagnosis of NFS is obtained regardless of the disturbance from appearance, shape, and orientation.

Table 2 demonstrates that SSR achieved highest accuracy (>95 %) than other five classical features (<82 %) in three typical classifiers: k nearest neighbors (KNN) [5], linear discriminant analysis (LDA) [4], and support vector machine (SVM) [3]. The superiority of SSR is from its constructed discriminative feature space where images from the same class are correlated by the synchronized superpixel representation while images from different classes are separated by the unsynchronized superpixel representation. This brings a good class separation for solving the challenging class overlapping problem in automated diagnosis of NFS (see Fig. 4), which leads to the low accuracy of the other five image feature methods. In addition, such discriminative ability is so powerful that it enables the simple classifier like KNN still achieve a higher accuracy. Hence, SSR provides a reliable diagnosis framework for NFS, and can replace conventional image representation methods to be fed into other state-of-the-art classifiers for improving their accuracy and learning performance.

5 Conclusions

In this paper, we propose a novel automated diagnosis for NFS, with a new SSR model to generate discriminative feature space for reliable diagnosis. With SSR space, the class overlapping problem is overcome, and the high diagnosis accuracy of the proposed framework is achieved even using a simple KNN classifier. Hence, an efficient and reliable diagnosis tool is obtained to reduce the workload of radiologists and provide timely treatment of NFS.

References

Lee, S., Lee, J.W., Yeom, J.S., Kim, K.-J., Kim, H.-J., Chung, S.K., Kang, H.S.: A practical mri grading system for lumbar foraminal stenosis. Am. J. Roentgenol. 194(4), 1095–1098 (2010)
Article Google Scholar
Rajaee, S.S., Bae, H.W., Kanim, L.E., Delamarter, R.B.: Spinal fusion in the united states: analysis of trends from 1998 to 2008. Spine 37(1), 67–76 (2012)
Article Google Scholar
Guyon, I., Weston, J., Barnhill, S., Vapnik, V.: Gene selection for cancer classification using support vector machines. Mach. Learn. 46(1–3), 389–422 (2002)
Article MATH Google Scholar
Ye, J., Janardan, R., Li, Q.: Two-dimensional linear discriminant analysis. In: Advances in Neural Information Processing Systems, pp. 1569–1576 (2004)
Google Scholar
Tan, S.: Neighbor-weighted k-nearest neighbor for unbalanced text corpus. Expert Syst. Appl. 28(4), 667–671 (2005)
Article Google Scholar
Tang, J., Alelyani, S., Liu, H.: Feature selection for classification: a review. In: Data Classification: Algorithms and Applications, p. 37 (2014)
Google Scholar
Eynard, D., Kovnatsky, A., Bronstein, M.M., Glashoff, K., Bronstein, A.M.: Multimodal manifold analysis by simultaneous diagonalization of laplacians. IEEE Trans. Pattern Anal. Mach. Intell. 37(12), 2505–2517 (2015)
Article Google Scholar
Cai, Y., Ali, I., Mousumi, B., Ian, C., Shuo, L.: Unsupervised freeview groupwise m3 cardiac images segmentation using synchronized spectral networks. In: MICCAI (2015)
Google Scholar
Wang, C., Mahadevan, S.: Manifold alignment preserving global geometry. In: IJCAI (2013)
Google Scholar
Nguyen, M.H., Torresani, L., de la Torre, F., Rother, C.: Weakly supervised discriminative localization and classification: a joint learning process. In: 2009 IEEE 12th International Conference on Computer Vision, pp. 1925–1932. IEEE (2009)
Google Scholar

Download references

Acknowledgment

This work was supported by the NSFC Joint Fund with Guangdong under Key Project (Grant No. U1201258).

Author information

Authors and Affiliations

Digital Imaging Group (DIG), London, Ontario, Canada
Xiaoxu He, Manas Sharma, Gary Brahm, Ashley Mercado & Shuo Li
The University of Western Ontario, London, Ontario, Canada
Xiaoxu He, Manas Sharma, Gary Brahm, Ashley Mercado & Shuo Li
Shandong University, Jinan, Shandong, China
Yilong Yin

Authors

Xiaoxu He
View author publications
You can also search for this author in PubMed Google Scholar
Yilong Yin
View author publications
You can also search for this author in PubMed Google Scholar
Manas Sharma
View author publications
You can also search for this author in PubMed Google Scholar
Gary Brahm
View author publications
You can also search for this author in PubMed Google Scholar
Ashley Mercado
View author publications
You can also search for this author in PubMed Google Scholar
Shuo Li
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Xiaoxu He .

Editor information

Editors and Affiliations

University College London , London, United Kingdom
Sebastien Ourselin
The Hebrew University of Jerusalem , Jerusalem, Israel
Leo Joskowicz
Harvard Medical School , Boston, Massachusetts, USA
Mert R. Sabuncu
Istanbul Technical University , Istanbul, Turkey
Gozde Unal
Harvard Medical School and Brigham and Women's Hospital, Boston, Massachusetts, USA
William Wells

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

He, X., Yin, Y., Sharma, M., Brahm, G., Mercado, A., Li, S. (2016). Automated Diagnosis of Neural Foraminal Stenosis Using Synchronized Superpixels Representation. In: Ourselin, S., Joskowicz, L., Sabuncu, M., Unal, G., Wells, W. (eds) Medical Image Computing and Computer-Assisted Intervention – MICCAI 2016. MICCAI 2016. Lecture Notes in Computer Science(), vol 9901. Springer, Cham. https://doi.org/10.1007/978-3-319-46723-8_39

Download citation

DOI: https://doi.org/10.1007/978-3-319-46723-8_39
Published: 02 October 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-46722-1
Online ISBN: 978-3-319-46723-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Societies and partnerships

The Medical Image Computing and Computer Assisted Intervention Society (opens in a new tab)