Abstract
In this paper, we introduce a fast alternating method for reconstructing highly undersampled dynamic MRI data using 3D convolutional sparse coding. The proposed solution leverages the Fourier convolution theorem to accelerate the learning of a set of 3D filters and iteratively refines the MRI reconstruction based on the resulting sparse codes. In contrast to conventional CS methods, which exploit sparsity by applying universal transforms such as wavelets and total variation, our approach extracts and adapts the temporal information directly from the MRI data using compact shift-invariant 3D filters. We provide a highly parallel algorithm with GPU support for efficient computation; as a result, the reconstruction outperforms CPU implementations of state-of-the-art dictionary learning-based approaches by up to two orders of magnitude.
1 Introduction
Dynamic cardiac MRI is considered the gold standard among imaging modalities for diagnosing heart function. However, due to its long acquisition time, its clinical application has been limited to non-time-critical cases. Recent advances in Compressed Sensing (CS) have been successfully applied to MRI [8] to reduce acquisition time. Nevertheless, CS-MRI poses a new challenge: the reconstruction time increases because an in-painting inverse problem must be solved in the frequency domain (i.e., k-space). Therefore, accelerating the reconstruction process is a top priority for adopting the CS framework in fast MRI diagnosis.
Conventional CS-MRI reconstruction methods have exploited the sparsity of the signal by applying universal sparsifying transforms such as Fourier-type transforms (e.g., the discrete Fourier transform (DFT) or discrete cosine transform (DCT)), Total Variation (TV), and wavelets (e.g., Haar, Daubechies, etc.). This research direction has focused on accelerating the sparsity-based energy minimization problem, with [10] or without [7] hardware support. Some strategies were designed to accelerate the minimization process, such as using TV plus a nuclear norm [14] or formulating the solver in another sparsity domain, such as low-rank techniques [9, 12]. More recently, approaches leveraging a state-of-the-art data-driven method, i.e., dictionary learning (DL) [2], have been proposed to further enhance the reconstruction quality [3, 5, 6, 11]. However, the existing learning-based methods suffer from the drawbacks of patch-based dictionaries (i.e., redundant atoms and longer running times).
Convolutional sparse coding (CSC) is a new learning-based sparse representation that approximates the input signal with a superposition of sparse feature maps convolved with a collection of filters. This technique replaces the patch-based dictionary learning process with an energy minimization process that uses a convolution operator in the image domain, which becomes an element-wise multiplication in the frequency domain. The formulation was derived within the Alternating Direction Method of Multipliers (ADMM) framework [4], and its direct inverse problem was later introduced by Wohlberg [13]. CSC can generate much more compact dictionaries owing to the shift-invariant nature of its filters, and the pixel-wise computation in the Fourier domain maps well to parallel architectures. However, such advanced machine learning approaches have not yet been fully exploited in the CS-MRI literature. Therefore, in this paper, we propose a novel CS dynamic MRI reconstruction method that exploits the compactness and efficiency of 3D CSC. The proposed 3D CSC directly encodes both spatial and temporal features from dynamic cardiac 2D MRI using a compact set of 3D atoms (i.e., filters), without regularizers enforcing temporal coherence (e.g., total variation along the time axis). We also show that the proposed method maps well to data-parallel architectures, such as GPUs, which reduces its running time significantly: up to two orders of magnitude faster than the state-of-the-art CPU implementation of CS-MRI using patch-based dictionary learning. To the best of our knowledge, this is the first CS-MRI reconstruction method based on GPU-accelerated 3D CSC.
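The property that CSC relies on, namely that a (circular) convolution in the image domain equals an element-wise product in the Fourier domain, can be illustrated with a minimal NumPy sketch. The function name `synthesize` and the array layout (filters and codes stacked along the first axis, real-valued data) are assumptions made for illustration and are not taken from the paper's MATLAB code.

```python
import numpy as np

def synthesize(filters, codes):
    """Compute sum_k d_k * x_k with circular convolution via the FFT.

    filters: sequence of K small arrays (hypothetical layout), each with
             the same number of dimensions as one code map.
    codes:   array of shape (K, ...) holding the K sparse code maps.
    """
    shape = codes.shape[1:]
    axes = tuple(range(len(shape)))
    total = np.zeros(shape, dtype=complex)
    for d_k, x_k in zip(filters, codes):
        # Zero-pad the compact filter to the signal size, then transform.
        d_f = np.fft.fftn(d_k, s=shape, axes=axes)
        x_f = np.fft.fftn(x_k, axes=axes)
        # Convolution theorem: element-wise product in the Fourier domain.
        total += d_f * x_f
    # Real-valued inputs are assumed here; drop np.real for complex data.
    return np.real(np.fft.ifftn(total, axes=axes))
```

Because every frequency is processed independently, the loop body consists only of FFTs and element-wise products, which is the structure that makes the approach attractive for data-parallel hardware.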
2 Method
Figure 1 is a pictorial description of the proposed method. If the inverse Fourier transform is directly applied to undersampled MRI k-space data (Fig. 1a, \(\times 4\) undersampling), the reconstructed images will suffer from artifacts (Fig. 1b). This zero-filling reconstruction serves as the initial guess for our iterative reconstruction process with randomly initialized filters, e.g., a collection of 16 atoms of size \(9\times 9\times 9\) as shown in Fig. 1d. The image and filters are then iteratively updated until they converge, as shown in Fig. 1c, e and f.
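The zero-filling initialization described above can be sketched as follows. The sampling pattern is an assumption: the text only states the \(\times 4\) undersampling factor, so the hypothetical helper `random_line_mask` simply keeps a random quarter of the k-space rows in each frame. The array layout (time, height, width) is also assumed.

```python
import numpy as np

def random_line_mask(shape, factor, rng):
    """Binary mask keeping h // factor random k-space rows per frame.

    Hypothetical sampling pattern; the paper does not specify one.
    """
    t, h, w = shape
    mask = np.zeros(shape)
    for frame in range(t):
        rows = rng.choice(h, size=h // factor, replace=False)
        mask[frame, rows, :] = 1.0
    return mask

def undersample(s, mask):
    """Simulate the measurement m = R F_2 s (kept zero-filled on the full grid)."""
    return mask * np.fft.fft2(s, axes=(-2, -1))

def zero_filled_recon(m, mask):
    """Initial guess: inverse 2D FFT of the zero-filled k-space."""
    return np.fft.ifft2(mask * m, axes=(-2, -1))
```

With a mask of all ones the reconstruction is exact; with a \(\times 4\) mask the missing rows are treated as zeros, which produces the aliasing artifacts the iterative method is meant to remove.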
The proposed CS-MRI reconstruction algorithm is a process of finding s (i.e., a stack of 2D MR images for a given time duration) in the energy minimization problem defined as follows:
where \(d_k\) is the k-th filter (or atom in the dictionary) and \(x_k\) is its corresponding sparse code for s. In Eq. (1), the first term measures the difference between s and its sparse approximation \(\sum _{k} d_k * x_k\), weighted by \(\alpha \). The second term is the sparsity regularization of \(x_k\) using an \(\ell _1\) norm with a weight \(\lambda \), instead of the \(\ell _0\) norm used in [2, 5, 6]. The rest of the equation is the collection of constraints: the first enforces consistency between the undersampled measurement m and the undersampled reconstructed image using the mask R and the Fourier operator \(\mathcal {F}\), and the second restricts the Frobenius norm of each atom \(d_k\) to at most unit length. In the following discussion, we use a simplified notation without the indices k and denote the Fourier transform of a variable by the subscript f (for example, \(d_{f}\) denotes \(\mathcal {F}d\) in the 3D domain and \(s_{f_2}\) denotes \(\mathcal {F}_2s\) in the 2D spatial domain) to derive the solution of Eq. (1). Problem (1) can then be rewritten using auxiliary variables y and g for x and d as follows:
where g and d are related by a projection operator, a combination of a truncation matrix followed by a zero-padding matrix, which makes the dimension of g the same as that of x. Since we will leverage the Fourier transform to solve this problem, g should be zero-padded to make its size the same as that of \(g_f\) and \(x_f\). The above constrained problem can be recast in an unconstrained form with dual variables u and h, where the measurement consistency and the dual differences are further weighted by \(\gamma \), \(\rho \), and \(\sigma \), respectively:
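Read together, the term-by-term descriptions of problems (1) to (3) are consistent with the forms sketched below. This is a reconstruction under stated assumptions rather than a verbatim restatement: the factors of \(\tfrac{1}{2}\), the tolerance \(\varepsilon \) in the data-consistency constraint, the name \(\mathrm {Proj}\) for the truncate-and-pad operator, and the scaled form of the dual terms are all assumed. In (2) and (3), the unindexed symbols y, g, u and h denote the stacked collections over k.

\[
\begin{aligned}
&\min _{d,x,s}\ \frac{\alpha }{2}\Big \Vert s-\sum _k d_k * x_k\Big \Vert _2^2+\lambda \sum _k \Vert x_k\Vert _1
\quad \text {s.t.}\quad \Vert R\mathcal {F}_2 s-m\Vert _2^2<\varepsilon ^2,\ \ \Vert d_k\Vert _2^2\le 1 &&(1)\\
&\min _{d,x,g,y,s}\ \frac{\alpha }{2}\Big \Vert s-\sum _k d_k * x_k\Big \Vert _2^2+\lambda \Vert y\Vert _1
\quad \text {s.t.}\quad x-y=0,\ \ \Vert R\mathcal {F}_2 s-m\Vert _2^2<\varepsilon ^2,\ \ g=\mathrm {Proj}(d),\ \ \Vert g\Vert _2^2\le 1 &&(2)\\
&\min _{d,x,g,y,s}\ \frac{\alpha }{2}\Big \Vert s-\sum _k d_k * x_k\Big \Vert _2^2+\lambda \Vert y\Vert _1
+\frac{\gamma }{2}\Vert R\mathcal {F}_2 s-m\Vert _2^2+\frac{\rho }{2}\Vert x-y+u\Vert _2^2+\frac{\sigma }{2}\Vert d-g+h\Vert _2^2 &&(3)
\end{aligned}
\]

In this form each variable appears in at most two quadratic terms (plus the \(\ell _1\) term for y), which is what allows the alternating scheme to split (3) into small independent subproblems.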
Then we can solve problem (3) by iteratively finding the solution of independent smaller problems, as described below:
Solve for x:
If we apply the Fourier transform to (4), it becomes:
Then the minimum solution of (5) can be found by taking the derivative of (5) with respect to \(x_f\) and setting it to zero as follows:
Note that \(D_f\) denotes the concatenation of all diagonalized matrices \(d_{fk}\), i.e., \(D_f = [\mathrm {diag}(d_{f1}), \ldots , \mathrm {diag}(d_{fK})]\) for K filters, and \(D_f^H\) is the complex conjugate transpose of \(D_f\).
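As a hedged illustration of the x step, the sketch below assumes that setting the derivative to zero yields the linear system \((\alpha D_f^H D_f + \rho I)\,x_f = \alpha D_f^H s_f + \rho \,(y_f - u_f)\), which is the standard result for a quadratic data term plus a scaled-form ADMM penalty. Because \(D_f\) is a row of diagonal blocks, the system decouples per frequency into a rank-one-plus-identity problem of size K, which the Sherman-Morrison formula solves without forming any matrix. The function name and array layout (filter index on the first axis) are assumptions.

```python
import numpy as np

def solve_x_fourier(d_f, s_f, z_f, alpha, rho):
    """Solve (alpha * a^H a + rho * I) x = alpha * a^H s + rho * z per frequency.

    d_f: (K, ...) filter spectra; at each frequency, a = [d_f1, ..., d_fK].
    s_f: (...)    spectrum of the current image estimate.
    z_f: (K, ...) spectrum of y - u (assumed scaled dual form).
    """
    # Right-hand side b = alpha * a^H s + rho * z.
    b = alpha * np.conj(d_f) * s_f + rho * z_f
    # Scalars needed by Sherman-Morrison: a b and a a^H at every frequency.
    ab = np.sum(d_f * b, axis=0)
    aa = np.sum(np.abs(d_f) ** 2, axis=0)
    # (rho I + alpha a^H a)^{-1} b = (b - a^H * alpha * (a b) / (rho + alpha * a a^H)) / rho
    return (b - np.conj(d_f) * (alpha * ab / (rho + alpha * aa))) / rho
```

The cost is linear in the number of filters and voxels, and every frequency is independent, so the same expression can be evaluated element-wise on a GPU array.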
Solve for y:
The solution y of the \(\ell _1\) minimization problem can be found using a shrinkage (soft-thresholding) operation:
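A minimal sketch of the shrinkage operation follows. It assumes the usual ADMM result that y is the soft-thresholding of \(x + u\) with threshold \(\lambda / \rho \); the threshold value and the function names are assumptions, not quotations from the paper.

```python
import numpy as np

def soft_threshold(v, tau):
    """Shrink the magnitude of each entry of v by tau (works for real or complex v)."""
    mag = np.abs(v)
    # Guard against division by zero for entries that are exactly zero.
    scale = np.maximum(mag - tau, 0.0) / np.maximum(mag, np.finfo(float).tiny)
    return v * scale

def update_y(x, u, lam, rho):
    """Assumed y step: y = S_{lam/rho}(x + u)."""
    return soft_threshold(x + u, lam / rho)
```

With the parameter values reported in the implementation details (\(\lambda = 0.1\), \(\rho = 10\)), this assumed threshold would be 0.01, so only code entries with magnitude above that value survive each iteration.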
Update for u: The update rule for u can be defined as a fixed-point iteration on the difference between x and y (u converges when x and y converge to each other) as follows:
Solve for d:
Similar to x, d can be solved in the Fourier domain:
where \(X_f\) denotes the concatenation of all diagonalized matrices \(x_{fk}\), i.e., \(X_f = [\mathrm {diag}(x_{f1}), \ldots , \mathrm {diag}(x_{fK})]\) for K sparse codes, and \(X_f^H\) is the complex conjugate transpose of \(X_f\).
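By symmetry with the x step, a plausible form of the d step can be written out. The sketch assumes the subproblem \(\min _d \frac{\alpha }{2}\Vert \sum _k d_k * x_k - s\Vert _2^2 + \frac{\sigma }{2}\Vert d - g + h\Vert _2^2\) with h in scaled dual form; under that assumption, transforming to the Fourier domain and setting the derivative with respect to \(d_f\) to zero gives:

\[
\big (\alpha X_f^H X_f+\sigma I\big )\,d_f=\alpha X_f^H s_f+\sigma \,(g_f-h_f)
\]

As in the x step, \(X_f^H X_f\) is rank one at each frequency, so the system can be solved independently per frequency.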
Solve for g:
g can be found by taking the inverse Fourier transform of \(d_f\). The projection is then applied by suppressing the elements that lie outside the support of the filter \(d_k\) and normalizing the result so that its \(\ell _2\)-norm has unit length.
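The projection can be sketched in a few lines of NumPy. The placement of the filter support at the array origin and the function name `project_filter` are assumptions for illustration; the text only states that entries outside the filter size are suppressed and the result is normalized.

```python
import numpy as np

def project_filter(d_f, support):
    """Inverse FFT, zero everything outside the filter support, normalize to unit l2 norm.

    d_f:     spectrum of one zero-padded filter (same size as the signal).
    support: filter size, e.g. (9, 9, 9); assumed to sit at the array origin.
    """
    d = np.fft.ifftn(d_f)
    g = np.zeros_like(d)
    window = tuple(slice(0, n) for n in support)
    g[window] = d[window]          # truncate, then implicitly zero-pad
    norm = np.linalg.norm(g)       # l2 norm over all entries
    if norm > 0:
        g = g / norm
    return g
```

The output keeps the full signal size, so its spectrum \(g_f\) can be used directly in the next Fourier-domain update.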
Update for h: Similar to u, the update rule for h can be defined as follows:
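Under the scaled-form ADMM convention (an assumption, since the paper's own expressions are not reproduced here), the two dual updates take the familiar running-sum form, where the superscript denotes the iteration:

\[
u^{(i+1)}=u^{(i)}+x^{(i+1)}-y^{(i+1)},\qquad h^{(i+1)}=h^{(i)}+d^{(i+1)}-g^{(i+1)}
\]

Each dual variable accumulates the residual of its constraint, so it stops changing exactly when \(x = y\) and \(d = g\).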
Solve for s:
The objective function of (15) can be transformed into 2D Fourier domain:
Since \(d_f\) and \(x_f\) were obtained previously in the 3D Fourier domain, we need to bring them into the same space by applying an inverse Fourier transform along the time axis, \(\mathcal {F}_t^H\). Then \(s_{f_2}\) can be found by solving the following linear system:
Note that efficient solutions of (6), (12) and (17) can be computed via the Sherman-Morrison formula for independent linear systems, as shown in [13]. Finally, after the iterations, s is obtained by applying a 2D inverse Fourier transform \(\mathcal {F}_2^H\) to \(s_{f_2}\).
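A hedged sketch of the s step is given below. It assumes the linear system has the form \((\gamma R^H R + \alpha I)\,s_{f_2} = \gamma R^H m + \alpha \,\mathcal {F}_t^H (D_f x_f)\) and that R is a binary sampling mask, in which case the system is diagonal and can be solved element-wise. The array layout (time on the first axis) and the function name are assumptions.

```python
import numpy as np

def update_s(recon_f3, m, mask, alpha, gamma):
    """Assumed s step for a binary mask R.

    recon_f3: sum_k d_fk * x_fk in the 3D Fourier domain, shape (T, H, W).
    m:        zero-filled undersampled k-space measurement, shape (T, H, W).
    mask:     binary sampling mask R on the same grid.
    """
    # F_t^H: undo the transform along time so only the 2D spatial transform remains.
    b = np.fft.ifft(recon_f3, axis=0)
    # Diagonal system: R^H R = mask and R^H m = mask * m for a binary mask.
    s_f2 = (gamma * mask * m + alpha * b) / (gamma * mask + alpha)
    # Back to the image domain with the 2D inverse transform F_2^H.
    return np.fft.ifft2(s_f2, axes=(1, 2))
```

At unsampled k-space locations the result is taken entirely from the sparse approximation, while at sampled locations it is a weighted average of the measurement and the approximation with weights \(\gamma \) and \(\alpha \).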
Implementation Details: Since the above derivation consists only of Fourier transforms and element-wise operations, it maps well to data-parallel architectures, such as GPUs. We implemented the proposed method in MATLAB with GPU support. We set \(\alpha = 1\), \(\gamma = 1\), \(\lambda = 0.1\), \(\rho = 10\), \(\sigma = 10\) and keep refining the filter banks as well as the reconstruction iteratively until they converge.
3 Result
In order to assess the performance of the proposed method, we compared our algorithm with the state-of-the-art dictionary learning-based CS reconstruction from Caballero et al. [5] and the conventional CS reconstruction using wavelet and total variation energy from Quan et al. [10]. We used three cardiac MRI datasets from the Data Science Bowl [1]: 2-chamber view (2ch), 4-chamber view (4ch), and short-axis view (sax). Each dataset consists of 30 frames of \(256\times 256\) images across the cardiac cycle. In the experiment, we used 3D atoms of size \(9\times 9\times 9\), and the CS undersampling factor was set to \(\times 4\).
Running Time Evaluation: To make a direct performance comparison between the two learning-based methods, the proposed one and that of Caballero et al. [5], we measured the wall-clock running time of both on a PC equipped with an Intel i7 CPU, 16 GB of main memory, and an NVIDIA GeForce GTX Titan X GPU. Our prototype code, including the GPU implementation, is written in MATLAB 2015b, and we used the author-provided MATLAB code for Caballero et al. [5]. As shown in Table 1, our CPU-based method is about 54\(\times \) to 73\(\times \) faster, or about two orders of magnitude, than the state-of-the-art DL-based CS-MRI reconstruction method for 100 epochs (i.e., the number of learning iterations). In addition, our GPU-accelerated implementation outperforms the CPU version by about 1.25\(\times \) to 3.82\(\times \), which brings the running time closer to a level suitable for clinical application. We expect that the performance of our method can be improved further by using CUDA C/C++ instead of MATLAB.
Quality Evaluation: Figure 2 visualizes the reconstruction errors of each method relative to the full reconstruction. As can be seen, our approach generated less error than the state-of-the-art method of [5] and the conventional CS reconstruction using wavelet and TV energy [10]. Glitches are clearly visible in the temporal profiles of those methods, since total variation along the time axis may smooth out temporal features that move quickly, especially near the heart boundary. In our case, the learned atoms are 3D with larger supports, which helps to capture temporal traits better even under fast motion and reduces errors in the reconstructed images. In addition, the shift-invariance of CSC helps to generate more compact filters than the patch-based method.
Figure 3 shows the peak signal-to-noise ratios (PSNRs) measured between the CS-reconstruction results and the full reconstruction. As shown in this figure, our method requires more iterations (epochs) to converge to the steady state, but its actual running time is much shorter than that of the others due to GPU acceleration. At the same time, our method reaches much higher PSNRs.
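For reference, a PSNR computation of the kind used in such comparisons can be sketched as follows. The exact definition used for Figure 3 is not stated, so the choices here (magnitude images, peak taken as the maximum of the fully sampled reference, result in dB) are assumptions.

```python
import numpy as np

def psnr(reference, estimate):
    """Peak signal-to-noise ratio in dB between magnitude images."""
    ref = np.abs(reference)
    est = np.abs(estimate)
    mse = np.mean((ref - est) ** 2)
    if mse == 0:
        return float("inf")
    peak = ref.max()  # assumed peak: maximum of the reference image
    return 10.0 * np.log10(peak ** 2 / mse)
```

Evaluating this function after every epoch on the current reconstruction s gives a convergence curve of PSNR against iteration count.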
4 Conclusion
In this paper, we introduced an efficient CS-MRI reconstruction method based on pure 3D convolutional sparse coding, in which shift-invariant 3D filters represent the temporal features of the MRI data. The proposed numerical solver is derived under the ADMM framework by leveraging the Fourier convolution theorem, and it can be effectively accelerated using GPUs. As a result, we achieved faster running times and higher PSNRs than state-of-the-art CS-MRI reconstruction methods, such as those using patch-based dictionary learning or conventional wavelet and total variation energy. In the future, we plan to conduct a controlled study of the tuning parameters and assess the feasibility of the method in clinical applications.
References
1. Data science bowl cardiac challenge data (2015). https://www.kaggle.com/c/second-annual-data-science-bowl/data
2. Aharon, M., Elad, M., Bruckstein, A.: K-SVD: an algorithm for designing overcomplete dictionaries for sparse representation. IEEE Trans. Signal Process. 54(11), 4311–4322 (2006)
3. Awate, S., DiBella, E.: Spatiotemporal dictionary learning for undersampled dynamic MRI reconstruction via joint frame-based and dictionary-based sparsity. In: 2012 9th IEEE International Symposium on Biomedical Imaging (ISBI), pp. 318–321, May 2012
4. Bristow, H., Eriksson, A., Lucey, S.: Fast convolutional sparse coding. In: 2013 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 391–398, June 2013
5. Caballero, J., Price, A., Rueckert, D., Hajnal, J.: Dictionary learning and time sparsity for dynamic MR data reconstruction. IEEE Trans. Med. Imaging 33(4), 979–994 (2014)
6. Caballero, J., Rueckert, D., Hajnal, J.V.: Dictionary learning and time sparsity in dynamic MRI. In: Ayache, N., Delingette, H., Golland, P., Mori, K. (eds.) MICCAI 2012, Part I. LNCS, vol. 7510, pp. 256–263. Springer, Heidelberg (2012)
7. Jung, H., Sung, K., Nayak, K.S., Kim, E.Y., Ye, J.C.: k-t FOCUSS: a general compressed sensing framework for high resolution dynamic MRI. Magn. Reson. Med. 61(1), 103–116 (2009)
8. Lustig, M., Donoho, D., Santos, J., Pauly, J.: Compressed sensing MRI. IEEE Signal Process. Mag. 25(2), 72–82 (2008)
9. Otazo, R., Candès, E., Sodickson, D.K.: Low-rank plus sparse matrix decomposition for accelerated dynamic MRI with separation of background and dynamic components. Magn. Reson. Med. 73(3), 1125–1136 (2015)
10. Quan, T.M., Han, S., Cho, H., Jeong, W.-K.: Multi-GPU reconstruction of dynamic compressed sensing MRI. In: Navab, N., Hornegger, J., Wells, W.M., Frangi, A.F. (eds.) MICCAI 2015. LNCS, vol. 9351, pp. 484–492. Springer, Heidelberg (2015). doi:10.1007/978-3-319-24574-4_58
11. Ravishankar, S., Bresler, Y.: MR image reconstruction from highly undersampled k-space data by dictionary learning. IEEE Trans. Med. Imaging 30(5), 1028–1041 (2011)
12. Trémoulhéac, B., Dikaios, N., Atkinson, D., Arridge, S.R.: Dynamic MR image reconstruction; separation from undersampled (k-t)-space via low-rank plus sparse prior. IEEE Trans. Med. Imaging 33(8), 1689–1701 (2014)
13. Wohlberg, B.: Efficient convolutional sparse coding. In: 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 7173–7177, May 2014
14. Yao, J., Xu, Z., Huang, X., Huang, J.: Accelerated dynamic MRI reconstruction with total variation and nuclear norm regularization. In: Navab, N., Hornegger, J., Wells, W.M., Frangi, A.F. (eds.) MICCAI 2015. LNCS, vol. 9350, pp. 635–642. Springer, Heidelberg (2015). doi:10.1007/978-3-319-24571-3_76
Acknowledgments
This work was partially supported by the 2016 Research Fund (1.160047.01) of UNIST, the R&D program of MOTIE/KEIT (10054548), the Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Education (NRF-2014R1A1A2058773) and the Bio & Medical Technology Development Program of the NRF funded by the Korean government, MSIP (NRF-2015M3A9A7029725).
© 2016 Springer International Publishing AG
Quan, T.M., Jeong, WK. (2016). Compressed Sensing Dynamic MRI Reconstruction Using GPU-accelerated 3D Convolutional Sparse Coding. In: Ourselin, S., Joskowicz, L., Sabuncu, M.R., Unal, G., Wells, W. (eds) Medical Image Computing and Computer-Assisted Intervention - MICCAI 2016. MICCAI 2016. Lecture Notes in Computer Science(), vol 9902. Springer, Cham. https://doi.org/10.1007/978-3-319-46726-9_56
DOI: https://doi.org/10.1007/978-3-319-46726-9_56
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-46725-2
Online ISBN: 978-3-319-46726-9