
1 Introduction

Exploring the organizational architecture of human brain function has long been of great interest in the neuroscience community [1]. After decades of active research using noninvasive neuroimaging methods such as functional magnetic resonance imaging (fMRI), there is mounting evidence that brain function is realized by the interaction of multiple concurrent neural processes or functional brain networks [2], and that these networks are spatially distributed across specific structural substrates of neuroanatomical areas [3]. In these fMRI-based studies, researchers have developed a variety of brain network reconstruction and modeling techniques, such as the general linear model (GLM) [4], independent component analysis (ICA) [5], and sparse representation/dictionary learning methods [6, 7]. These methods have reconstructed many meaningful functional brain networks, characterized by both spatial maps and corresponding time series, from both tfMRI and rsfMRI datasets, and have greatly advanced our understanding of the regularity and variability of brain function [4, 5].

However, these existing approaches are based on shallow models and are therefore limited in faithfully reconstructing and modeling the hierarchical and temporal structures of functional brain networks in tfMRI data [8]. Recently, deep learning methods have attracted much attention across a variety of challenging tasks [9]. Their success lies in the ability to represent raw data automatically and hierarchically. Inspired by this success, a growing number of researchers have applied deep learning methods to functional brain network analysis [10,11,12]. Although these recent works demonstrate the advantages of deep learning methods, information at multiple temporal scales is rarely taken into consideration in their models, even though brain activities are known to unfold on multiple time scales [13].

Recently, recurrent neural networks (RNNs) have been gaining increasing attention [14]. Unlike traditional feedforward neural networks, RNNs can use their internal memory units to process arbitrary sequences of inputs and to model sequential and temporal dependencies on multiple time scales [15]. That is, an RNN makes its predictions based not only on the information available at a given time point, but also on information available in the past. Since brain activity is modulated by long temporal dependencies [16], which coincides well with the characteristics of RNN models, it is natural and well justified to adopt RNNs to explore functional brain networks in tfMRI data. However, whether RNNs can be utilized to infer functional brain networks from whole-brain tfMRI data has rarely been explored. To explore the possible advantages of RNN models, in this study we propose a novel, alternative deep recurrent neural network (DRNN) framework for modeling functional brain networks in tfMRI data. An important characteristic of the DRNN framework is that the task stimulus information is processed sequentially through the model, which then automatically generates the observed whole-brain voxel signals. In this way, the hierarchical and temporal structures of brain activity are captured, and brain networks at multiple time scales (especially time-dependency-sensitive brain networks) can be identified. We used the motor task tfMRI dataset of the HCP 900-subject data release as a test bed, and extensive experimental results demonstrate the superiority of the proposed method in identifying functional brain networks at multiple time scales in tfMRI.

2 Materials and Methods

2.1 Overview

Figure 1 summarizes the proposed deep recurrent neural network (DRNN) model. There are three major steps in modeling tfMRI functional brain networks using the DRNN. First, for each subject, the task design stimulus curves are gathered into a stimulus matrix \( \mathbf{X} \) (\( k \) stimuli over \( t \) time points) as the input layer, and the whole-brain tfMRI signals are aggregated into a large signal matrix \( \mathbf{Y} \) (\( m \) voxels over \( t \) time points). Then the task stimulus patterns are passed through two hidden layers, each consisting of \( n_{h} \) RNN units. Next, the response of the top hidden layer is connected to the whole-brain signal matrix via a fully connected layer of size \( [n_{h}, m] \). Specifically, each hidden node's connection weight vector represents a typical functional brain network, and its hidden response to specific stimulus patterns represents the temporal activity pattern of that network.

Fig. 1. Overview of the DRNN model.
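As a concrete illustration, the following is a minimal TensorFlow/Keras sketch of this architecture. The paper implements its models in TensorFlow [18], but this sketch is our own reading of the description above, not the authors' code; the dimensions `k`, `t`, `m`, and `n_h` are placeholder values, and the linear readout is a simplification of the nonlinear readout in Eq. (1).

```python
import tensorflow as tf

# Illustrative dimensions (assumptions, not taken from the paper):
k, t, m, n_h = 6, 284, 50000, 32   # stimuli, time points, voxels, hidden units

# Input: the stimulus matrix X, arranged as a length-t sequence of k-dim vectors.
inputs = tf.keras.Input(shape=(t, k))

# Two stacked recurrent layers; return_sequences=True keeps a hidden state
# at every time step, so the model emits an output for each fMRI frame.
h1 = tf.keras.layers.LSTM(n_h, return_sequences=True)(inputs)
h2 = tf.keras.layers.LSTM(n_h, return_sequences=True)(h1)

# Fully connected readout [n_h, m]: maps the top hidden state at each time
# step to the m whole-brain voxel signals (the matrix Y).
outputs = tf.keras.layers.Dense(m)(h2)

model = tf.keras.Model(inputs, outputs)
# Section 2.3 states the parameters are optimized for mean squared error
# between the whole-brain signals and their reconstructions.
model.compile(optimizer="adam", loss="mse")
```

After training, the kernel of the final `Dense` layer (shape `[n_h, m]`) would hold one candidate spatial map per hidden unit, matching the interpretation given above.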

2.2 Data Acquisition and Pre-processing

The Human Connectome Project (HCP) dataset is one of the most systematic and comprehensive neuroimaging datasets currently available; it aims to bring data from the major MRI neuroimaging modalities together into a cohesive framework to enable detailed comparisons of brain architecture, connectivity, and function across individual subjects. Importantly, this dataset is publicly available, which makes it a good test bed for different researchers. In this paper, we adopt the motor tfMRI dataset of the HCP 900-subject data release to test the proposed method. The detailed design paradigms of the motor task and other tasks are available in [17].

The detailed acquisition parameters of the tfMRI data were as follows: 220 mm FOV, in-plane FOV: 208 × 180 mm, flip angle = 52°, BW = 2290 Hz/Px, 2 × 2 × 2 mm spatial resolution, 90 × 104 matrix, 72 slices, TR = 0.72 s, TE = 33.1 ms. Preprocessing of the tfMRI data included skull removal, motion correction, slice timing correction, spatial smoothing, and global drift removal (high-pass filtering). All preprocessing steps were implemented in FSL FEAT. The individual fMRI datasets were first registered to the MNI common space for further analysis. In addition, GLM-based activation results were derived using FSL FEAT for comparison.

2.3 Deep Recurrent Neural Network Model

RNNs are feedforward neural networks augmented with edges spanning adjacent time steps, so that connections between units form a directed cycle. These connections introduce a notion of time and provide a memory of past states. In contrast to traditional neural networks, which receive information only at the bottom layer and produce output only at the highest layer, RNNs receive input and produce output at each time step. However, a common RNN processes information through only one layer before producing output, which provides neither a hierarchical structure for processing the input information nor an explicit temporal hierarchy of the input signals. To overcome these limitations, we propose a deep recurrent neural network (DRNN) framework for modeling functional brain networks in tfMRI data. The basic idea of the DRNN is to stack RNNs into a hierarchical network architecture: each hidden layer is a recurrent neural network, and the hidden state sequence of each layer is the input of the next layer. In this way, new information propagates through the hierarchy during each network update, and temporal context is added in each layer (Fig. 2). As demonstrated in character-based language modeling studies [15], stacking RNNs automatically creates different time scales across different levels and thus forms a temporal hierarchy of information processing.

Fig. 2. Illustrative map of the DRNN. Blue circles represent input units, green circles represent hidden units, and red circles represent output units.

We define a DRNN with \( L \) layers, where layer \( i \) has \( n_{i} \) hidden units. The input sequence is denoted as \( \left( \mathbf{x}^{(1)}, \mathbf{x}^{(2)}, \ldots, \mathbf{x}^{(t)} \right) \), where each data point is a real-valued vector; the target sequence is denoted as \( \left( \mathbf{y}^{(1)}, \mathbf{y}^{(2)}, \ldots, \mathbf{y}^{(t)} \right) \); and the hidden state of the \( i \)-th layer is denoted as \( \mathbf{h}_{i}^{(t)} \). To avoid confusion between node indices and sequence steps, we use superscripts for time and subscripts for layer index. The output of the DRNN model is given by Eq. (1), where \( \hat{\mathbf{y}}^{(t)} \) is the output estimated from the top hidden layer \( \mathbf{h}_{L}^{(t)} \), \( \mathbf{V} \) is the weight matrix between the top hidden layer and the output, and \( \mathbf{b} \) is the bias vector containing the offset of each output node.

$$ \hat{\mathbf{y}}^{(t)} = \sigma \left( \mathbf{V} \mathbf{h}_{L}^{(t)} + \mathbf{b} \right) $$
(1)
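Read as code, Eq. (1) is simply a dense readout of the top hidden state at each time step. A minimal NumPy rendering with hypothetical sizes (taking \( \sigma \) to be the logistic sigmoid, one plausible reading) is:

```python
import numpy as np

rng = np.random.default_rng(0)
n_L, m = 32, 1000                    # hypothetical sizes: top-layer units, voxels
V = rng.standard_normal((m, n_L))    # readout weight matrix V
b = np.zeros(m)                      # bias vector b
h_t = rng.standard_normal(n_L)       # top hidden state h_L at time t

# Eq. (1): dense readout of the top hidden state into m voxel signals.
y_hat_t = 1.0 / (1.0 + np.exp(-(V @ h_t + b)))
```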

There are different types of RNN architectures, and the long short-term memory (LSTM) is among the most popular specialized memory units for RNNs; it was developed to handle long time series. The first-layer hidden states of an LSTM unit are defined as:

$$ \mathbf{h}^{(t)} = \mathbf{o}^{(t)} \odot \tanh \left( \mathbf{c}^{(t)} \right) $$
(2)
$$ \mathbf{o}^{(t)} = \sigma \left( \mathbf{U}_{o} \mathbf{h}^{(t-1)} + \mathbf{W}_{o} \mathbf{x}^{(t)} + \mathbf{b}_{o} \right) $$
(3)

where \( \mathbf{c}^{(t)} \) is the cell state, \( \mathbf{o}^{(t)} \) is the output gate activation, and \( \odot \) denotes elementwise multiplication. Information about previous time points is stored in the cell state, and the output gate controls what information is retrieved from it. The hidden and cell states of the second and higher layers are defined analogously to those of the first layer, except that the input is replaced by the hidden states of the layer below. The parameters of the DRNN framework are optimized to minimize the mean squared error between the whole-brain signals and their reconstructions. The TensorFlow [18] system was adopted to implement the models.
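For completeness, the sketch below spells out one step of a standard LSTM cell in NumPy. Eqs. (2)-(3) correspond to `h_t` and `o_t`; the input gate, forget gate, and candidate cell update follow the common LSTM formulation and are an assumption on our part, since the paper does not spell them out.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def lstm_step(x_t, h_prev, c_prev, params):
    """One LSTM time step. h_t and o_t match Eqs. (2)-(3) of the text;
    the gates i_t, f_t and candidate g_t are the standard LSTM updates
    (assumed, not stated in the paper)."""
    W, U, b = params["W"], params["U"], params["b"]  # rows stacked as [i, f, g, o]
    n = h_prev.shape[0]
    z = W @ x_t + U @ h_prev + b                     # all gate pre-activations
    i_t = sigmoid(z[0*n:1*n])                        # input gate
    f_t = sigmoid(z[1*n:2*n])                        # forget gate
    g_t = np.tanh(z[2*n:3*n])                        # candidate cell state
    o_t = sigmoid(z[3*n:4*n])                        # output gate, Eq. (3)
    c_t = f_t * c_prev + i_t * g_t                   # cell state update
    h_t = o_t * np.tanh(c_t)                         # hidden state, Eq. (2)
    return h_t, c_t
```

Here `params["W"]` has shape `[4n, k]`, `params["U"]` has shape `[4n, n]`, and `params["b"]` has shape `[4n]`, with the four gate blocks stacked row-wise.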

2.4 Identification of Functional Brain Networks

In the DRNN model, the task design stimulus information is split across time points and fed into the model step by step. At each network update, new information is propagated through the hierarchical structure and temporal context is added in each RNN layer. Each hidden layer in the DRNN is a recurrent neural network, and each upper layer receives the hidden state sequence of the previous layer as input; the information emerging from the stacked RNN structure therefore spans different time scales. Finally, the top hidden layer's output is connected to the whole-brain signal matrix via a fully connected layer. Specifically, each hidden node's connection weight vector represents the spatial distribution of a typical functional brain network, and its hidden response to a specific stimulus represents the temporal pattern of that network. To compare the derived brain networks with those obtained by other methods, a spatial matching procedure is adopted to calculate the spatial similarity between the identified networks and network templates derived from other methods. The spatial similarity is defined as the spatial pattern overlap rate R:

$$ R\left( \mathbf{S}, \mathbf{T} \right) = \frac{\left| \mathbf{S} \cap \mathbf{T} \right|}{\left| \mathbf{T} \right|} $$
(4)

where \( \mathbf{S} \) and \( \mathbf{T} \) are the cortical spatial maps of a brain network component and the brain network template, respectively.
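A minimal sketch of Eq. (4) on binarized spatial maps might look as follows; the thresholding step is an assumption, since the paper does not state how the maps are binarized.

```python
import numpy as np

def overlap_rate(S, T, thresh=0.0):
    """Spatial pattern overlap rate R(S, T) = |S ∩ T| / |T| (Eq. 4).
    S, T: 1-D arrays of voxel values for an identified network and the
    template. Maps are binarized by thresholding (an assumed choice)."""
    S_bin = S > thresh
    T_bin = T > thresh
    return np.logical_and(S_bin, T_bin).sum() / T_bin.sum()
```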

3 Experimental Results

3.1 Identified Typical Functional Brain Networks

Figure 3 illustrates a few typical brain networks identified from the motor tfMRI dataset of the HCP 900-subject release using the DRNN model. For comparison, the GLM group-wise activation maps are listed in the right column. The figure clearly shows that some of the trained functional networks are quite similar to the corresponding GLM activation maps. To quantify this similarity, we used Eq. (4) to calculate the spatial overlap rate between the identified DRNN networks and the corresponding GLM activation maps; the results are listed in the first row of Table 1. In addition, the corresponding temporal patterns are quite similar to the common HRF response patterns (the convolution of the task design paradigm with the HRF). Figure 4 shows the corresponding temporal response patterns together with the task design patterns and the HRF response patterns. It is easy to see that the temporal patterns of the DRNN brain networks are highly correlated with the HRF responses. The high spatial overlap rates and close temporal correlations together suggest that the proposed DRNN model can identify meaningful and reliable functional networks in an automatic way.
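The HRF response patterns referred to above can be obtained by convolving the task design with a canonical HRF. The sketch below uses a double-gamma HRF and SciPy's Pearson correlation; the HRF parameters and the block timing are standard illustrative values, not taken from the paper.

```python
import numpy as np
from scipy.stats import pearsonr
from scipy.special import gamma as gamma_fn

def canonical_hrf(tr=0.72, duration=32.0):
    """Double-gamma canonical HRF sampled at the TR (standard textbook
    parameters; the paper does not state its exact HRF model)."""
    t = np.arange(0.0, duration, tr)
    peak = t**5 * np.exp(-t) / gamma_fn(6)
    undershoot = t**15 * np.exp(-t) / gamma_fn(16)
    return peak - undershoot / 6.0

def hrf_regressor(boxcar, tr=0.72):
    """Convolve a 0/1 task design time series with the canonical HRF."""
    return np.convolve(boxcar, canonical_hrf(tr))[: len(boxcar)]

# Toy usage with a synthetic ~12 s on/off block design (hypothetical):
boxcar = np.tile(np.r_[np.ones(17), np.zeros(17)], 8)
regressor = hrf_regressor(boxcar)
# For a network's temporal pattern p of the same length, the reported
# similarity would be: r, _ = pearsonr(p, regressor)
```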

Fig. 3. A few identified functional brain networks in the motor task tfMRI dataset of the HCP 900-subject release. The left column shows the networks identified using the DRNN model with LSTM units, and the right column shows the GLM-derived group-wise activation maps. M1–M6 represent different stimuli.

Table 1. The first row shows the spatial overlap rates between the networks identified by the DRNN and the corresponding GLM-derived group-wise activation maps. The second row shows the Pearson correlations between the temporal patterns and the common HRF response patterns.

Fig. 4. Temporal response patterns corresponding to the identified brain networks in Fig. 3.

3.2 Identified Functional Brain Networks of Multiple Time Scales

During the training stage, the task stimulus information passes through the hierarchical and temporal model iteratively, and the final output naturally reflects the brain networks' responses to the original stimulus information across multiple time scales. After the training stage, we input each stimulus separately and obtain the corresponding temporal pattern for each network. To better interpret the identified functional brain networks, we further calculated the correlations between the identified temporal brain activity patterns and the theoretical regressor groups adopted in previous studies [12]. Essentially, the theoretical regressor groups represent possible brain responses at multiple time scales. Our basic idea is that if a specific temporal pattern is highly correlated with an extended theoretical regressor, the corresponding identified DRNN network should belong to a similar time scale.
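Extended regressor groups of this kind can be generated from a base HRF response by applying delays, derivatives, integrals, and sign inversion. The sketch below is our own hedged reading of that construction, with the 3 s delay interval taken from the Fig. 5 caption; the exact operations in [12] may differ.

```python
import numpy as np

def extended_regressors(base, tr=0.72, n_delays=7, delay_step=3.0):
    """Build six extended regressor groups from a base HRF response:
    delays, derivative and integral forms, and their inversions. The
    exact construction is assumed, guided by the Fig. 5 caption."""
    groups = {"delay": [], "derivative": [], "integral": []}
    for d in range(n_delays):
        shift = int(round(d * delay_step / tr))      # delay in TRs
        delayed = np.roll(base, shift)
        delayed[:shift] = 0.0                        # zero-pad the onset
        groups["delay"].append(delayed)
        groups["derivative"].append(np.gradient(delayed))
        groups["integral"].append(np.cumsum(delayed))
    # Inversed groups are the sign-flipped versions of each form.
    for name in list(groups):
        groups["inversed_" + name] = [-r for r in groups[name]]
    return groups
```

Correlating each network's temporal pattern against every regressor in these groups yields correlation maps of the kind shown in Fig. 5.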

Figure 5 shows the temporal correlation maps between the temporal response patterns of the 30 identified DRNN networks under stimulus M6 and the extended hypothetical regressor groups in [12]. Following [12], we extended the basic HRF response patterns with multiple delays and with derivative, integral, and inversion operations. The figure shows that several network temporal patterns are highly correlated with the extended hypothetical regressors; these represent identified brain networks at different time scales. Figure 6 illustrates a few typical brain networks at different time scales and their corresponding temporal patterns. From this result, we can see that theoretical response networks at a variety of time scales, including multiple delays, multiple inversed HRFs with delays, and different derivative and integral operations, could be identified. We further inspected the spatial patterns of these networks, and it is interesting that they are similar but not identical. This is reasonable, since these networks are evoked by the same stimulus but at different time scales. The ability to effectively identify these brain networks at multiple time scales is a major advantage of the proposed DRNN framework.

Fig. 5. Temporal correlation maps between the temporal response patterns of the 30 identified DRNN networks and the extended hypothetical regressor groups in [12]. (a) HRF delay group; (b) derivative form group; (c) integral form group; (d) inversed HRF group; (e) inversed derivative form group; (f) inversed integral form group. In each subfigure, each row represents a DRNN network and the 7 columns represent 7 different time delays with an interval of 3 s.

Fig. 6. The spatial and temporal patterns of a few identified brain networks of multiple time scales shown in Fig. 5.

The proposed DRNN model was also applied to half of the HCP Q1 release dataset (34 subjects) and obtained similar and consistent results, although more training data (e.g., the HCP Q3 release) would improve the reliability and interpretability of the results. L1 and L2 norm regularization were tried during the training stage, but the training loss increased rapidly with either regularizer; therefore, only the MSE was used as the loss function.

4 Discussion and Conclusion

In this work, we proposed a novel deep recurrent neural network (DRNN) for modeling functional brain networks in tfMRI data. The DRNN framework naturally combines common deep neural networks with RNNs: each hidden layer of the DRNN is a recurrent neural network, and the output of each layer is the input time series of the layer above. This structure automatically creates different time scales across different levels and thus forms a temporal hierarchy. After training with the task stimulus, the whole-brain voxel signals are automatically reconstructed from the top hidden layer's output. Specifically, the weight vector between the hidden units and the whole-brain fMRI signals describes the spatial distribution of a network, and the top hidden layer's output under a specific stimulus naturally represents the corresponding temporal pattern of that network. In this way, the hierarchical and temporal information of brain activity is captured, and brain networks at different time scales can be identified. Extensive experimental results demonstrate the superiority of the proposed DRNN framework.