1 Introduction

In the last two decades, large-scale pandemics caused by coronaviruses have occurred three times, one of which was the outbreak of Severe Acute Respiratory Syndrome (SARS) in 2003 in Guangdong Province of China [1], the others were the outbreak of Middle East Respiratory Syndrome (MERS) in 2012 in Saudi Arabia and 2015 in South Korea [2, 3]. Unfortunately, at the end of December 2019, a new kind of coronavirus, called COVID-19, was discovered in Wuhan, the capital of Hubei Province of China. It is reported that COVID-19 is a single-stranded RNA virus and usually causes respiratory symptoms and fever, as well as death in severe cases [4, 5]. Later on, during investigating and tracking the infected, it is found that a part of infected people are asymptomatic, but could infect others. Besides, Wuhan has a large and highly mobile scale of population and is a transportation hub in central China. These factors made the prevention and control of COVID-19 even more tough. So, on January 23 Wuhan went into lockdown and the government took first-level public health emergency response to the outbreak.

With COVID-19 spreading around the world, there have emerged an enormous number of works about the pandemic [6,7,8,9,10]. Tang et al. [11] proposed a deterministic susceptible-exposed-infected-removed (SEIR) compartmental model to describe the spread of COVID-19, and used partial data obtained for the confirmed cases of COVID-19 to estimate the basic reproduction number of the disease transmission. Zhao et al. [12] modelled the epidemic time series of COVID-19 cases and estimated the transmission rate of COVID-19 via the basic reproduction number based on the data of Wuhan from January 10 to 24. Wu et al. [13] used a SEIR compartmental model to infer the number of infections in Wuhan from December 1, 2019, to January 25, 2020, from the data from December 31 to January 28 on the number of cases exported from Wuhan internationally. Above works assume that population is mixed homogeneously. But in reality, the differences of individuals and environments lead to particular contact patterns among people; thus, models with the homogeneousness assumption do not apply to real cases. Therefore, to describe the outbreak accurately, we establish a model on a social network, where nodes represent individuals and links stand for the contacts between individuals.

Furthermore, it is found that it is practical and meaningful to study propagation of epidemics on networks [14,15,16,17,18]. In 2002, Newman [19] showed that a large class of the so-called SIR models of epidemic disease can be solved exactly on a wide variety of networks using a combination of mapping to percolation models and generating function methods. Later on, Pastor-Satorras et al. [20] modelled the spread of epidemics on scale-free networks. It was found that epidemics was prevalent on scale-free networks and the topology of networks had a great influence on the behaviours of epidemic spreading. Granell et al. [21] combined the spread of information and diseases to propose an unaware-aware-unaware-susceptible-infected-susceptible (UAU-SIS) model on networks and revealed that information awareness prevented epidemic spreading. Wei et al. [22] modelled a cooperative spreading process over an interconnected network and found that inter-layer links promoted epidemics spreading. Existing works indicate that behaviours of epidemic spreading are not only influenced by the characteristics of the diseases, but also impacted by the underlying network structures [23,24,25,26,27]. As Wuhan went into lockdown from January 23, ignoring the number of medical teams dispatched to assist Wuhan, the newborn, natural deaths and deaths caused by reasons other than the coronavirus, we can assume that the total population in Wuhan is unchanged. Therefore, modelling the outbreak of COVID-19 during the lockdown on a sparse social network is reasonable. We will explore whether the measures the government took were efficient and why the number of the infected was rising in the early stage even though the government took several restrictive measures to fight against the novel coronavirus.

Motivated by above discussions, we introduce a new SAIR model on a social network to describe the spread of COVID-19 in Wuhan after January 23 and then analyse how to suppress the spread of the pandemic. Specially, by calculating the basic reproduction number, we find that there are several measures that can curb COVID-19 spreading, and these measures are in fair agreement with the measures the government took. Furthermore, based on the epidemic data of Wuhan from January 24 to March 2, we analyse the outbreak in Wuhan and find that the outbreak is gradually under control after February 13.

The paper is organized as follows. In Sect. 2, a SAIR model is established and the basic reproduction number is obtained. Next, numerical simulations are provided to verify the effectiveness of the proposed measures with the model in Sect. 3. Then, some analyses and discussions about the outbreak of COVID-19 are given in Sect. 4. Finally, conclusions are drawn in Sect. 5

2 Model and analysis

As is well known, there exists asymptomatic transmission in the spread of COVID-19; thus, we propose a new SAIR model on network G which consists of susceptible (S), asymptomatically infected (A), symptomatically infected (I) and removed (R) individuals, and the total number of individuals is N. For a susceptible individual, if he (she) is infected by contacting with infected individuals (including asymptomatically and symptomatically infected individuals), he (she) could become symptomatic or asymptomatic. For an asymptomatically infected individual, he (she) might be symptomatic after a latent period or get immunized against the virus. Define \(S_{k}\) as the fraction of susceptible individuals out of the individuals with degree k. Similarly, \(A_{k},~I_{k}\) and \(R_{k}\) represent the fraction of asymptomatically infected, symptomatically infected and removed individuals with degree k, respectively. Obviously, \(S_{k}(t),~A_{k}(t),~I_{k}(t),~R_{k}(t)\ge 0\) and \(S_{k}(t)+A_{k}(t)+I_{k}(t)+R_{k}(t)=1\). The propagation process of COVID-19 can be described by

$$\begin{aligned} \left\{ \begin{aligned}&\frac{\mathrm{d}S_{k}(t)}{\mathrm{d}t}=-\beta _{1}kS_{k}(t)\Theta _{A}(t)-\beta _{2}kS_{k}(t)\Theta _{I}(t),\\&\frac{\mathrm{d}A_{k}(t)}{\mathrm{d}t}=\alpha (\beta _{1}kS_{k}(t)\Theta _{A}(t)\\&+\beta _{2}kS_{k}(t)\Theta _{I}(t))-mA_{k}(t-\tau )-\gamma _{1}A_{k}(t),\\&\frac{\mathrm{d}I_{k}(t)}{\mathrm{d}t}=(1-\alpha )(\beta _{1}kS_{k}(t)\Theta _{A}(t)\\&+\beta _{2}kS_{k}(t)\Theta _{I}(t))+mA_{k}(t-\tau )-\gamma _{2}I_{k}(t),\\&\frac{\mathrm{d}R_{k}(t)}{\mathrm{d}t}=\gamma _{1}A_{k}(t)+\gamma _{2}I_{k}(t), \end{aligned}\right. \end{aligned}$$
(1)

where \(\Theta _{A}(t)=\frac{\sum _{k'}k'p(k')A_{k'}(t)}{\langle k \rangle }\) and \(\Theta _{I}(t)=\frac{\sum _{k'}k'p(k')I_{k'}(t)}{\langle k \rangle }\). Here, p(k) is the degree distribution of network G and \(\langle k \rangle \) is the average degree of network G (Table 1).

Table 1 Definition of parameters

Define \(X=(S_{k_{1}},\ldots ,S_{k_{n}},A_{k_{1}},\ldots ,A_{k_{n}},I_{k_{1}},\ldots ,I_{k_{n}},R_{k_{1}}, \ldots ,R_{k_{n}})\), where {\( k_{1},\ldots ,k_{n}\)} is a set of the node degrees of network G. It is obvious that system (1) has a disease-free equilibrium \(E^{0}=(1,\ldots ,1, 0,\ldots ,0)\). By the definition of next-generation matrix [28], combining with compartments \(A_{k}\) and \(I_{k}\) in system (1), we get F, the growth rate of secondary infection, and V, the rate of individuals removed from compartments \(A_{k}\) and \(I_{k}\). The expressions of F and V are

$$\begin{aligned}&F=\begin{bmatrix} \alpha \beta _{1}P &{} \alpha \beta _{2}P\\ (1-\alpha ) \beta _{1}P &{} (1-\alpha ) \beta _{2}P \end{bmatrix}~\text{ and }\\&\quad ~ V=\begin{bmatrix} (m+\gamma _{1})I_{n} &{} 0\\ -mI_{n} &{} \gamma _{2}I_{n} \end{bmatrix},~\text{ respectively }, \end{aligned}$$

where \(P=\frac{1}{\langle k \rangle } \begin{bmatrix} k_{1}^{2}p(k_{1}) &{} k_{1}k_{2}p(k_{2}) &{} \ldots &{} k_{1}k_{n}p(k_{n})\\ k_{2}k_{1}p(k_{1}) &{} k_{2}^{2}p(k_{2}) &{} \ldots &{} k_{2}k_{n}p(k_{n})\\ \vdots &{} \vdots &{} \ddots &{}\vdots \\ k_{n}k_{1}p(k_{1}) &{} k_{n}k_{2}p(k_{2}) &{} \ldots &{} k_{n}^{2}p(k_{n}) \end{bmatrix}\) and \(I_{n}\) is the \(n\times n\) identity matrix.

Then, the basic reproduction number is \(R_{0}=\rho (FV^{-1})\), where \(\rho (\cdot )\) is the spectrum radius of a matrix. Thus, we obtain

$$\begin{aligned} \begin{aligned} FV^{-1}&=\begin{bmatrix} \alpha \beta _{1}P &{} \alpha \beta _{2}P\\ (1-\alpha ) \beta _{1}P &{} (1-\alpha ) \beta _{2}P \end{bmatrix}\\&\begin{bmatrix} \frac{1}{m+\gamma _{1}}I_{n} &{} 0\\ \frac{m}{(m+\gamma _{1})\gamma _{2}}I_{n} &{} \frac{1}{\gamma _{2}}I_{n}\\ \end{bmatrix}\\&=\begin{bmatrix} (\frac{\alpha \beta _{1}}{m+\gamma _{1}}+\frac{m\alpha \beta _{2}}{(m+\gamma _{1})\gamma _{2}})P &{} \frac{\alpha \beta _{2}}{\gamma _{2}}P \\ (\frac{(1-\alpha )\beta _{1}}{m+\gamma _{1}}+\frac{m(1-\alpha )\beta _{2}}{(m+\gamma _{1})\gamma _{2}})P &{} \frac{(1-\alpha )\beta _{2}}{\gamma _{2}}P\\ \end{bmatrix}. \end{aligned} \end{aligned}$$

As \(rank(FV^{-1})=rank(P)\), there is

$$\begin{aligned} \begin{aligned} \rho (FV^{-1})&=trace((\frac{\alpha \beta _{1}}{m+\gamma _{1}}+\frac{m\alpha \beta _{2}}{(m+\gamma _{1})\gamma _{2}}\\&+\frac{(1-\alpha )\beta _{2}}{\gamma _{2}})P)\\&=(\frac{\alpha \beta _{1}}{m+\gamma _{1}}+\frac{m\alpha \beta _{2}}{(m+\gamma _{1})\gamma _{2}}\\&+\frac{(1-\alpha )\beta _{2}}{\gamma _{2}})trace(P){,} \end{aligned} \end{aligned}$$

where \(rank(\cdot )\) is the rank of a matrix and \(trace(\cdot )\) is the sum of the diagonal elements of a matrix. As \(trace(P)=\frac{\sum _{i=1}^{n}k_{i}^{2}p(k_{i})}{\langle k \rangle }=\frac{\langle k^{2} \rangle }{\langle k \rangle }\), the basic reproduction number \(R_{0}\) is

$$\begin{aligned} R_{0}=\left( \frac{\alpha \beta _{1}}{m+\gamma _{1}}+\frac{m\alpha \beta _{2}}{(m+\gamma _{1})\gamma _{2}}+\frac{(1-\alpha )\beta _{2}}{\gamma _{2}}\right) \frac{\langle k^{2} \rangle }{\langle k \rangle }. \end{aligned}$$
(2)

From the expression of \(R_{0}\), we find that the first term (\(\frac{\alpha \beta _{1}\langle k^{2} \rangle }{(m+\gamma _{1})\langle k \rangle }\)) represents the number of individuals who are infected by an asymptomatically infected individual during recovery and latent period, and the third term (\(\frac{(1-\alpha )\beta _{2}\langle k^{2} \rangle }{\gamma _{2}\langle k\rangle }\)) is the number of individuals who are infected by a symptomatically infected individual during the recovery period, whereas the second term (\(\frac{m\alpha \beta _{2}\langle k^{2}\rangle }{(m+\gamma _{1})\gamma _{2}\langle k\rangle }\)) indicates the number of individuals who are infected by the individual that is transformed from an asymptomatically infected to a symptomatical one.

Nowadays, the most attractive issue for the public is how to control the spread of COVID-19. Therefore, we are not going to elaborate the stability of system (1) here, with analysis being attached in “Appendix A”. In the following sections, we focus on how to curb the spread of COVID-19 and analyse the outbreak based on the epidemic data of Wuhan from January 24 to March 2.

Throughout the paper, S(t),  A(t),  I(t) and R(t) are the fraction of susceptible, asymptomatically infected, symptomatically infected and removed individuals on network G, respectively, where \(S(t)\!=\!\sum _{k}p(k)S_{k}(t)\), \(A(t)\!=\!\sum _{k}p(k)A_{k}(t)\), \(I(t)=\sum _{k}p(k)I_{k}(t)\) and \(R(t)=\sum _{k}p(k)R_{k}(t)\), and \(S(t)+A(t)+I(t)+R(t)=1\). Owing to the truth that only the infected can turn into the removed, the removed in the stable state (referred to as \(R_{\infty }\)) represents the fraction of all infected individuals from the outbreak to extinction of COVID-19. Furthermore, we refer to the measure of cutting off connections between symptomatically infected individuals and their neighbours, and that of cutting off connections between hub nodes and their neighbours as Measures 1 and 2, respectively.

3 Numerical simulation

Real-world networks are usually scale free [29]. Thus, we use the algorithm proposed by Barab\(\acute{a}\)si and Albert [30] to generate BA networks of size \(N=1000\), and carry out simulations on these networks. Specifically, start with a fully connected network of \(m_{0}=3\) nodes, then sequentially add the remaining nodes, each connecting to \(m=3\) existing nodes with a probability which is proportional to the number of links that the existing nodes already have. Furthermore, because cutting off connections between symptomatically infected individuals and their neighbours (Measure 1) at each step is a dynamical process, we take the Monte Carlo (MC) method to obtain the values \(S(t),~A(t),~I(t)~\text{ and }~R(t)\). Therefore, initially we set \(S(t)=0.99,~A(t)=0.005,~I(t)=0.005\) and \(R(t)=0\), and seed the infected individuals on a BA network randomly. Then, we take 100 repetitions of MC simulations on the network, and average corresponding values over 10 BA networks. Since \(S(t)+A(t)+I(t)+R(t)=1\), we only present the evolution of A(t),  I(t) and R(t).

Fig. 1
figure 1

Time evolution of A(t),  I(t) and R(t), where \(\beta _{1}=0.25,~\beta _{2}=0.3,~\alpha =0.3,~\tau =14,~m=0.9\). The pink solid line, blue dotted-dashed line and green dashed line represent the fraction of asymptomatically infected, symptomatically infected and removed individuals, respectively

First, we explore the impact of recovery rate \(\gamma _{1}\) and removed rate \(\gamma _{2}\) on the spread of COVID-19. From Fig. 1, we discover that increasing \(\gamma _{1}\) and \(\gamma _{2}\) can obviously suppress the number of infected cases. Specifically, comparing Panels (a) and (b), we observe that A(t) and I(t) in Panel (a) are smaller than those in Panel (b). Similarly, A(t) and I(t) are much smaller in Panel (c) than those in Panel (b). These indicate that increasing the recovery rate and removed rate can effectively curb the spread of COVID-19.

Fig. 2
figure 2

Time evolution of A(t),  I(t) and R(t), where \(\beta _{1}=0.25,~\beta _{2}=0.3,~\alpha =0.3,~\tau =14\), and \(m=0.9\). In the top panels, \(\gamma _{1}=0\) and \(\gamma _{2}=0.1\), and in the bottom panels, \(\gamma _{1}=0.1\) and \(\gamma _{2}=0.1\). a, e Measure 1 is taken immediately at \(t=0\); b, f Measures 1 and 2 are taken from \(t=0\); c, g Measure 1 comes into force at \(t=20\); d, h two measures come into force at \(t=20\). We set that nodes with degree of over 10 are hub nodes. The pink solid line, blue dotted-dashed line and green dashed line represent the fraction of asymptomatically infected, symptomatically infected and removed individuals, respectively

Next, we investigate the roles of Measures 1 and 2 playing in the inhibition of COVID-19. We can obtain that taking Measures 1 and 2 immediately is the best approach to curb the spread of the pandemic. Specifically, when \(\gamma _{1}=0\) and \(\gamma _{2}=0.1\), without any measures taken to control the spread, the pandemic becomes prevalent (see Fig. 1a), and with measures taken, \(R_{\infty }\) becomes much smaller (refer to the top panels of Fig. 2), especially in the case of both Measures 1 and 2 taken. Furthermore, comparing with the case that measures are taken immediately (see Fig. 2a, b), we can observe that for the case of taking measures at \(t=20\), \(R_{\infty }\) is much larger and the time that I(t) arrives at zero becomes longer (Fig. 2c, d). Similar observations can be obtained from the bottom panels of Fig. 2 for \(\gamma _{1}=0.1\) and \(\gamma _{2}=0.1\). These indicate that to curb the spread of COVID-19, we should take Measures 1 and 2 immediately, otherwise the COVID-19 would spread rapidly around the globe and the duration time will become very long, which might cause serious consequence.

In summary, the best way to curb the spread of COVID-19 is to increase the recovery rate and the removed rate and cut off connections between symptomatically infected individuals and their neighbours, as well as to cut off connections between hub nodes and their neighbours. Indeed, the measures proposed by us coincide with those the government had taken to prevent the spread of COVID-19. Specifically, in order to prevent the spread of COVID-19, the government encourages the public to enhance immunity, segregates the infected with the symptomatic, prevents citizens from aggregating, by shutting down all the places of entertainment and imposing a home quarantine mechanism. Increasing the recovery rate and the removed rate means enhancing immunity. In fact, the most effective way to prevent the spread of the pandemic is vaccination (that is, \(\gamma _{1}=\gamma _{2}=1\)). However, since COVID-19 is single-stranded RNA virus, vaccine development is difficult and faces many uncertainties. Thus, one of the feasible methods to curb the spread of COVID-19 is to enhance individual immunity. Measure 1 represents quarantining the infected with symptoms and Measure 2 represents preventing citizens from aggregating. From Eq. (2), Measure 1 reduces not only \(\frac{\langle k^{2} \rangle }{\langle k \rangle }\), but also \(\beta _{1}\) and \(\beta _{2}\). And Measure 2 weakens the effect of hub nodes to further reduce \(\frac{\langle k^{2} \rangle }{\langle k \rangle }\).

Fig. 3
figure 3

Time evolution of A(t) and I(t), where \(\beta _{1}=0.25,~\beta _{2}=0.3,~\alpha =0.3,~\tau =14,~m=0.9,~\gamma _{1}=0\) and \(\gamma _{2}=0.1\). From left to right, the average degree of network is 2, 4, and 6, respectively. In top panels, measures are taken immediately, whereas in the bottom panels, measures come into force at \(t=20\). The pink solid line, blue dotted-dashed line and green dashed line represent the fraction of asymptomatically infected, symptomatically infected and removed individuals, respectively

In Fig. 2, the dynamical behaviours of system (1) are more complicated with \(\gamma _{1}=0\) and \(\gamma _{2}=0.1\), thus we explore the effect of network structures on the pandemic when \(\gamma _{1}=0\), \(\gamma _{2}=0.1\) and Measures 1 and 2 are taken immediately. From Fig. 3, we discover that the spread of the pandemic almost has the same tendency. Specifically, when measures are taken to control the spread immediately, at \(t=14\), I(t) reaches peak and A(t) drops, then during \(t=14\) to 28, I(t) falls slowly and A(t) falls fast. When measures come into force at \(t=20\), we can obtain that when the average degree of network is 4 or 6, I(t) and A(t) reach peaks at \(t=20\), then during the next latent period I(t) falls slowly and A(t) falls fast, finally after \(t=34\) they all arrive at zero. The phenomenon indicate that the latent period and measures play important roles in the spread of the pandemic, which is beneficial for us to analyse the spread of COVID-19 by using the epidemic data of Wuhan from January 24 to March 2 in the next section. Furthermore, it is found that with the average degree of network changing from 2 to 4 and 6, \(R_{\infty }\) becomes larger and the duration of the outbreak becomes longer. Thus, another feasible way to curb the spread of COVID-19 is to reduce the density of social networks, such as restricting mobility and decreasing in-person social contacts.

4 Data analysis

In this section, we use the epidemic data of Wuhan from January 24 to March 2 to analyse the spread of COVID-19 in the city after January 23. We define the net increment of the infected with symptoms as \(\bigtriangleup nI(t)\) at t. Obviously, \(\bigtriangleup nI(t)=\bigtriangleup {I}(t)-\bigtriangleup {R}(t)\), where \(\bigtriangleup {I}(t)\) is the increment of infected cases at t, and \(\bigtriangleup {R}(t)\) is the increment of removed cases at t. According to the official data published by the Chinese government, we can obtain \(\bigtriangleup nI\) from January 24 to March 2, as shown in Fig. 3a (the data are attached in “Appendix B”). Similarly, we define the net increment rate of symptomatically infected individuals as \(\bigtriangleup nIr(t)\) at t, where \(\bigtriangleup nIr(t+1)=I(t+1)-I(t)\).

Fig. 4
figure 4

a The net increment of the infected \(\bigtriangleup nI\) from January 24 to March 2. b The net increment rate of the symptomatically infected \(\bigtriangleup nIr\) with \(\gamma _{1}=0\), and c the net increment rate of the symptomatically infected \(\bigtriangleup nIr\) with \(\gamma _{1}=0.1\). We set \(\beta _{1}=0.25,~\beta _{2}=0.3,~\alpha =0.3,~\tau =14,~m=0.9\) and \(\gamma _{2}=0.1\), and Measures 1 and 2 are taken immediately

From Fig. 4, we realize that the spread of the pandemic is sensitive to the latent period. Specifically, from Fig. 4b, c, we observe that at about \(t=14~(28)\), \(\bigtriangleup nIr\) goes up sharply then falls, and for \(t \in (1,13)((15,26))\), \(\bigtriangleup nIr\) fluctuates. In Fig. 4a, we can obtain that from January 23 to February 4 (except January 27 and 28), \(\bigtriangleup nI\) rose, then from February 4 to 13, \(\bigtriangleup nI\) fluctuated and after February 13, \(\bigtriangleup nI\) had a downward trend.

Thus, in our opinion, the first stage of the outbreak of COVID-19 is from January 23 to February 4. In this stage, because there were many asymptomatically infected people (secondary infected) before January 23, \(\bigtriangleup nI\) rose continually even though the government had taken efficient measures to prevent the spread of COVID-19 on January 23. In the second stage, from February 4 to 13, \(\bigtriangleup nI\) fluctuated because the asymptomatically infected people began to display symptoms. After February 13, as the third stage, the trend of \(\bigtriangleup nI\) was down and especially \(\bigtriangleup nI\) decreased to negative after February 19. This indicates that the spread of COVID-19 is gradually under control.

From the aforementioned analysis of Fig. 4, we realize that the net increment rate of symptomatically infected individuals \(\bigtriangleup nIr\) fluctuates during the latent period. And throughout the spread of COVID-19 in Wuhan, the net increment of the infected with symptoms \(\bigtriangleup nI\) fluctuated from February 4 to 13, thus we reckon that the latent period is around 9 days. Thus, the opinion that the latent period can be extremely longer than one or two month is wrong. Perhaps there are some cases that asymptomatically infected people begin to exhibit symptoms after one month, but these are small probability events.

5 Conclusion

In this paper, we propose a SAIR model on social networks to model the spread of COVID-19 and analyse the outbreak based on the epidemic data of Wuhan from January 24 to March 2. By calculating the basic reproduction number and carrying out Monte Carlo simulations, we can obtain that to curb the spread of the pandemic, the measures proposed by us are in fair agreement with the measures the government had taken. In particular, we divide the outbreak of COVID-19 in Wuhan into three stages to explain why the infected cases increased though the government had taken efficient measures in the early stage. Furthermore, we find that with the average degree of network ascending, the pandemic would spread more rapidly and the duration time becomes longer. Therefore, another feasible method to curb the spread of the pandemic is to reduce the density of social networks, such as limiting the mobility and in-person social contacts.