A stochastic SIR network epidemic model with preventive dropping of edges

Ball, Frank; Britton, Tom; Leung, Ka Yin; Sirl, David

doi:10.1007/s00285-019-01329-4

A stochastic SIR network epidemic model with preventive dropping of edges

Open access
Published: 13 March 2019

Volume 78, pages 1875–1951, (2019)
Cite this article

Download PDF

You have full access to this open access article

Journal of Mathematical Biology Aims and scope Submit manuscript

A stochastic SIR network epidemic model with preventive dropping of edges

Download PDF

4904 Accesses
20 Citations
2 Altmetric
Explore all metrics

Abstract

A Markovian Susceptible $\rightarrow $ Infectious $\rightarrow $ Recovered (SIR) model is considered for the spread of an epidemic on a configuration model network, in which susceptible individuals may take preventive measures by dropping edges to infectious neighbours. An effective degree formulation of the model is used in conjunction with the theory of density dependent population processes to obtain a law of large numbers and a functional central limit theorem for the epidemic as the population size $N \rightarrow \infty $, assuming that the degrees of individuals are bounded. A central limit theorem is conjectured for the final size of the epidemic. The results are obtained for both the Molloy–Reed (in which the degrees of individuals are deterministic) and Newman–Strogatz–Watts (in which the degrees of individuals are independent and identically distributed) versions of the configuration model. The two versions yield the same limiting deterministic model but the asymptotic variances in the central limit theorems are greater in the Newman–Strogatz–Watts version. The basic reproduction number $R_0$ and the process of susceptible individuals in the limiting deterministic model, for the model with dropping of edges, are the same as for a corresponding SIR model without dropping of edges but an increased recovery rate, though, when $R_0>1$, the probability of a major outbreak is greater in the model with dropping of edges. The results are specialised to the model without dropping of edges to yield conjectured central limit theorems for the final size of Markovian SIR epidemics on configuration-model networks, and for the size of the giant components of those networks. The theory is illustrated by numerical studies, which demonstrate that the asymptotic approximations are good, even for moderate N.

Emergence in complex networks of simple agents

Article Open access 23 May 2023

David G. Green

Ergodic stationary distribution of age-structured HBV epidemic model with standard incidence rate

Article 08 April 2024

Anwarud Din & Yongjin Li

Infection-induced increases to population size during cycles in a discrete-time epidemic model

Article Open access 10 April 2024

Laura F. Strube, Shoshana Elgart & Lauren M. Childs

1 Introduction

In understanding the transmission dynamics in a population, one of the most important modelling components is the contact process. In this work we consider a form of self-initiated social distancing in response to an epidemic while at the same time taking into account the underlying contact network structure of the population. The resulting network is sometimes referred to as an adaptive network, e.g. Gross et al. (2006), Shaw and Schwartz (2008), Zanette and Risau-Gusmán (2008) and Tunc and Shaw (2014). Behavioural dynamics in infectious disease models can come in many different forms. Much of the literature that combines behavioural changes with network models uses agent-based simulations, as in the works cited above, although analytical advances have also been made (e.g. Britton et al. (2016) and Jacobsen et al. (2018)). Our work takes the model introduced in Britton et al. (2016) as its starting point. Britton et al. (2016) consider a broader class of models but restrict the analysis to the initial phase of the epidemic. In the current paper we analyse the time evolution and the final size of the epidemic. We model an SIR (Susceptible $\rightarrow $ Infectious $\rightarrow $ Recovered) infection on a configuration network that is static in the absence of infection. A susceptible individual breaks off its connection to an infectious neighbour upon learning of that neighbour’s infectious status. This occurs at a constant rate, independently per neighbour. One can think of this mechanism as being governed by infectious individuals informing their neighbours. Whereas infectious and recovered neighbours do not take any action upon being informed, susceptible neighbours want to avoid becoming infected and therefore cease contact with the infectious individual. We use the term ‘preventive dropping of edges’ to indicate this type of behaviour. Details of the model formulation are presented in Sect. 2.

To some extent, from the point of view of a susceptible neighbour of an infectious individual, it does not matter whether the infectious individual recovers or informs and dissolves the connection. Either way, it means that the susceptible neighbour can no longer acquire infection from this individual. In Sect. 5 we see that this is true when dealing with the asymptotic mean (deterministic) process, in that the number of susceptibles in the deterministic process for the model with dropping of edges coincides with that for the model without dropping of edges but with an increased recovery rate. In Sect. 8 we also see that this is not true for the stochastic process, in particular, the probability of a major outbreak differs (Theorem 8.1). Indeed, we cannot expect the two stochastic processes to coincide since informing neighbours happens independently of one another, while recovery affects all neighbours simultaneously.

In Sect. 3 we analyse the preventive dropping model throughout the epidemic outbreak, by using a so-called effective degree construction (cf. Ball and Neal 2008). Using such a construction, conditional on a major outbreak, by using techniques from Ethier and Kurtz (1986), we show under the assumption of bounded degrees that, as the population size N tends to infinity, the fractions of the population that are susceptible, infective and recovered satisfy a law of large numbers (LLN) over any finite time interval (more specifically that they converge almost surely to a limiting deterministic process), together with an associated functional central limit theorem (CLT) which describes fluctuations of the stochastic epidemic process about the limiting deterministic epidemic.

The population consists of N individuals that make up a network, which is formed using the configuaration model. The configuration model was introduced by Bollobás (1980), see Bollobás (2001) for further references, and comes in two versions: either (i) the degrees of individuals are prescribed deterministically, the Molloy–Reed (MR) random graph (Molloy and Reed 1995), or (ii) the degrees of individuals are assumed to be independent and identically distributed, the Newman–Strogatz–Watts (NSW) random graph (Newman et al. 2001). We treat both the MR and the NSW versions. If the limiting distribution of the degrees in the MR construction agrees with the degree distribution of the NSW random graph, the two versions give the same LLN, as we show in Theorem 3.1. However, the two versions differ regarding the variance in the CLTs, since (for finite N) there is greater variability in the degrees of the individuals in the NSW model than in the MR model. The functional CLT for the epidemic on an MR random graph is given in Theorem 3.2. By making a random time transformation, in Sect. 4, we conjecture a CLT for the final outcome of the epidemic on an MR random graph; see Conjecture 4.1. Corresponding results for the epidemic on an NSW random graph are discussed in Sect. 7; see Theorem 7.2 and Conjecture 7.1. To prove the latter results we require a version of the functional CLT in Ethier and Kurtz (1986) which allows for asymptotically random initial conditions; see Theorem 7.1.

The asymptotic variance–covariance matrix in the CLT in Proposition 4.1 is far from explicit. In order to obtain a nearly-explicit expression for the limiting variance of the final size, it is necessary to solve (partially) a time-transformed limiting deterministic process, which is more amenable to analysis than the corresponding deterministic process in real time. This is done in Sect. 5.1 and linked to the solution of the real-time process in Sect. 5.1.2. These results are used in Sects. 6 and 7 to obtain almost fully explicit expressions for the asymptotic variance of the final size of epidemics on MR and NSW random graphs, respectively, see Proposition 6.1 and Conjecture 7.1. In Sect. 5.2, we connect our analysis of the deterministic effective degree model to results derived using other deterministic approaches (cf. Volz (2008), Leung and Diekmann (2016) for related models), leading to a simple proof that the process of susceptible individuals in the limiting deterministic model for the epidemic with preventive dropping of edges is identical to that in the corresponding deterministic model without dropping of edges but with an increased recovery rate (see Remark 5.3).

Note that in the absence of behaviour change, we are in the setting of a Markov SIR epidemic on a configuration model network, which we consider in Sect. 9. This model has been analysed in several papers, e.g. Newman (2002), Kenah and Robins (2007), Lindquist et al. (2011) and Miller (2011). Our results further improve understanding of this well-studied model, particularly in terms of the asymptotic variance of the final size in Conjecture 9.1. Moreover, our work yields conjectured CLTs for the size of the giant component in MR and NSW configuration model random graphs; see Conjecture 9.2.

In Sect. 10, we illustrate our results with some numerical studies. In particular, we demonstrate that the asymptotic results generally give a good approximation for moderate population sizes, investigate the impact of the dropping of edges on properties of epidemics and do some comparison of the behaviour of the epidemic on MR and NSW type random graphs. Some brief concluding comments are given in Sect. 11.

Finally, we would like to make a note on the structure of the paper. Clearly, this paper does not readily lend itself to a quick superficial read, owing to its length and some of the technicalities and details involved in obtaining our results. However, we have tried to help the reader by formulating our main results in terms of propositions, theorems and well-motivated conjectures. The more technical aspects can be found in the appendices for the interested reader, which consequently constitute a significant part of the paper.

2 The stochastic SIR network epidemic model with preventive dropping

In this section we define the stochastic SIR network epidemic model with preventive dropping. This model is a special case of the network epidemic model with preventive rewiring defined in Britton et al. (2016), namely where there is no latent period and where the fraction of dropped edges that are replaced by new edges is set to zero.

The population consists of N individuals, labelled $1,2,\ldots ,N$, that make up a network. The network is formed using the configuration model, which, as described in Sect. 1, comes in two versions, namely MR and NSW random graphs. Let D be a random variable which describes the degree of a typical individual and $p_k=\mathrm{P}(D=k), k=0,1,\ldots $. Let $\mu _D$ and $\sigma ^2_D$ denote the mean and variance of D, respectively, both of which are assumed to be finite.

(i)
In the MR model, the degrees are prescribed. More specifically, for $N=1,2,\ldots $, let $d_1^N,d_2^N,\ldots , d_N^N$ denote the degrees of the individuals when the population size is N. Note that these are deterministic. Let $p_k^N=N^{-1}\sum _{i=1}^N \delta _{k,d_i^N}, k=0,1,\ldots $ be the empirical distribution of $d_1^N,d_2^N,\ldots , d_N^N$, where the Kronecker delta $\delta _{k,j}$ is 1 if $k=j$ and 0 otherwise. It is assumed that $\lim _{N \rightarrow \infty } p_k^N =p_k, k=0,1,\ldots $.
(ii)
In the NSW model, the degrees $D_1,D_2,\ldots ,D_N$ of the N individuals are independent and identically distributed copies of D. A sequence of networks, indexed by N, may be constructed from a sequence $D_1,D_2,\ldots $ of independent and identically distributed copies of D by using the first N random variables for the network on N individuals.

In both models the network is formed by attaching a number of stubs (i.e. half-edges) to each individual, according to its degree (so, for example, in the NSW model, $D_i$ stubs are attached to individual i, for $i=1,2,\ldots ,N$), and then pairing up these stubs uniformly at random to form the network. If $D_1+D_2+\cdots +D_N$ is odd, there is a left-over stub, which is ignored. The network may have some ‘defects’, specifically self-loops and multiple edges between pairs of individuals, but provided $\sigma ^2_D<\infty $, which we assume, such defects become sparse in the network as $N \rightarrow \infty $; see Durrett (2007), Theorem 3.1.2.

A Markovian SIR epidemic is defined on the network of N individuals as follows. Each individual is at any point in time either susceptible, infective or recovered (and immune to further infection). An infective individual infects each of its susceptible neighbours at the points of independent Poisson processes, each having rate $\beta $. An infectious individual recovers and becomes immune at rate $\gamma $ (implying that the duration of the infectious period follows an exponential distribution having mean $1/\gamma $). Finally, susceptible individuals that have infectious neighbours drop such connections, independently, at rate $\omega $ (an equivalent description to be used later is that the infective ‘warns’ its neighbours independently at rate $\omega $, and warned susceptible individuals drop the corresponding edge). All infectious periods, infecting processes and edge-dropping processes are mutually independent. The epidemic is initiated at time $t=0$ by one or more individuals being infectious and all other individuals being susceptible. More precise initial conditions are given when they are required. The epidemic continues until there is no infectious individual. Then the epidemic stops and the result is that some of the individuals have been infected (and later recovered) and the rest of the population remains susceptible and hence have not been infected during the outbreak.

The parameters of the model are the degree distribution $\{p_k\}$, including its mean $\mu _D$ and variance $\sigma _D^2$, the infection rate $\beta $, the recovery rate $\gamma $ and the dropping rate $\omega $.

It was shown in Britton et al. (2016) that the basic reproduction number for the model is given by

$$\begin{aligned} R_0=\frac{\beta }{\beta +\gamma +\omega }\left( \mu _D+\frac{\sigma _D^2}{\mu _D}-1\right) , \end{aligned}$$

(2.1)

see also Sect. 8. Note that the first factor in (2.1) is the probability that an infective infects a given susceptible neighbour before either the infective recovers or the neighbour drops its edge to that infective. The second factor is the expected number of susceptible neighbours for infected individuals during the early stages of an outbreak initiated by few infectives. Owing to the way the network is constructed, the degree $\tilde{D}$ of a typical neighbour of a typical individual has the size-biased distribution $\mathrm{P}(\tilde{D}=k)=\mu _D^{-1}k p_k$, $k=1,2,\ldots $, and hence mean $\mu _D^{-1}\mathrm{E}[D^2]=\mu _D^{-1}(\mu _D^2+\sigma _D^2)$. In the early stages of an outbreak, a typical infective has all susceptible neighbours except for one, namely its infector.

Note that $R_0$ for the dropping model is the same as for a Markovian SIR epidemic on a configuration model network without dropping of edges but with an increased recovery rate $\gamma +\omega $; see also Remark 5.3 and Sect. 8, where we discuss this modified model with increased recovery rate and its relation to the dropping model. Furthermore, from (2.1) we find that $R_0$ is a monotonically decreasing function of $\omega $, i.e. dropping edges always decreases the epidemic threshold parameter $R_0$; see also Fig. 5 in Sect. 10.4. For epidemics initiated by few infectives, this paper is concerned mainly with the case where $R_0>1$, since only then is there a possibility for a major outbreak to take place.

3 Effective degree formulation

In this section we analyse the stochastic SIR network epidemic model with preventive dropping that is described in Sect. 2. We do so by extending the ‘effective degree’ construction of an SIR epidemic on a configuration model network, introduced in Ball and Neal (2008), to incorporate dropping of edges. This allows us to prove a LLN and a functional CLT for the epidemic process (Theorems 3.1 and 3.2). Our proofs rely on the results of Ethier and Kurtz (1986) (see also Kurtz (1970, 1971)), and we adopt mostly the notation used in their work for ease of reference.

In the effective degree formulation the network is constructed as the epidemic progresses. The process starts with some individuals infective and the remaining individuals susceptible, but with none of the stubs paired up. For $i=1,2,\ldots ,N$, the effective degree of individual i is initially $d_i^N$ in the MR graph and $D_i$ in the NSW graph. Infected individuals behave in the following fashion. An infective, i say, transmits infection down its unpaired stubs at points of independent Poisson processes, each having rate $\beta $. When i transmits infection down a stub, that stub is paired with a stub (attached to individual j, say) chosen uniformly at random from all other unpaired stubs to form an edge. If $i \ne j$ then the effective degrees of both i and j are reduced by 1, otherwise the effective degree of i is reduced by 2. If individual j is susceptible then it becomes infective. If individual j is infective or recovered then nothing happens, apart from the edge being formed. The infective i also independently sends warning messages down its unpaired stubs at points of independent Poisson processes, each having rate $\omega $. When i sends a warning message down a stub, that stub is paired with a stub (attached to individual j, say) chosen uniformly at random from all other unpaired stubs. If individual j is susceptible then the stub from individual i and the stub from individual j are deleted, corresponding to dropping of an edge in the original model. If individual j is infective or recovered then the two stubs are paired to form an edge. In all three cases, the effective degrees of i and j are reduced as above. Individual i recovers independently at rate $\gamma $, keeping all, if any, of its unpaired stubs. Note that in the formulation in Ball and Neal (2008), when an infective recovers, its unpaired stubs, if any, are paired immediately; however, that is not necessary and indeed complicates analysis of the model.

Note also that we now use the equivalent formulation of the process for dropping edges of Sect. 2, where dropping is driven by infectives rather than by susceptibles, although it is clear that the two formulations are probabilistically equivalent. The change is required for the effective degree formulation to model dropping of edges correctly.

Before proceeding we introduce some notation. For $i=0,1,\ldots $ and $t \ge 0$, let $X_i^N(t)$ and $Y_i^N(t)$ be respectively the numbers of susceptibles and infectives having effective degree i at time t. We refer to such individuals as type-i susceptibles and type-i infectives. For $t \ge 0$, let $Z_E^N(t)$ be the number of unpaired stubs attached to recovered individuals at time t. (Note that it is not necessary to keep track of the effective degrees of recovered individuals since only the total number of unpaired stubs attached to recovered individuals, and not the effective degrees of the individuals involved, is required in the above effective degree formulation.) Let $\varvec{X}^N(t)=(X_0^N(t), X_1^N(t), \ldots )$, $\varvec{Y}^N(t)=(Y_0^N(t), Y_1^N(t), \ldots )$ and $\varvec{W}^N(t)=(\varvec{X}^N(t), \varvec{Y}^N(t), Z_E^N(t))$. (Unless stated to the contrary, vectors are row vectors in this paper.) Let $H=\mathbb {Z}_+^{\infty } \times \mathbb {Z}_+^{\infty }\times \mathbb {Z}_+$ denote the state space of $\{\varvec{W}^N(t)\}=\{\varvec{W}^N(t): t \ge 0\}$. Define unit vectors $\varvec{e}^\mathrm{S}_i, \varvec{e}^\mathrm{I}_i$$(i=0,1,\ldots )$ and $\varvec{e}^\mathrm{R}$ on H, where, for example, $\varvec{e}^\mathrm{S}_i$ has a one in the ith ‘susceptible component’ and zeros elsewhere, and $\varvec{e}^\mathrm{R}$ has a one in the ‘recovered component’ and zeros elsewhere. Let $\varvec{n}=(n_0^X,n_1^X,\ldots , n_0^Y,n_1^Y, \ldots , n_E^Z)$ denote a typical element of H, and let $n_E^X=\sum _{i=1}^{\infty }i n_i^X$ and $n_E^Y=\sum _{i=1}^{\infty } i n_i^Y$. Thus $n_E^X, n_E^Y$ and $n_E^Z$ are the total number of stubs attached to susceptibles, infectives and recovered individuals, respectively, when $\varvec{W}^N(t)=\varvec{n}$.

The process $\{\varvec{W}^N(t)\}$ is a continuous-time Markov chain with the following transition intensities, where an intensity is zero if $n_E^X+n_E^Y+n_E^Z=1$, since then there is only one stub remaining.

For $i,j=1,2,\ldots $,

(i)
type-i infective infects a type-j susceptible
$$\begin{aligned} q^N(\varvec{n}, \varvec{n}-\varvec{e}^\mathrm{I}_i+\varvec{e}^\mathrm{I}_{i-1}-\varvec{e}^\mathrm{S}_j+\varvec{e}^\mathrm{I}_{j-1})=\beta i n_i^Y \frac{j n_j^X}{n_E^X+n_E^Y+n_E^Z-1}; \end{aligned}$$
(ii)
type-i infective ‘infects’ a type-j infective, so an edge is formed
$$\begin{aligned} q^N(\varvec{n}, \varvec{n}-\varvec{e}^\mathrm{I}_i+\varvec{e}^\mathrm{I}_{i-1}-\varvec{e}^\mathrm{I}_j+\varvec{e}^\mathrm{I}_{j-1})=\beta i n_i^Y \frac{j n_j^Y}{n_E^X+n_E^Y+n_E^Z-1}; \end{aligned}$$
(iii)
type-i infective warns a type-j susceptible, so an edge is dropped
$$\begin{aligned} q^N(\varvec{n}, \varvec{n}-\varvec{e}^\mathrm{I}_i+\varvec{e}^\mathrm{I}_{i-1}-\varvec{e}^\mathrm{S}_j+\varvec{e}^\mathrm{S}_{j-1})=\omega i n_i^Y \frac{j n_j^X}{n_E^X+n_E^Y+n_E^Z-1}; \end{aligned}$$
(iv)
type-i infective ‘warns’ a type-j infective, so an edge is formed
$$\begin{aligned} q^N(\varvec{n}, \varvec{n}-\varvec{e}^\mathrm{I}_i+\varvec{e}^\mathrm{I}_{i-1}-\varvec{e}^\mathrm{I}_j+\varvec{e}^\mathrm{I}_{j-1})=\omega i n_i^Y \frac{j n_j^Y}{n_E^X+n_E^Y+n_E^Z-1}. \end{aligned}$$

For $i=1,2,\ldots $,

(v)
type-i infective ‘infects’ a recovered individual, so an edge is formed
$$\begin{aligned} q^N(\varvec{n}, \varvec{n}-\varvec{e}^\mathrm{I}_i+\varvec{e}^\mathrm{I}_{i-1}-\varvec{e}^\mathrm{R})=\beta i n_i^Y \frac{n_E^Z}{n_E^X+n_E^Y+n_E^Z-1}; \end{aligned}$$
(vi)
type-i infective ‘warns’ a recovered individual, so an edge is formed
$$\begin{aligned} q^N(\varvec{n}, \varvec{n}-\varvec{e}^\mathrm{I}_i+\varvec{e}^\mathrm{I}_{i-1}-\varvec{e}^\mathrm{R})=\omega i n_i^Y \frac{n_E^Z}{n_E^X+n_E^Y+n_E^Z-1}. \end{aligned}$$

For $i=0,1,\ldots $,

(vii)
type-i infective recovers
$$\begin{aligned} q^N(\varvec{n}, \varvec{n}-\varvec{e}^\mathrm{I}_i+i\varvec{e}^\mathrm{R})=\gamma n_i^Y. \end{aligned}$$

Remark 3.1

(Comments on the intensities) Note that although the above intensities are all independent of N, we index them by N since that is required so that $\{\varvec{W}^N(t)\}$ is a density dependent population process, see (3.6) and (3.7) below. Note also that the intensities in (ii) and (iv) above need to be modified slightly if $i=j$ to include the possibility that an infective ‘infects’ or ‘warns’ itself. For example, the intensity for a type-i infective ‘infecting’ itself is given by $q^N(\varvec{n}, \varvec{n}-\varvec{e}^\mathrm{I}_i+\varvec{e}^\mathrm{I}_{i-2})=\beta i (i-1)n_i^Y/(n_E^X+n_E^Y+n_E^Z-1)$, so this should be subtracted from the intensity in (ii) when $j=i$ and included instead in a new transition, (ii’) say. It is easily verified that that $q^N(\varvec{n}, \varvec{n}-\varvec{e}^\mathrm{I}_i+\varvec{e}^\mathrm{I}_{i-2})=O(1)$ as $N \rightarrow \infty $, so the modifications may be absorbed into the O(1 / N) term in (3.6) below and ignoring such transitions does not affect the LLNs and CLTs in the paper.

We now introduce notation for the jumps of $\{\varvec{W}^N(t)\}$. Note that the transitions in (ii) and (iv) above are identical, as are the transitions in (v) and (vi), so there are five types of jumps. For $i,j=1,2,\ldots $, let

$$\begin{aligned} \varvec{l}_{ij}^{(1)}= & {} -\varvec{e}^\mathrm{I}_i+\varvec{e}^\mathrm{I}_{i-1}-\varvec{e}^\mathrm{S}_j+\varvec{e}^\mathrm{I}_{j-1}, \end{aligned}$$

(3.1)

$$\begin{aligned} \varvec{l}_{ij}^{(2)}= & {} -\varvec{e}^\mathrm{I}_i+\varvec{e}^\mathrm{I}_{i-1}-\varvec{e}^\mathrm{I}_j+\varvec{e}^\mathrm{I}_{j-1},\end{aligned}$$

(3.2)

$$\begin{aligned} \varvec{l}_{ij}^{(3)}= & {} -\varvec{e}^\mathrm{I}_i+\varvec{e}^\mathrm{I}_{i-1}-\varvec{e}^\mathrm{S}_j+\varvec{e}^\mathrm{S}_{j-1}, \end{aligned}$$

(3.3)

for $i=1,2,\ldots $, let

$$\begin{aligned} \varvec{l}_{i}^{(4)}= & {} -\varvec{e}^\mathrm{I}_i+\varvec{e}^\mathrm{I}_{i-1}-\varvec{e}^\mathrm{R}, \end{aligned}$$

(3.4)

and, for $i=0,1,\ldots $, let

$$\begin{aligned} \varvec{l}_{i}^{(5)}= & {} -\varvec{e}^\mathrm{I}_i+i\varvec{e}^\mathrm{R}. \end{aligned}$$

(3.5)

Then, excluding self-infection and self-warning (see Remark 3.1), the set of possible jumps of $\{\varvec{W}^N(t)\}$ from a typical state $\varvec{n}\in H$ is $\varDelta =\cup _{k=1}^5 \varDelta _k$, where

$$\begin{aligned} \varDelta _k=&\left\{ \varvec{l}_{ij}^{(k)}:i,j=1,2,\ldots \right\} \quad (k=1,2,3), \quad \varDelta _4=\left\{ \varvec{l}_{i}^{(4)}:i=1,2,\ldots \right\} \\&\text{ and }\quad \varDelta _5=\left\{ \varvec{l}_{i}^{(5)}:i=0,1,\ldots \right\} . \end{aligned}$$

Let $\varvec{x}=(x_0,x_1,\ldots )$ and $\varvec{y}=(y_0,y_1,\ldots )\in \mathbb {R}_+^{\infty }$, $z_E \in \mathbb {R}_+$ and $\varvec{w}=(\varvec{x},\varvec{y},z_E)$. Further, let $x_E=\sum _{i=1}^{\infty } i x_i$, $y_E=\sum _{i=1}^{\infty } i y_i$ and $\eta _E=x_E+y_E+z_E$. For $\varepsilon >0$, let $H_{\varepsilon }^N=\{\varvec{n}\in H:\sum _{i=1}^{\infty }i n_i^X \ge \varepsilon N\}$. For any $\varepsilon >0$, the intensities of the jumps of $\{\varvec{W}^N(t)\}$ admit the form

$$\begin{aligned} q^N(\varvec{n},\varvec{n}+\varvec{l})=N\left[ \beta _{\varvec{l}}(N^{-1}\varvec{n})+O(1/N)\right] \qquad (\varvec{n}\in H_{\varepsilon }^N, \varvec{l}\in \varDelta ), \end{aligned}$$

(3.6)

with the functions $\beta _{\varvec{l}}$$(\varvec{l}\in \varDelta )$ given by

$$\begin{aligned} \beta _{\varvec{l}}(\varvec{w})=\beta _{\varvec{l}}(\varvec{x},\varvec{y},z_E) = {\left\{ \begin{array}{ll} \beta _{ij}^{(1)}(\varvec{x},\varvec{y},z_E)=\frac{\beta i y_i j x_j}{\eta _E}&{} \text { for } \varvec{l}=\varvec{l}_{ij}^{(1)} \in \varDelta _1, \\ \beta _{ij}^{(2)}(\varvec{x},\varvec{y},z_E)=\frac{(\beta +\omega )i y_i j y_j}{\eta _E}&{} \text { for } \varvec{l}=\varvec{l}_{ij}^{(2)} \in \varDelta _2,\\ \beta _{ij}^{(3)}(\varvec{x},\varvec{y},z_E)=\frac{\omega i y_i j x_j}{\eta _E}&{} \text { for } \varvec{l}=\varvec{l}_{ij}^{(3)} \in \varDelta _3,\\ \beta _{i}^{(4)}(\varvec{x},\varvec{y},z_E)=\frac{(\beta +\omega )i y_i z_E}{\eta _E}&{} \text { for } \varvec{l}=\varvec{l}_{i}^{(4)} \in \varDelta _4,\\ \beta _{i}^{(5)}(\varvec{x},\varvec{y},z_E)=\gamma y_i &{} \text { for } \varvec{l}=\varvec{l}_{i}^{(5)} \in \varDelta _5. \end{array}\right. } \end{aligned}$$

(3.7)

Remark 3.2

(Applying the theory of Ethier and Kurtz) The theory of density dependent population processes in Ethier and Kurtz (1986), Chapter 11, is for a class of continuous-time Markov chains whose state space is a subset of $\mathbb {Z}^d$ for some $d \in \mathbb {N}$. Thus to use this theory we need to assume that there is a maximum degree, i.e. that $d_{\max }<\infty $, where $d_{\max }=\sup \{k \ge 0 :p_k>0\}$. Then, for any $\varepsilon >0$, provided the sample paths of $\{\varvec{W}^N(t)\}$ remain within $ H_{\varepsilon }^N$, $\{\varvec{W}^N(t)\}$ is a density dependent population process; see Appendix B for details. We conjecture that our results continue to hold when the condition $d_{\max }<\infty $ is relaxed, provided suitable conditions are imposed on (i) the distribution of D and (ii), for epidemics on MR random graphs, the convergence of the empirical distribution of prescribed degrees.

The key theorems in Ethier and Kurtz (1986), Chapter 11, have their origin in Kurtz (1970, 1971). However, the proofs in Ethier and Kurtz (1986) are different from those in the earlier papers and the LLN is stronger in that it concerns almost sure convergence rather than convergence in probability. In Ethier and Kurtz (1986), the processes corresponding to $\{\varvec{W}^N(t)\}$$(N=1,2,\ldots )$ are defined on the same probability space by using a single set of independent unit-rate Poisson processes indexed by the possible jumps $\varvec{l}$.

A LLN and a functional CLT for density dependent population processes having countable state space are proved in Barbour and Luczak (2012a, b). They do not apply immediately to $\{\varvec{W}^N(t)\}$ as the jumps of $\{Z_E^N(t)\}$ are unbounded, though that can be overcome by replacing $\{Z_E^N(t)\}$ by $\{(Z_0^N(t), Z_1^N(t), \ldots )\}$, where $Z_i^N(t)$ is the number of recovered individuals having effective degree i at time t. We do not consider here sufficient conditions for the theorems in Barbour and Luczak (2012a, b) to be satisfied in the present setting, since $d_{\max }< \infty $ is satisfied for real-life epidemics. We note that LLNs for the Markov SIR epidemic ($\omega =0$) on an MR random graph with unbounded degree are given in Decreusefond et al. (2012) and Janson et al. (2014), and a functional CLT for the Markov SI epidemic ($\omega =\gamma =0$) on an MR random graph with unbounded degree is given in KhudaBukhsh et al. (2017). It seems likely that similar techniques used in the first two of those papers will apply to the present model. LLNs for the Markov SIR epidemic ($\omega =0$) on an MR random graph with bounded degree are given in Bohman and Picollelli (2012) and Barbour and Reinert (2013), the latter for epidemics started by a trace of infection. Indeed our model (assuming bounded degrees) fits into the framework of Barbour and Reinert (2013), Sect 3.2.

Following Ethier and Kurtz (1986), define the drift function $F(\varvec{w})=F(\varvec{x},\varvec{y},z_E)$ by

$$\begin{aligned} F(\varvec{x},\varvec{y},z_E)=\sum _{\varvec{l}\in \varDelta }\varvec{l}\beta _{\varvec{l}}(\varvec{x},\varvec{y},z_E). \end{aligned}$$

Substituting from (3.7) yields (see Appendix A for details)

$$\begin{aligned} F(\varvec{x},\varvec{y},z_E)=&\sum _{i=0}^{\infty }\left[ -\beta i x_i +\omega (-ix_i + (i+1) x_{i+1})\right] \frac{y_E}{\eta _E} \varvec{e}^\mathrm{S}_i \nonumber \\&+\sum _{i=0}^{\infty }\left[ (\beta +\omega )(-iy_i + (i+1) y_{i+1})\left( 1+\frac{y_E}{\eta _E}\right) \right. \nonumber \\&\quad \left. +\,\beta (i+1)x_{i+1}\frac{y_E}{\eta _E}-\gamma y_i \right] \varvec{e}^\mathrm{I}_i \nonumber \\&+\left[ \gamma y_E-(\beta +\omega )\frac{y_E z_E}{\eta _E}\right] \varvec{e}^\mathrm{R}. \end{aligned}$$

(3.8)

Consider a sequence of epidemics indexed by N, each having $Z_E^N(0)=0$. Suppose that $N^{-1}Y_i^N(0) {\mathop {\longrightarrow }\limits ^{\mathrm{a.s.}}}\varepsilon _i$ and $N^{-1}X_i^N(0) {\mathop {\longrightarrow }\limits ^{\mathrm{a.s.}}}p_i-\varepsilon _i$ as $N \rightarrow \infty $, where $\varepsilon _E=\sum _{i=1}^{\infty }i \varepsilon _i>0$ and ${\mathop {\longrightarrow }\limits ^{\mathrm{a.s.}}}$ denotes almost sure convergence. Note that for epidemics on NSW random graphs $\varvec{X}^N(0)$ is random and, depending on how the initial infectives are chosen, $\varvec{Y}^N(0)$ may also be random. The above almost sure convergence is reasonable for such epidemics since in an NSW random graph, the fraction of vertices of any given degree satisfies the strong law of large numbers. For epidemics on MR random graphs it is often more natural for $(\varvec{X}^N(0), \varvec{Y}^N(0))$ to be non-random, in which case $N^{-1}Y_i^N(0) \rightarrow \varepsilon _i$ and $N^{-1}X_i^N(0) \rightarrow p_i-\varepsilon _i$ as $N \rightarrow \infty $. Let $\varvec{x}(0)=(p_0-\varepsilon _0, p_1-\varepsilon _1,\ldots )$ and $\varvec{y}(0)=(\varepsilon _0,\varepsilon _1,\ldots )$. The following result holds for epidemics on both MR and NSW random graphs.

Theorem 3.1

(LLN for epidemic on network with dropping)

Suppose that $d_{\max }<\infty $ and $\varepsilon _E>0$. Then, for any $T>0$,

$$\begin{aligned} \lim _{N \rightarrow \infty } \sup _{0 \le t \le T} |N^{-1}\varvec{W}^N(t)-\varvec{w}(t)|=0\qquad \text{ almost } \text{ surely }, \end{aligned}$$

where $\varvec{w}(t)=(\varvec{x}(t),\varvec{y}(t),z_E(t))$ is given by the solution of the following system of ordinary differential equations (ODEs) with initial condition $\varvec{w}(0)=(\varvec{x}(0), \varvec{y}(0),0)$:

$$\begin{aligned} \dfrac{dx_i}{dt}&=-\beta \rho _E(t) i x_i+\omega \rho _E(t)(-i x_i +(i+1)x_{i+1})\quad (i=0,1,\ldots ), \end{aligned}$$

(3.9)

$$\begin{aligned} \dfrac{dy_i}{dt}&=(\beta +\omega )((i+1)y_{i+1}-i y_i)-\gamma y_i+(\beta +\omega )\rho _E(t)[(i+1)y_{i+1}-i y_i]\nonumber \\&\quad +\beta \rho _E(t)(i+1)x_{i+1}\quad (i=0,1,\ldots ),\end{aligned}$$

(3.10)

$$\begin{aligned} \dfrac{dz_E}{dt}&= \gamma y_E(t)-(\beta +\omega )\rho _E(t) z_E, \end{aligned}$$

(3.11)

where

$$\begin{aligned} \rho _E(t)=y_E(t)/\eta _E(t) \end{aligned}$$

(3.12)

and $\eta _E(t)=x_E(t)+y_E(t)+z_E(t)$.

Proof

See Appendix B. $\square $

Remark 3.3

(Solving the ODEs (3.9)–(3.11)) The solution of the system of ODEs (3.9)–(3.11) is considered in Sect. 5. Note that under the conditions of Theorem 3.1 the system of ODEs (3.9)–(3.11) is finite, so existence and uniqueness of a solution follow from standard results. We do not consider existence and uniqueness of solutions to ODEs (3.9)–(3.11) when the degrees are unbounded but acknowledge that further justification and some regularity conditions will be required. A similar comment applies to the time-transformed system of ODEs (4.3)–(4.5) in Sect. 4.

For the epidemic on an MR random graph, a functional CLT for the fluctuations of $\{\varvec{W}^N(t)\}$ about its deterministic limit $\{\varvec{w}(t)\}$ is also available using Ethier and Kurtz (1986), Theorem 11.2.3, as we formulate in Theorem 3.2. See Sect. 7 for discussion of a corresponding CLT for the epidemic on an NSW random graph.

Write $\varvec{w}$ as $(w_1,w_2,\ldots )$ and let $\partial F(\varvec{w})=[\partial _j F_i(\varvec{w})]$ denote the matrix of first partial derivatives of $F(\varvec{w})$. For $0 \le u \le t <\infty $, let $\varPhi (t,u)$ be the solution of the matrix ODE

$$\begin{aligned} \dfrac{\partial }{\partial t}\varPhi (t,u)=\partial F(\varvec{w}(t))\varPhi (t,u), \quad \varPhi (u,u)=I, \end{aligned}$$

(3.13)

where I denotes the identity matrix of appropriate dimension. Let

$$\begin{aligned} G(\varvec{w})=\sum _{\varvec{l}\in \varDelta } \varvec{l}^{\top }\varvec{l}\beta _{\varvec{l}}(\varvec{w}), \end{aligned}$$

where ${}^{\top }$ denotes transpose. Note that $\partial F(\varvec{w}(t))$ is the coefficient matrix of the time-inhomogeneous linear drift of the limiting Gaussian process $\{\varvec{V}(t)\}$ in Theorem 3.2 below and $\varPhi (t,u)$ enables a representation of $\{\varvec{V}(t)\}$ in terms of an Itô integral with respect to a time-inhomogeneous Brownian motion; see (7.1) in Sect. 7.

Theorem 3.2

(Functional CLT for epidemic on MR graph with dropping)

Suppose that $d_{\max }<\infty , \varepsilon _E>0$ and, for $i=0,1,\ldots ,d_{\max }$,

$$\begin{aligned} \lim _{N \rightarrow \infty } \sqrt{N}\left( N^{-1}Y_i^N(0) - \varepsilon _i\right) =v_i^{Y} \quad \text{ and }\quad \lim _{N \rightarrow \infty } \sqrt{N}\left( N^{-1}X_i^N(0) - p_i-\varepsilon _i\right) =v_i^{X}, \end{aligned}$$

(3.14)

where $\varvec{v}=(v_0^{X}, v_1^{X}, \ldots , v_{d_{\max }}^{X},v_0^{Y}, v_1^{Y}, \ldots , v_{d_{\max }}^{Y},0)$ is constant. Then

$$\begin{aligned} \sqrt{N}\left( \{N^{-1}\varvec{W}^N(t)\}-\{\varvec{w}(t)\}\right) \Rightarrow \{\varvec{V}(t)\} \quad \text{ as } N \rightarrow \infty , \end{aligned}$$

(3.15)

where $\Rightarrow $ denotes weak convergence and $\{\varvec{V}(t)\}=\{\varvec{V}(t):t \ge 0\}$ is a zero-mean Gaussian process with $\varvec{V}(0)=\varvec{v}$ and covariance function given by

$$\begin{aligned} \mathrm{cov}\left( \varvec{V}(t_1), \varvec{V}(t_2)\right) =\int _0^{\min (t_1,t_2)}\varPhi (t_1,u) G(\varvec{w}(u))\varPhi (t_2,u) ^\top \,\mathrm{d}u \qquad (t_1,t_2 \ge 0). \end{aligned}$$

Proof

See Appendix B, where a complete definition of $\Rightarrow $ is given. $\square $

Remark 3.4

(Computing the asymptotic variance) Theorem 3.2 yields immediately that

$$\begin{aligned} \varSigma (t)=\mathrm{var}\left( \varvec{V}(t)\right) =\int _0^t \varPhi (t,u) G(\varvec{w}(u))\varPhi (t,u) ^\top \,\mathrm{d}u. \end{aligned}$$

(3.16)

It follows from (3.13) and (3.16) that $\varSigma (t)$ satisfies the ODE

$$\begin{aligned} \dfrac{d\varSigma }{dt}=G(\varvec{w})+\partial F(\varvec{w}) \varSigma +\varSigma [\partial F(\varvec{w})]^{\top }, \end{aligned}$$

(3.17)

with initial condition $\varSigma (0)=0$. Thus, provided $d_{\max }<\infty $, $\varSigma (t)$ can be computed by numerically solving the ODEs (3.9)–(3.11) and (3.17) simultaneously.

4 Final outcome of epidemic on MR random graph

We conjecture a CLT for the final outcome of the epidemic with preventive dropping on an MR random graph (see Conjecture 4.1). In order to do so, we consider a random time-transformation of the real-time process.

For $t \ge 0$, let $X_E^N(t)=\sum _{i=1}^\infty i X_i^N(t)$ and $Y_E^N(t)=\sum _{i=1}^\infty i Y_i^N(t)$ be respectively the numbers of susceptible and infectious stubs at time t. Let $\tau ^N=\inf \{t \ge 0: Y_E^N(t)=0\}$, so the final number of susceptibles of different types is given by $\varvec{X}^N(\tau ^N)$. For $\delta \ge 0$, let $\tau ^N_{\delta }= \inf \{t \ge 0: N^{-1}Y_E^N(t)\le \delta \}$, so $\tau ^N=\tau ^N_0$. Recall the definition of $\varepsilon _E$ following (3.8). For $\delta \in (0, \varepsilon _E)$, we derive a CLT for $\varvec{W}^N(\tau ^N_{\delta })$; see Proposition 4.1. Assuming that Proposition 4.1 holds also when $\delta =0$ leads immediately to a CLT (Conjecture 4.1) for $X^N(\tau ^N)=\sum _{i=0}^\infty X_i^N(\tau ^N)$, and hence for the total number of individuals that are ultimately infected by the epidemic, since the latter is given by $N-\sum _{i=0}^\infty X_i^N(\tau ^N)$. A key step in deriving these CLTs is to consider the following random time-scale transformation of $\{\varvec{W}^N(t)\}$; cf. Ethier and Kurtz (1986), page 467, and Janson et al. (2014), Section 3, where similar transformations are used to derive a CLT for the final size of the so-called general stochastic epidemic and a LLN for the Markovian SIR epidemic on an MR random graph, respectively.

For $t \in [0, \tau ^N]$, let

$$\begin{aligned} A^N(t)=\int _0^t \frac{Y_E^N(u)}{X_E^N(u)+Y_E^N(u)+Z_E^N(u)} \,\mathrm{d}u, \end{aligned}$$

and let $\tilde{\tau }^N=A^N(\tau ^N)$. For $0 \le t \le \tilde{\tau }^N$, let $U^N(t)=\inf \{u \ge 0:A^N(u)=t\}$ and

$$\begin{aligned} \tilde{\varvec{W}}^N(t)=(\tilde{\varvec{X}}^N(t), \tilde{\varvec{Y}}^N(t), \tilde{Z}^N_E(t))=\varvec{W}^N\left( U^N(t)\right) . \end{aligned}$$

Then $\{\tilde{\varvec{W}}^N(t)\}=\{\tilde{\varvec{W}}^N(t): 0 \le t \le \tilde{\tau }^N\}$ is also a density dependent population process, having the same set $\varDelta $ of jumps as $\{\varvec{W}^N(t)\}$, and intensity functions $\tilde{\beta }_{\varvec{l}}$$(\varvec{l}\in \varDelta )$ given by

$$\begin{aligned} \tilde{\beta }_{\varvec{l}}(\varvec{w})=\tilde{\beta }_{\varvec{l}}(\varvec{x},\varvec{y},z_E) = {\left\{ \begin{array}{ll} {\tilde{\beta }}_{ij}^{(1)}(\varvec{x},\varvec{y},z_E)=\frac{\beta i y_i j x_j}{y_E}&{} \text { for } \varvec{l}=\varvec{l}_{ij}^{(1)} \in \varDelta _1, \\ {\tilde{\beta }}_{ij}^{(2)}(\varvec{x},\varvec{y},z_E)=\frac{(\beta +\omega )i y_i j y_j}{y_E}&{} \text { for } \varvec{l}=\varvec{l}_{ij}^{(2)} \in \varDelta _2,\\ {\tilde{\beta }}_{ij}^{(3)}(\varvec{x},\varvec{y},z_E)=\frac{\omega i y_i j x_j}{y_E}&{} \text { for } \varvec{l}=\varvec{l}_{ij}^{(3)} \in \varDelta _3,\\ {\tilde{\beta }}_{i}^{(4)}(\varvec{x},\varvec{y},z_E)=\frac{(\beta +\omega )i y_i z_E}{y_E}&{} \text { for } \varvec{l}=\varvec{l}_{i}^{(4)} \in \varDelta _4,\\ {\tilde{\beta }}_{i}^{(5)}(\varvec{x},\varvec{y},z_E)=\gamma y_i \frac{\eta _E}{y_E} &{} \text { for } \varvec{l}=\varvec{l}_{i}^{(5)} \in \varDelta _5. \end{array}\right. } \end{aligned}$$

(4.1)

Note that when $\{\varvec{W}^N(t)\}$ is in state $\varvec{n}=(n_0^X,n_1^X,\ldots , n_0^Y,n_1^Y, \ldots , n_E^Z)$, the clock in $\{\tilde{\varvec{W}}^N(t)\}$ runs at rate $(n_E^X+n_E^Y+n_E^Z)/n_E^Y$ times faster than the clock in $\{\varvec{W}^N(t)\}$, so the intensities in (4.1) are obtained by multiplying the corresponding intensities in (3.7) by $\eta _E/y_E$. The drift function associated with $\{\tilde{\varvec{W}}^N(t)\}$ is (cf. (3.8))

$$\begin{aligned} \tilde{F}(\varvec{x},\varvec{y},z_E)=&\sum _{i=0}^{\infty }\left[ -\beta i x_i +\omega (-ix_i + (i+1) x_{i+1})\right] \varvec{e}^\mathrm{S}_i \nonumber \\&+\sum _{i=0}^{\infty }\left[ (\beta +\omega )(-iy_i + (i+1) y_{i+1})\left( 1+\frac{\eta _E}{y_E}\right) \right. \nonumber \\&\qquad \qquad \left. +\beta (i+1)x_{i+1}-\gamma y_i\frac{\eta _E}{y_E} \right] \varvec{e}^\mathrm{I}_i \nonumber \\&+\left[ \gamma \eta _E-(\beta +\omega )z_E\right] \varvec{e}^\mathrm{R}. \end{aligned}$$

(4.2)

Let $\{\tilde{\varvec{w}}(t): t \ge 0\}=\{(\tilde{\varvec{x}}(t),\tilde{\varvec{y}}(t),\tilde{z}_E(t)): t \ge 0\}$ be the solution of the following system of ODEs, with initial condition $\tilde{\varvec{w}}(0)=(\varvec{x}(0), \varvec{y}(0),0)$:

$$\begin{aligned} \dfrac{d\tilde{x}_i}{dt}=&-\beta i \tilde{x}_i+\omega [-i \tilde{x}_i +(i+1)\tilde{x}_{i+1}], \end{aligned}$$

(4.3)

$$\begin{aligned} \dfrac{d\tilde{y}_i}{dt}=&\left\{ (\beta +\omega )[(i+1)\tilde{y}_{i+1}-i \tilde{y}_i]-\gamma \tilde{y}_i\right\} \frac{1}{\tilde{\rho }_E(t)}\nonumber \\&+(\beta +\omega )[(i+1)\tilde{y}_{i+1}-i \tilde{y}_i]+\beta (i+1)\tilde{x}_{i+1},\end{aligned}$$

(4.4)

$$\begin{aligned} \dfrac{d\tilde{z}_E}{dt}=&\gamma \tilde{\eta }_E(t)-(\beta +\omega ) \tilde{z}_E, \end{aligned}$$

(4.5)

where $i=0,1,\ldots $ and $\tilde{\rho }_E(t)=\tilde{y}_E(t)/\tilde{\eta }_E(t)$, $\tilde{\eta }_E(t)=\tilde{x}_E(t)+\tilde{y}_E(t)+\tilde{z}_E(t)$ with $\tilde{x}_E(t)= \sum _{i=1}^{\infty } i \tilde{x}_i(t)$ and $\tilde{y}_E(t)= \sum _{i=1}^{\infty } i \tilde{y}_i(t)$. The solution of this system is considered in Sect. 5.1.1. Let $\tilde{\tau }=\inf \{t \ge 0: \tilde{y}_E(t)=0\}$. It is shown in Appendix C that $\tilde{\tau }< \infty $, i.e. the duration of the limiting time-changed deterministic epidemic is finite, unless $\gamma =\omega =p_1-\varepsilon _1=0$.

We consider the same sequence of epidemics as for Proposition 3.1 in Sect. 3. Again, using Ethier and Kurtz (1986), Theorem 11.2.1, as $N \rightarrow \infty $, $\{N^{-1}\tilde{\varvec{W}}^N(t)\}$ converges almost surely over any finite time interval $[0, t_0]$, with $t_0 < \tilde{\tau }$, to $\{\tilde{\varvec{w}}(t)\}=\{\tilde{\varvec{w}}(t):0 \le t \le \tilde{\tau }\}$ (see Appendix B for further details of this and of the functional CLT given at (4.6)). Suppose further that the initial conditions satisfy (3.14) and $d_{\max }< \infty $. Then it follows using Ethier and Kurtz (1986), Theorem 11.2.3, that, for any $t_0 \in [0,\tilde{\tau })$,

$$\begin{aligned} \sqrt{N}\left( \{N^{-1}\tilde{\varvec{W}}^N(t):0 \le t \le t_0\}-\{\tilde{\varvec{w}}(t):0 \le t \le t_0\}\right) \Rightarrow \{\tilde{\varvec{V}}(t)\} \quad \text{ as } N \rightarrow \infty , \end{aligned}$$

(4.6)

where $\{\tilde{\varvec{V}}(t):0 \le t \le t_0\}$ is a zero-mean Gaussian process with $\tilde{\varvec{V}}(0)=\varvec{0}$ and variance given by

$$\begin{aligned} \tilde{\varSigma }_\mathrm{MR}(t)=\mathrm{var}\left( \tilde{\varvec{V}}(t)\right) =\int _0^t \tilde{\varPhi }(t,s) \tilde{G}(\tilde{\varvec{w}}(u))\tilde{\varPhi }(t,s) ^\top \,\mathrm{d}s, \end{aligned}$$

(4.7)

where

$$\begin{aligned} \tilde{G}(\tilde{\varvec{w}}(u))=\sum _{\varvec{l}\in \varDelta } \varvec{l}^{\top }\varvec{l}\tilde{\beta }_{\varvec{l}}(\tilde{\varvec{w}}(u)) \end{aligned}$$

(4.8)

and, for $0 \le s \le t <\infty $, $\tilde{\varPhi }(t,s)$ is the solution of the matrix ODE

$$\begin{aligned} \dfrac{\partial }{\partial t}\tilde{\varPhi }(t,u)=\partial \tilde{F}(\tilde{\varvec{w}}(t))\tilde{\varPhi }(t,u), \quad \tilde{\varPhi }(u,u)=I. \end{aligned}$$

(4.9)

For $t \ge 0$, let $\tilde{Y}^N_E(t)=\sum _{i=1}^{\infty } i \tilde{Y}_i^N(t)$. Further, for $\delta \ge 0$, let

$$\begin{aligned} \tilde{\tau }_{\delta }^N= \inf \{t \ge 0: N^{-1}\tilde{Y}^N_E(t)\le \delta \} \quad \text{ and } \quad \tilde{\tau }_{\delta }=\inf \{t \ge 0: \tilde{y}_E(t)=\delta \}, \end{aligned}$$

(4.10)

so both $\tilde{\tau }_{\delta }^N$ and $\tilde{\tau }_{\delta }$ are decreasing with $\delta $, $\tilde{\tau }^N_0=\tilde{\tau }^N$ and $\tilde{\tau }_0=\tilde{\tau }$. We show in Appendix C that $\tilde{\tau }_{\delta }<\infty $; it is clearly finite if $\tilde{\tau }< \infty $. Let $\varphi (\tilde{\varvec{w}})= \varphi (\tilde{\varvec{x}},\tilde{\varvec{y}},\tilde{z}_E)=\sum _{i=1}^{\infty } i \tilde{y}_i$$(=\tilde{y}_E)$, so

$$\begin{aligned} \tilde{\tau }_{\delta }^N= \inf \left\{ t \ge 0: \varphi \left( N^{-1}\tilde{\varvec{W}}^N(t)\right) \le \delta \right\} \quad \text{ and } \quad \tilde{\tau }_{\delta }=\inf \{t \ge 0: \varphi (\tilde{\varvec{w}}(t))=\delta \}. \end{aligned}$$

For fixed $\delta \in (0,y_E(0))$, application of Ethier and Kurtz (1986), Theorem 11.4.2, yields

$$\begin{aligned} \sqrt{N}\left( N^{-1} \tilde{\varvec{W}}^N(\tilde{\tau }_{\delta }^N)-\tilde{\varvec{w}}(\tilde{\tau }_{\delta })\right) {\mathop {\longrightarrow }\limits ^{\mathrm{D}}}\tilde{\varvec{V}}(\tilde{\tau }_{\delta })-&\frac{\nabla \varphi (\tilde{\varvec{w}}(\tilde{\tau }_{\delta }))\cdot \tilde{\varvec{V}}(\tilde{\tau }_{\delta })}{\nabla \varphi (\tilde{\varvec{w}}(\tilde{\tau }_{\delta })) \cdot \tilde{F}(\tilde{\varvec{w}}(\tilde{\tau }_{\delta }))} \tilde{F}(\tilde{\varvec{w}}(\tilde{\tau }_{\delta }))\nonumber \\&\quad \text{ as } N \rightarrow \infty , \end{aligned}$$

(4.11)

where $\cdot $ denotes inner vector product and ${\mathop {\longrightarrow }\limits ^{\mathrm{D}}}$ denotes convergence in distribution. This result requires that

$$\begin{aligned} \nabla \varphi (\tilde{\varvec{w}}(\tilde{\tau }_{\delta })) \cdot \tilde{F}(\tilde{\varvec{w}}(\tilde{\tau }_{\delta }))<0, \end{aligned}$$

(4.12)

which we show in Appendix C. Condition (4.12) ensures that $\tilde{\tau }_{\delta }$ is a proper crossing time. Note that

$$\begin{aligned} \tilde{\varvec{V}}(\tilde{\tau }_{\delta })-\frac{\nabla \varphi (\tilde{\varvec{w}}(\tilde{\tau }_{\delta }))\cdot \tilde{\varvec{V}}(\tilde{\tau }_{\delta })}{\nabla \varphi (\tilde{\varvec{w}}(\tilde{\tau }_{\delta })) \cdot \tilde{F}(\tilde{\varvec{w}}(\tilde{\tau }_{\delta }))} \tilde{F}(\tilde{\varvec{w}}(\tilde{\tau }_{\delta })) =\tilde{\varvec{V}}(\tilde{\tau }_{\delta })B_{\delta }^{\top }, \end{aligned}$$

where

$$\begin{aligned} B_{\delta }=I-\frac{\tilde{F}(\tilde{\varvec{w}}(\tilde{\tau }_{\delta }))\bigotimes \nabla \varphi (\tilde{\varvec{w}}(\tilde{\tau }_{\delta }))}{\nabla \varphi (\tilde{\varvec{w}}(\tilde{\tau }_{\delta })) \cdot \tilde{F}(\tilde{\varvec{w}}(\tilde{\tau }_{\delta }))} \end{aligned}$$

(4.13)

and $\bigotimes $ denotes outer vector product.

The following proposition follows immediately from (4.11) on noting that $\varvec{W}^N(\tau ^N_{\delta })=\tilde{\varvec{W}}^N(\tilde{\tau }_{\delta }^N)$ and $\varvec{w}(\tau _{\delta })= \tilde{\varvec{w}}(\tilde{\tau }_{\delta })$, where $\tau _{\delta }=\inf \{t \ge 0: y_E(t)=\delta \}$.

Proposition 4.1

(CLT for ‘final’ outcome of epidemic on MR graph with dropping) Suppose that $d_{\max }<\infty , \varepsilon _E>0, \delta \in (0,y_E(0))$ and (3.14) is satisfied. Then

$$\begin{aligned} \sqrt{N}\left( N^{-1} \varvec{W}^N(\tau ^N_{\delta })-\varvec{w}(\tau _{\delta })\right) {\mathop {\longrightarrow }\limits ^{\mathrm{D}}}N\!\left( \varvec{0}, \varSigma _{\mathrm{MR},\delta } \right) \quad \text{ as } N \rightarrow \infty , \end{aligned}$$

(4.14)

where

$$\begin{aligned} \varSigma _{\mathrm{MR},\delta } =B_{\delta }\tilde{\varSigma }_\mathrm{MR}(\tilde{\tau }_{\delta }) B_{\delta }^{\top } \end{aligned}$$

and $N\!\left( \varvec{0}, \varSigma _{\mathrm{MR},\delta } \right) $ denotes a multivariate normal distribution (of appropriate dimension) with mean vector $\varvec{0}$ and variance–covariance matrix $\varSigma _{\mathrm{MR},\delta }$.

Remark 4.1

(Extending Proposition 4.1to$\delta =0$) We are primarily interested in the case when $\delta =0$. The difficulty in extending Proposition 4.1 to include $\delta =0$ is that to apply Ethier and Kurtz (1986), Theorem 11.4.2, we need the weak convergence at (4.6) to hold for some $t_0>\tilde{\tau }$. Thus we need to extend the process $\{\tilde{\varvec{W}}^N(t)\}$ so that it is defined beyond time $\tilde{\tau }^N$. Now $\tilde{y}_E(t)<0$ for $t>\tilde{\tau }$ (see (5.11) in Sect. 5.1.1), so we need to extend the state space of $\{\tilde{\varvec{W}}^N(t)\}$ so that $\tilde{Y}_i^N(t)$$(i=0,1,\ldots ,d_{\max })$ can be negative. However, this cannot be done so that the conditions of the LLN and CLT theorems in Ethier and Kurtz (1986) are satisified. In particular, in any neighbourhood of $\{\varvec{w}:y_E=0\}$, the intensity functions $\tilde{\beta }_{\varvec{l}}$$(\varvec{l}\in \varDelta )$ are not bounded and the drift function $\tilde{F}$ is not Lipschitz continuous.

In work done while this paper was under review, the first author has found a way of overcoming this problem; see Ball (2018) which is in the setting of an SIR epidemic (without dropping of edges) with an arbitrary but specified infectious period distribution on configuration model networks. The theorems proved in Ball (2018) provide further (very strong) support for Conjecture 4.1 below, which assumes that Proposition 4.1 extends in the obvious way to include $\delta =0$, and for subsequent conjectures which are contingent on Conjecture 4.1. Note that the final outcome of the epidemic is given by $\tilde{\varvec{W}}^N(\tilde{\tau }^N)$ and the corresponding determinsitic outcome is $\tilde{\varvec{w}}(\tilde{\tau })$.

We use the term final outcome to refer to that of the effective degree formulation, in which the degrees of susceptibles can change owing to dropping of edges. This is sufficient to determine the final size of an epidemic. If the final numbers of susceptibles of various original degrees are required, the effective degree formulation can be extended to keep track of both the original and effective degrees of suceptibles.

Conjecture 4.1

(CLT for final outcome of epidemic on MR graph with dropping) Suppose that $d_{\max }<\infty , \varepsilon _E>0$ and (3.14) is satisfied. Then

$$\begin{aligned} \sqrt{N}\left( N^{-1} \tilde{\varvec{W}}^N(\tilde{\tau }^N)-\tilde{\varvec{w}}(\tilde{\tau })\right) {\mathop {\longrightarrow }\limits ^{\mathrm{D}}}N\!\left( \varvec{0}, \varSigma _{\mathrm{MR}} \right) \quad \text{ as } N \rightarrow \infty , \end{aligned}$$

(4.15)

where

$$\begin{aligned} \varSigma _{\mathrm{MR}} =B \tilde{\varSigma }_\mathrm{MR}(\tilde{\tau }) B^{\top } \end{aligned}$$

with B given by (4.13) with $\delta =0$.

Remark 4.2

(LLN for final outcome of SIR epidemic with preventive dropping) Conjecture 4.1 implies that $\varvec{X}^N(\tau ^N) {\mathop {\longrightarrow }\limits ^{\mathrm{p}}}\varvec{x}(\infty )$ as $N \rightarrow \infty $, where ${\mathop {\longrightarrow }\limits ^{\mathrm{p}}}$ denotes convergence in probability, i.e. the final outcome of the epidemic on an MR random graph obeys a weak LLN. The same conjecture holds also for the epidemic on an NSW random graph, using the theory in Sect. 7. Note that $\varvec{x}(\infty )=\tilde{\varvec{x}}(\tilde{\tau })$ and an expression for $\tilde{\varvec{x}}(\tilde{\tau })$ is given in Eq. (5.26) in Sect. 5.3.

Remark 4.3

(Explicit expression for asymptotic variance of final size) Note that $\tilde{\varSigma }_\mathrm{MR}(t)$, and hence $\varSigma _{\mathrm{MR},\delta }$, can be computed numerically as described for $\varSigma (t)$ in Remark 3.4. However, as detailed in Sect. 6 for the case $\delta =0$, it is possible to derive an almost fully explicit expression, as a function of $\tilde{\tau }_{\delta }$, for the asymptotic variance of the ‘final’ number of susceptibles. Moreover, the expression is fully explicit when $\omega =0$, i.e. when there is no dropping of edges, so the model reduces to a standard Markov SIR epidemic on an MR configuration model network.

5 Deterministic temporal behaviour and final size

In Sect. 5.1 we study the deterministic temporal behaviour of the effective degree model, described by the system of ODEs (3.9)–(3.11) given in Theorem 3.1, by considering first the corresponding time-transformed system (4.3)–(4.5). The resulting (partial) solution of this system is required to calculate the asymptotic variance of the final size in Sects. 6 and 7. Furthermore, the results of this section are used in Appendix C to prove that the conditions $\tilde{\tau }_{\delta }<\infty $ and (4.12), required for the application of Ethier and Kurtz (1986), Theorem 11.4.2, are satisfied. In Sect. 5.2, we connect the analysis of (4.3)–(4.5) to other approaches taken in the literature for the deterministic analysis of epidemics on configuration model networks. Finally, in Sect. 5.3, we give a characterization of the deterministic final size of the epidemic and consider the final size of epidemics initiated by a trace of infection in Proposition 5.1. We do not consider existence and uniqueness of solutions of the determinstic model when $d_{\max }=\infty $ (see Remark 3.3) but indicate where further justification is required for a proof.

5.1 Temporal behaviour

5.1.1 Time-transformed process

Consider the system of ODEs given by (4.3)–(4.5), with initial condition $\tilde{\varvec{x}}(0)=(p_0-\varepsilon _0, p_1-\varepsilon _1,\ldots )$, $\tilde{\varvec{y}}(0)=(\varepsilon _0,\varepsilon _1,\ldots )$ and $\tilde{z}_E(0)=0$. In this section we obtain explicit expressions for $\tilde{\varvec{x}}(t)$, ${\tilde{x}}_E(t)$, ${\tilde{y}}_E(t)$ and other variables pertaining to the fraction of susceptible, infectious, and recovered individuals in the population in the time-transformed process, while in Sect. 5.1.2 we connect these to corresponding variables in the real-time process.

Observe that the evolution of $\{\tilde{\varvec{x}}(t)\}$ is decoupled from the rest of the system. To solve (4.3), let $\{X(t)\}=\{X(t):t \ge 0\}$ denote a transient continuous-time Markov chain describing the evolution of a single susceptible individual, whose stubs are independently dropped at rate $\omega $ and independently infected at rate $\beta $. For $t \ge 0$, let X(t) be the number of stubs attached to the individual at time t, if it is still susceptible, otherwise let $X(t)=-1$. Let $p_{ji}(t)=\mathrm{P}(X(t)=i|X(0)=j)$, for $i,j=0,1,\dots $ and $t\ge 0$. By deriving the forward equation for $\{X(t)\}$ it is easily seen that, for $i=0,1,\ldots $, $\tilde{x}_i(t)=\sum _{j=i}^{\infty } \tilde{x}_j(0)p_{ji}(t)$$(t \ge 0$).

It is straightforward to calculate $p_{ji}(t)$, since stubs disappear (by dropping or infection) independently, the probability that a given initial stub has disappeared by time t is $1-\mathrm{e}^{-(\beta +\omega )t}$ and, given that a stub has disappeared, the probability its disappearance was caused by dropping is $p_{\omega }=\frac{\omega }{\beta +\omega }$. Thus,

$$\begin{aligned} p_{ji}(t)={\left\{ \begin{array}{ll} \left( {\begin{array}{c}j\\ i\end{array}}\right) \mathrm{e}^{-(\beta +\omega )it}\left( 1-\mathrm{e}^{-(\beta +\omega )t}\right) ^{j-i} p_{\omega }^{j-i} &{} \text { for } j \ge i,\\ 0&{} \text { for } j < i, \end{array}\right. } \end{aligned}$$

(5.1)

whence, for $i=0,1,\ldots $,

$$\begin{aligned} \tilde{x}_i(t)= & {} \sum _{j=i}^{\infty }\tilde{x}_j(0)p_{ji}(t)\nonumber \\= & {} \sum _{j=i}^{\infty }(p_j -\varepsilon _j)\left( {\begin{array}{c}j\\ i\end{array}}\right) \mathrm{e}^{-(\beta +\omega )it}\left( 1-\mathrm{e}^{-(\beta +\omega )t}\right) ^{j-i} p_{\omega }^{j-i}\nonumber \\= & {} \frac{ \mathrm{e}^{-(\beta +\omega )it}}{i!}\sum _{j=i}^{\infty }(p_j -\varepsilon _j) \frac{j!}{(j-i)!}\left[ p_{\omega }\left( 1-\mathrm{e}^{-(\beta +\omega )t}\right) \right] ^{j-i}\nonumber \\= & {} \frac{ \mathrm{e}^{-(\beta +\omega )it}}{i!} f_{D_{\varepsilon }}^{(i)}\left( p_{\omega }\left( 1-\mathrm{e}^{-(\beta +\omega )t}\right) \right) , \end{aligned}$$

(5.2)

where

$$\begin{aligned} f_{D_{\varepsilon }}(s)=\sum _{k=0}^{\infty } (p_k-\varepsilon _k)s^k\qquad (0 \le s \le 1), \end{aligned}$$

(5.3)

and $f_{D_{\varepsilon }}^{(i)}$ denotes the ith derivative of $f_{D_{\varepsilon }}$. It then follows that

$$\begin{aligned} \tilde{x}_E(t)= & {} \sum _{i=0}^{\infty } \frac{i \mathrm{e}^{-(\beta +\omega )it}}{i!} f_{D_{\varepsilon }}^{(i)}\left( p_{\omega }\left( 1-\mathrm{e}^{-(\beta +\omega )t}\right) \right) \nonumber \\= & {} \mathrm{e}^{-(\beta +\omega )t}\sum _{i=1}^{\infty } \frac{ \mathrm{e}^{-(\beta +\omega )(i-1)t}}{(i-1)!} f_{D_{\varepsilon }}^{(i)}\left( p_{\omega }\left( 1-\mathrm{e}^{-(\beta +\omega )t}\right) \right) \nonumber \\= & {} \mathrm{e}^{-(\beta +\omega )t}f_{D_{\varepsilon }}'\left( p_{\omega }\left[ 1-\mathrm{e}^{-(\beta +\omega )t}\right] +\mathrm{e}^{-(\beta +\omega )t}\right) \nonumber \\= & {} \mathrm{e}^{-(\beta +\omega )t}f_{D_{\varepsilon }}'\left( \psi (t)\right) , \end{aligned}$$

(5.4)

where

$$\begin{aligned} \psi (t)= p_{\omega }+(1-p_{\omega })\mathrm{e}^{-(\beta +\omega )t}. \end{aligned}$$

(5.5)

Differentiating (5.4) yields

$$\begin{aligned} \dfrac{d\tilde{x}_E}{dt}=-(\beta +\omega )\tilde{x}_E-\beta \mathrm{e}^{-2(\beta +\omega )t}f_{D_{\varepsilon }}''\left( \psi (t)\right) . \end{aligned}$$

(5.6)

Note that $\sum _{i=1}^{\infty } i[(i+1)\tilde{y}_{i+1}-i \tilde{y}_i]=-\tilde{y}_E$ and, using a similar argument to the derivation of (5.4),

$$\begin{aligned} \sum _{i=1}^{\infty } i (i+1)\tilde{x}_{i+1}(t)=\mathrm{e}^{-2(\beta +\omega )t}f_{D_{\varepsilon }}''\left( \psi (t)\right) . \end{aligned}$$

(5.7)

Multiplying (4.4) by i and summing over $i=1,2,\ldots $ yields

$$\begin{aligned} \dfrac{d\tilde{y}_E}{dt}=-(\beta +\omega +\gamma )\tilde{\eta }_E-(\beta +\omega )\tilde{y}_E+\beta \mathrm{e}^{-2(\beta +\omega )t}f_{D_{\varepsilon }}''\left( \psi (t)\right) . \end{aligned}$$

(5.8)

(This requires justifying and further conditions if $d_{\max }=\infty $. A similar comment applies to equations contingent on (5.8), such as (5.11).) Adding (5.6), (5.8) and (4.5) gives

$$\begin{aligned} \dfrac{d\tilde{\eta }_E}{dt}=-2(\beta +\omega )\tilde{\eta }_E, \end{aligned}$$

which, together with the initial condition $\tilde{\eta }_E(0)=\mu _D$, yields

$$\begin{aligned} \tilde{\eta }_E(t)=\mu _D\mathrm{e}^{-2(\beta +\omega )t}. \end{aligned}$$

(5.9)

Substituting (5.9) into (4.5) yields

$$\begin{aligned} \dfrac{d\tilde{z}_E}{dt}= \gamma \mu _D\mathrm{e}^{-2(\beta +\omega )t} -(\beta +\omega ) \tilde{z}_E, \qquad \tilde{z}_E(0)=0, \end{aligned}$$

whence

$$\begin{aligned} \tilde{z}_E(t)=\frac{\gamma }{\beta +\omega }\mu _D\mathrm{e}^{-(\beta +\omega )t}\left( 1-\mathrm{e}^{-(\beta +\omega )t}\right) . \end{aligned}$$

(5.10)

Thus

$$\begin{aligned} \tilde{y}_E(t)&=\tilde{\eta }_E(t)-\tilde{x}_E(t)-\tilde{z}_E(t)\nonumber \\&= \mathrm{e}^{-(\beta +\omega )t}\left( \frac{\beta +\omega +\gamma }{\beta +\omega }\mu _D\mathrm{e}^{-(\beta +\omega )t}-\frac{\gamma }{\beta +\omega }\mu _D-f_{D_{\varepsilon }}'\left( \psi (t)\right) \right) . \end{aligned}$$

(5.11)

Remark 5.1

(Fractions of susceptible, infectious, and recovered individuals) Although the above results are useful for analysing the final outcome of the epidemic, of greater practical interest is the evolution of the fractions of the population that are susceptible, infective and recovered individuals, which in the time-transformed process are given by $\tilde{x}(t)=\sum _{i=0}^{\infty } {\tilde{x}}_i(t), \tilde{y}(t)=\sum _{i=0}^{\infty } {\tilde{y}}_i(t)$ and $\tilde{z}(t)=\sum _{i=0}^{\infty } {\tilde{z}}_i(t)$, respectively. Summing (5.2) over $i=0,1,\ldots $ and using a similar argument to the derivation of (5.4) yields

$$\begin{aligned} \tilde{x}(t)=f_{D_{\varepsilon }}\left( \psi (t)\right) . \end{aligned}$$

(5.12)

Turning to $\tilde{y}(t)$, summing (4.4) over $i=1,2,\ldots $ and using (5.4) yields

$$\begin{aligned} \dfrac{d\tilde{y}}{dt}=-\frac{\gamma }{\tilde{\rho }_E(t)}\tilde{y}+\beta \mathrm{e}^{-(\beta +\omega )t}f_{D_{\varepsilon }}\left( \psi (t)\right) . \end{aligned}$$

(5.13)

Let $\varepsilon =\sum _{i=0}^{\infty } \varepsilon _i=\tilde{y}(0)$ and

$$\begin{aligned} c(t)=\int _0^t \frac{1}{{\tilde{\rho }}_E(u)}\,\mathrm{d}u. \end{aligned}$$

Then (5.13) has solution

$$\begin{aligned} \tilde{y}(t)=\mathrm{e}^{-\gamma c(t)}\varepsilon +\beta \int _0^t \mathrm{e}^{-[(\beta +\omega )u+\gamma (c(t)-c(u))]}f_{D_{\varepsilon }}\left( \psi (u)\right) \,\mathrm{d}u. \end{aligned}$$

(5.14)

We do not have a closed-form expression for the integral in (5.14), though it is straightforward to calculate $\tilde{y}(t)$ numerically using the ODE (5.13). Finally, note that $\tilde{z}(t)=1-\tilde{x}(t)-\tilde{y}(t)$.

5.1.2 Real-time process

Turning to the system of ODEs (3.9)–(3.11), which describe the limiting evolution of the epidemic as the population size $N \rightarrow \infty $, let

$$\begin{aligned} \xi (t)= \int _0^t \rho _E(u)\,\mathrm{d}u, \end{aligned}$$

(5.15)

where $\rho _E$ is given by (3.12). Then $\xi '(t)=\rho _E(t)$ and it follows that, for $t \ge 0$,

$$\begin{aligned} \varvec{w}(t)=\tilde{\varvec{w}}(\xi (t)), \end{aligned}$$

(5.16)

connecting the original process to the time-transformed process. Hence, $\xi '(t)=\tilde{\rho }_E(\xi (t))$, so (5.11) and (5.9) imply that $\xi (t)$ is determined by

$$\begin{aligned} \dfrac{d\xi }{dt}=1+\frac{\gamma }{\beta +\omega }\left( 1-\mathrm{e}^{(\beta +\omega )\xi }\right) -\mathrm{e}^{(\beta +\omega )\xi }\frac{f_{D_{\varepsilon }}'\left( \psi (\xi )\right) }{\mu _D}, \end{aligned}$$

(5.17)

together with $\xi (0)=0$. The ODE (5.17) does not seem to admit an explicit solution, although it is straightforward to solve numerically.

5.2 Connection to other approaches

In this section we consider other deterministic formulations of the preventive dropping model and make the connection to the effective degree approach (ODE system (3.9)–(3.11)). Our focus is on the deterministic variable $\theta (t)$ that is defined as follows:

$$\begin{aligned} \theta (t)={\mathscr {F}}(t)-\int _{0}^t \frac{f_{D_{\varepsilon }}'(\theta (u))}{\mu _D}{\mathscr {F}}'(t-u)\,\mathrm{d}u. \end{aligned}$$

(5.18)

Here, ${\mathscr {F}}(t)$ is the probability that an individual escapes infection from a given neighbour, up to at least t units of time after the neighbour became infected. In the Markovian SIR case with dropping of edges, this probability equals

$$\begin{aligned} {\mathscr {F}}(t) = \frac{\gamma +\omega }{\beta +\gamma +\omega }+\frac{\beta }{\beta +\gamma +\omega }\mathrm{e}^{-(\beta +\gamma +\omega )t}. \end{aligned}$$

(5.19)

Indeed, there are three competing events: transmission, ending of the infectious period, and informing the susceptible neighbour, that occur at rates $\beta $, $\gamma $, and $\omega $, respectively. We see immediately from the renewal equation for $\theta $, obtained by substituting (5.19) into (5.18), that one can also interpret dropping of edges as an increased recovery rate for the deterministic mean temporal behaviour since $\omega $ only appears as part of the sum $\gamma +\omega $ (see Remark 5.3 in Sect. 5.3). This aspect of the mean temporal behaviour may not be immediately clear from the system (3.9)–(3.11).

The variable $\theta $ can be interpreted as the probability that along a randomly chosen edge between two individuals, i and j say, there is no transmission from j to i before time t, given that no transmission occurred from individual i to j. The variable $\theta $ formed the basis for the edge-based compartmental models of Volz, Miller and co-workers (see e.g. Kiss et al. (2017) and references therein). Closely related to edge-based compartmental models is the binding site formulation presented in Leung and Diekmann (2016), where the relation to edge-based compartmental models is also explained. We use the binding site formulation in this section to state the renewal equation for the variable $\theta $, restricting ourselves to the Markovian SIR epidemic (in Leung and Diekmann (2016) ${\bar{x}}$ is used instead of $\theta $). In principle, the renewal Eq. (5.18) is far more general and allows for randomness in infectiousness beyond the Markovian setting, see Leung and Diekmann (2016) for details. Note that in the above works, the derivation of the equations describing the evolution of $\theta (t)$ is heuristic. Those equations are proved for the Markov SIR epidemic on a configuration model network, in the sense of a large population limit, in Decreusefond et al. (2012) and Janson et al. (2014); see also Barbour and Reinert (2013).

The variable $\theta $ relates to the effective degree formulation as follows:

$$\begin{aligned} \theta (t)=p_{\omega }+(1-p_{\omega })\mathrm{e}^{-(\beta +\omega )\xi (t)}=\psi (\xi (t)), \end{aligned}$$

(5.20)

where the functions $\psi $ and $\xi $ from the effective degree formulation are defined at (5.5) and (5.15), respectively. Indeed, Eq. (5.20) is expected from the interpretation of $\theta $: $p_{\omega }$ is the probability that the susceptible individual is informed by the infection status of a given neighbour before being infected by that neighbour, so the stub disappears through dropping, while $(1-p_{\omega })\mathrm{e}^{-(\beta +\omega )\xi (t)}$ is the probability that there is no dropping and the given stub has not disappeared at time $\xi (t)$ (where $\xi (t)$ accounts for the time-transformation, see (5.16)). One can check that (5.20) holds true by first transforming the renewal Eq. (5.18) into an ODE for $\theta $ by differentiating (and using (5.19)):

$$\begin{aligned} \dfrac{d\theta }{dt}=\beta \frac{f_{D_{\varepsilon }}'(\theta )}{\mu _D}-(\beta +\gamma +\omega )\theta +\gamma +\omega , \end{aligned}$$

(5.21)

with initial condition $\theta (0)=1$. Next, differentiating the right-hand-side of (5.20), and using (5.17), we find that $\psi (\xi )$ satisfies the ODE (5.21). Furthermore, the initial condition $\xi (0)=0$ implies that $\psi (\xi (0))=1$.

Finally, the Malthusian parameter r, the basic reproduction number $R_0$ and the final size of the epidemic are easily derived from the single renewal equation (5.18). Here we only state the expressions and refer to Leung and Diekmann (2016), Section 2.5, for details. In the limit of $\varepsilon \downarrow 0$ the Euler–Lotka characteristic equation is

$$\begin{aligned} 1&=-\frac{f''_{D}(1)}{\mu _D}\int _0^\infty \mathrm{e}^{-\lambda t}\mathscr {F}'(t)\,\mathrm{d}t\nonumber \\&=\left( \mu _D-1+\frac{\sigma _D^2}{\mu _D}\right) \int _0^\infty e^{-\lambda t}\beta \mathrm{e}^{-(\beta +\gamma +\omega )t}\,\mathrm{d}t. \end{aligned}$$

(5.22)

The Malthusian parameter r is the unique real root of (5.22) and a simple calculation yields

$$\begin{aligned} r=\beta \left( \mu _D-2+\frac{\sigma ^2_D}{\mu _D}\right) -\gamma -\omega , \end{aligned}$$

(5.23)

agreeing with Britton et al. (2016), equation (3). The basic reproduction number $R_0$ is obtained from (5.22) by evaluating the right hand side at $\lambda =0$, yielding the same expression as (2.1). The final size is discussed in Remark 5.2.

5.3 Final size

Recall that $\tilde{\tau }_{\delta }$ defined at (4.10) satisfies $\tilde{y}_E(\tilde{\tau }_{\delta })=\delta $. In particular, using (5.11), $\tilde{\tau }=\tilde{\tau }_0$ satisfies

$$\begin{aligned} \frac{\beta +\omega +\gamma }{\beta +\omega }\mu _D\mathrm{e}^{-(\beta +\omega )\tilde{\tau }}-\frac{\gamma }{\beta +\omega }\mu _D-f_{D_{\varepsilon }}'\left( \psi (\tilde{\tau })\right) =0. \end{aligned}$$

(5.24)

For later use, we rewrite (5.24) as

$$\begin{aligned} \left[ \frac{(\beta +\omega +\gamma )z-\gamma }{\beta +\omega }\right] \mu _D=f_{D_{\varepsilon }}'\left( \widetilde{\psi }(z)\right) , \end{aligned}$$

(5.25)

where $z=\mathrm{e}^{-(\beta +\omega )\tilde{\tau }}$ and $\widetilde{\psi }(z)=p_{\omega }+(1-p_{\omega })z$. Further, using (5.12) yields that the final proportion of the population that remains uninfected is given by

$$\begin{aligned} \tilde{x}(\tilde{\tau })=f_{D_{\varepsilon }}\left( \psi (\tilde{\tau })\right) . \end{aligned}$$

(5.26)

We let $\rho =1-\tilde{x}(\tilde{\tau })$ denote the fraction of the population ultimately infected in the limiting deterministic epidemic.

Let $\varepsilon _E=\sum _{i=1}^{\infty } i\varepsilon _i$. Then in the limit as $\varepsilon _E\downarrow 0$, i.e. for epidemics started by a trace of infection (or, more precisely, a trace of infected stubs), the final susceptible fraction is given by (5.26), where $\tilde{\tau }$ satisfies

$$\begin{aligned} \frac{\beta +\omega +\gamma }{\beta +\omega }\mu _D\mathrm{e}^{-(\beta +\omega )\tilde{\tau }}-\frac{\gamma }{\beta +\omega }\mu _D-f_D'\left( \psi (\tilde{\tau })\right) =0. \end{aligned}$$

(5.27)

We can now formulate the characterization for the final size $\rho $ of the epidemic. We illustrate the dependence of $\rho $ on the dropping rate $\omega $ in Sect. 10.4.

Proposition 5.1

(Deterministic final size) Suppose that $d_{\max }<\infty $.

(a)
Suppose that $\varepsilon _E>0$. Then the fraction of the population that is ultimately infected in the deterministic epidemic is given by
$$\begin{aligned} \rho =1-f_{D_{\varepsilon }}(s), \end{aligned}$$
(5.28)
where s is the unique solution in [0, 1) of
$$\begin{aligned} (\beta +\omega +\gamma )s-(\omega +\gamma )=\beta \mu _D^{-1} f_{D_{\varepsilon }}'(s). \end{aligned}$$
(5.29)
(b)
Suppose $R_0>1$. Then in the limit as $\varepsilon _E\downarrow 0$, the fraction of the population that is ultimately infected in the limiting deterministic epidemic is given by
$$\begin{aligned} \rho =1-f_D(s), \end{aligned}$$
(5.30)
where s is the unique solution in [0, 1) of
$$\begin{aligned} (\beta +\omega +\gamma )s-(\omega +\gamma )=\beta \mu _D^{-1} f_D'(s). \end{aligned}$$
(5.31)

Proof

(a) Suppose that $\varepsilon _E>0$. Let $s=\widetilde{\psi }(z)$, so $z=\frac{(\beta +\omega )s-\omega }{\beta }$. It then follows from (5.25) and (5.26) that s satisfies (5.29) and $\rho $ is given by (5.28). Let $g_1(s)=(\beta +\omega +\gamma )s-(\omega +\gamma )$ and $g_2(s)=\beta \mu _D^{-1} f_{D_{\varepsilon }}'(s)$. Then $g_1(0)\le 0 <g_2(0)$ and $g_1(1)>g_2(1)$, since $f_{D_{\varepsilon }}'(1)=\sum _{i=1}^{\infty } i(p_i-\varepsilon _i)<\sum _{i=1}^{\infty }i p_i =\mu _D$. Thus (5.29) has a unique solution in [0, 1) as $g_2$ is convex on [0, 1], since $g_2''(s)\ge 0$.

(b) Letting $\varepsilon _E\downarrow 0$ in (5.28) and (5.29) shows that $\rho $ is given by (5.30), where s satisfies (5.31). Let $g_1$ be as in (a) and $g_2(s)=\beta \mu _D^{-1} f_D'(s)$. Then $g_1(0)\le 0 <g_2(0)$ and $g_1(1)=g_2(1)$, since $\mu _D= f_D'(1)$. Further, $g_2$ is a convex function, so it follows that (5.31) has a solution in [0, 1) if and only if $g_1'(1)<g_2'(1)$ and moreover that solution is unique. Now $g_1'(1)=\beta +\omega +\gamma $ and $g_2'(1)=\beta \mu _D^{-1} f_D''(1)$, so $g_1'(1)<g_2'(1)$ if and only if $R_0=\frac{\beta }{\beta +\omega +\gamma }\mu _D^{-1} f_D''(1)>1$. $\square $

Remark 5.2

(Connection to the renewal Eq. (5.18)) Proposition 5.1(b) can also be derived by taking the limit $t\rightarrow \infty $ in (5.18):

$$\begin{aligned} \theta (\infty )&=\mathscr {F}(\infty )+(1-\mathscr {F}(\infty ))\frac{f'_{D}(\theta (\infty ))}{\mu _D}\nonumber \\&=\frac{\gamma +\omega }{\beta +\gamma +\omega }+\frac{\beta }{\beta +\gamma +\omega }\frac{f'_{D}(\theta (\infty ))}{\mu _D}, \end{aligned}$$

(5.32)

using (5.19), so $\theta (\infty )$ satisfies (5.31). Then, using (5.12) and (5.16), one obtains that the proportion $x(\infty )$ of the population that ultimately is susceptible agrees with (5.30).

Remark 5.3

(Increased recovery rate and no dropping) Observe that Eq. (5.18) for $\theta $ and (5.19) for ${\mathscr {F}}$ together imply immediately that the process of susceptibles in the deterministic model with recovery rate $\gamma $ and dropping rate $\omega $ depends on $(\gamma , \omega )$ only through their sum $\gamma +\omega $, since $\gamma $ and $\omega $ only appear in (5.19) through the sum $\gamma +\omega $. Furthermore, (5.20) relates the variable $\theta $ of the binding site formulation to the effective degree formulation through $\psi $ and $\xi $ defined at (5.5) and (5.15), respectively. Thus the LLN limit $\{\varvec{x}(t)\}$ describing the evolution of susceptibles classified by their effective degree for the model with dropping is the same as that for the model without dropping (i.e. the standard Markov SIR epidemic on a configuration model network) but with the recovery rate $\gamma $ increased to $\gamma +\omega $. In particular, this implies that the deterministic final size $\rho $ of the two models are the same, as is apparent immediately from Proposition 5.1. This invariance also holds for the basic reproduction number $R_0$ and Mathusian parameter r, as is clear from the formulae in Eqs. (2.1) and (5.23), respectively. Note however that the LLN limit $\{\varvec{y}(t)\}$ describing the infectives is not the same for these two models, since infectives recover more quickly in the model with increased recovery rate. Thus (as illustrated in Fig. 9 in Sect. 10.6) at any time $t>0$ there are more infectives in the deterministic model with dropping than in the corresponding model with increased recovery rate and no dropping. We revisit the model with increased recovery rate and no dropping in Sect. 8, where we focus on the probability of a major outbreak in the stochastic model with few initial infectives.

6 Asymptotic variance of final size of epidemic on an MR random graph

Recall that $X^N(\tau ^N)=\sum _{i=0}^{\infty } X_i^N(\tau ^N)$ denotes the number of susceptibles remaining at the end of the epidemic on an MR random graph. Thus $T^N_{\mathrm{MR}}=X^N(0)-X^N(\tau ^N)$ denotes the final size of the epidemic. Note that, in an obvious notation, $X^N(\tau ^N)= \tilde{X}^N(\tilde{\tau }^N)=\sum _{i=0}^{\infty } \tilde{X}_i^N(\tilde{\tau }^N)$. Let $\varvec{0}=(0,0,\ldots )$ and $\varvec{1}=(1,1,\ldots )$. Then, assuming the truth of Conjecture 4.1 for $d_{\max }=\infty $, the asymptotic variance of $N^{-\frac{1}{2}}T^N_{\mathrm{MR}}$ is given by

$$\begin{aligned} \sigma ^2_{\mathrm{MR}}(\beta ,\omega ,\gamma )=(\varvec{1}, \varvec{0},0)\varSigma _{\mathrm{MR},0}(\varvec{1}, \varvec{0},0)^{\top }. \end{aligned}$$

(6.1)

Suppose that $\varepsilon _E=\sum _{i=1}^{\infty } i\varepsilon _i>0$ and let z be the unique solution in [0, 1) of (5.25); cf. Proposition 5.1(a). The following proposition gives an almost fully explicit expression for the asymptotic variance $\sigma ^2_{\mathrm{MR}}(\beta ,\omega ,\gamma )$.

Proposition 6.1

(Asymptotic variance of final size of epidemic on MR graph with dropping) Suppose that $\varepsilon _E>0$ and $z>0$. Then,

$$\begin{aligned} \sigma ^2_{\mathrm{MR}}(\beta ,\omega ,\gamma )&=2\frac{(\beta +\omega +\gamma )[\gamma -\beta -\omega -(\beta +\omega +\gamma )z]}{(\beta +\omega )^2}\mu _D\tilde{b}(z)^2 z^2(1-z)\nonumber \\&\quad +\frac{\gamma }{\beta (\beta +\omega )} \mu _D\tilde{b}(z)^2 z[\beta -(2\beta +\omega )z]\nonumber \\&\quad +\frac{\gamma }{\beta [2(\beta +\omega )+\gamma ]}\tilde{b}(z)^2 z^2\left[ \beta (\sigma _D^2+\mu _D^2)+\omega \mu _D\right] \nonumber \\&\quad -\frac{\gamma [(\beta +\omega +\gamma )z-\gamma ]z}{[2(\beta +\omega )+\gamma ](\beta +\omega )} \mu _D\tilde{b}(z)+I_A+I_B+I_C+I_D, \end{aligned}$$

(6.2)

with

$$\begin{aligned} \tilde{b}(z)&=\frac{\beta \left[ \frac{(\beta +\omega +\gamma )z-\gamma }{\beta +\omega }\right] \mu _D}{z\left[ \beta f_{D_{\varepsilon }}''\left( \widetilde{\psi }(z)\right) -(\beta +\omega +\gamma )\mu _D\right] }, \end{aligned}$$

(6.3)

$$\begin{aligned} I_A&=\frac{1}{\beta +\omega } \int _z^1 \left[ \omega \left( \widetilde{\psi }_3(z,v)-1\right) ^2+\beta \widetilde{\psi }_3(z,v)^2\right] f_{D_{\varepsilon }}'\left( \widetilde{\psi }_2(z,v)\right) \,\mathrm{d}v,\end{aligned}$$

(6.4)

$$\begin{aligned} I_B&=2\frac{\omega z \tilde{b}(z)}{\beta +\omega } \int _z^1 \widetilde{\psi }_1(z,v)\left( \widetilde{\psi }_1(z,v)-1\right) \left( 1-\widetilde{\psi }_3(z,v)\right) f_{D_{\varepsilon }}''\left( \widetilde{\psi }_2(z,v)\right) \,\mathrm{d}v,\end{aligned}$$

(6.5)

$$\begin{aligned} I_C&=\frac{\beta z \tilde{b}(z)}{\beta +\omega } \int _z^1 \widetilde{\psi }_1(z,v)^2\left( \tilde{b}(z)z v^{-1}- 2\widetilde{\psi }_3(z,v)\right) f_{D_{\varepsilon }}''\left( \widetilde{\psi }_2(z,v)\right) \,\mathrm{d}v,\end{aligned}$$

(6.6)

$$\begin{aligned} I_D&=\frac{z^2 \tilde{b}(z)^2}{\beta +\omega } \int _z^1 \left[ \omega \left( \widetilde{\psi }_1(z,v)-1\right) ^2+\beta \widetilde{\psi }_1(z,v)^2 \right] \widetilde{\psi }_1(z,v)^2 f_{D_{\varepsilon }}^{(3)}\left( \widetilde{\psi }_2(z,v)\right) \,\mathrm{d}v,\nonumber \\ \end{aligned}$$

(6.7)

$\widetilde{\psi }_1(z,v)=p_{\omega }+(1-p_{\omega })zv^{-1},\widetilde{\psi }_2(z,v)=v\widetilde{\psi }_1(z,v)^2+p_{\omega }(1-v)$ and $\widetilde{\psi }_3(z,v)=\widetilde{\psi }_1(z,v)-\tilde{b}(z)z v^{-1}$.

Proof

The proof is rather long so only an outline is given here, with detailed calculations deferred to appendices. Let

$$\begin{aligned} \varvec{c}(\tilde{\tau },u)=(\varvec{1}, \varvec{0},0)B\tilde{\varPhi }(\tilde{\tau },u), \end{aligned}$$

(6.8)

where B is given by (4.13) with $\delta =0$. Then, using (4.7) and (4.8),

$$\begin{aligned} \sigma ^2_{\mathrm{MR}}(\beta ,\omega ,\gamma )= & {} \int _0^{\tilde{\tau }} \varvec{c}(\tilde{\tau },u) \tilde{G}(\tilde{\varvec{w}}(u))\varvec{c}(\tilde{\tau },u)^\top \,\mathrm{d}u,\nonumber \\= & {} \sum _{\varvec{l}\in \varDelta }\int _0^{\tilde{\tau }} \varvec{c}(\tilde{\tau },u)\varvec{l}^{\top }\varvec{l}\varvec{c}(\tilde{\tau },u)^\top \tilde{\beta }_{\varvec{l}}(\tilde{\varvec{w}}(u)) \,\mathrm{d}u. \end{aligned}$$

(6.9)

The rest of the proof involves showing that the right-hand side of (6.9) yields the expression (6.2) for $\sigma ^2_{\mathrm{MR}}(\beta ,\omega ,\gamma )$.

Recall that $\varDelta =\cup _{k=1}^5 \varDelta _k$ and note that $\varvec{c}(\tilde{\tau },u)\varvec{l}^{\top }$ is a scalar. It then follows that

$$\begin{aligned} \sigma ^2_{\mathrm{MR}}(\beta ,\omega ,\gamma )=\sum _{i=1}^5 \sigma ^2_i, \end{aligned}$$

(6.10)

where

$$\begin{aligned} \sigma ^2_i=\int _0^{\tilde{\tau }}\sum _{\varvec{l}\in \varDelta _i}\left( \varvec{c}(\tilde{\tau },u)\varvec{l}^{\top }\right) ^2 \tilde{\beta }_{\varvec{l}}(\tilde{\varvec{w}}(u))\,\mathrm{d}u. \end{aligned}$$

(6.11)

Evaluation of (6.11) requires $\varvec{c}(\tilde{\tau },u)$, which we now determine.

Let $a(\tilde{\tau })=\nabla \varphi (\tilde{\varvec{w}}(\tilde{\tau })) \cdot \tilde{F}(\tilde{\varvec{w}}(\tilde{\tau }))$. Observe that $\nabla \varphi (\tilde{\varvec{w}}(\tilde{\tau }))=(\varvec{0},\varvec{p},0)$, where $\varvec{p}=(0,1,2,\ldots )$, so using (4.2),

$$\begin{aligned} a(\tilde{\tau })= & {} -(\beta +\omega )[\tilde{y}_E(\tilde{\tau })+\tilde{\eta }_E(\tilde{\tau })]-\gamma \tilde{\eta }_E(\tilde{\tau })+\beta \sum _{i=1}^{\infty } i (i+1)\tilde{x}_{i+1}(\tilde{\tau })\qquad \end{aligned}$$

(6.12)

$$\begin{aligned}= & {} \mathrm{e}^{-2(\beta +\omega )\tilde{\tau }}\left[ \beta f_{D_{\varepsilon }}''\left( \psi (\tilde{\tau })\right) -(\beta +\omega +\gamma )\mu _D\right] , \end{aligned}$$

(6.13)

using $\tilde{y}_E(\tilde{\tau })=0$, (5.7) and (5.9). Also, using (4.2), $(\varvec{1}, \varvec{0},0)\tilde{F}(\tilde{\varvec{w}}(\tilde{\tau }))=-\beta \tilde{x}_E(\tilde{\tau })$, so

$$\begin{aligned} (\varvec{1}, \varvec{0},0)B=(\varvec{1}, b(\tilde{\tau }) \varvec{p},0), \end{aligned}$$

(6.14)

where

$$\begin{aligned} b(\tilde{\tau })=a(\tilde{\tau })^{-1}\beta \tilde{x}_E(\tilde{\tau }). \end{aligned}$$

(6.15)

Note from (4.2) that $\partial \tilde{F}(\tilde{\varvec{w}}(t))$ takes the partitioned form

$$\begin{aligned} \partial \tilde{F}(\tilde{\varvec{w}}(t))= \begin{bmatrix} \partial \tilde{F}_{XX}(\tilde{\varvec{w}}(t))&0&\varvec{0}^\top \\ \partial \tilde{F}_{YX}(\tilde{\varvec{w}}(t))&\partial \tilde{F}_{YY}(\tilde{\varvec{w}}(t))&\partial \tilde{F}_{YZ}(\tilde{\varvec{w}}(t)) \\ \partial \tilde{F}_{ZX}(\tilde{\varvec{w}}(t))&\partial \tilde{F}_{ZY}(\tilde{\varvec{w}}(t))&\partial \tilde{F}_{ZZ}(\tilde{\varvec{w}}(t)) \end{bmatrix}. \end{aligned}$$

(6.16)

It follows from (4.9) that $\tilde{\varPhi }(t,u)$ has the partitioned form

$$\begin{aligned} \tilde{\varPhi }(t,u)= \begin{bmatrix} \tilde{\varPhi }_{XX}(t,u)&0&\varvec{0}^\top \\ \tilde{\varPhi }_{YX}(t,u)&\tilde{\varPhi }_{YY}(t,u)&\tilde{\varPhi }_{YZ}(t,u) \\ \tilde{\varPhi }_{ZX}(t,u)&\tilde{\varPhi }_{ZY}(t,u)&\tilde{\varPhi }_{ZZ}(t,u) \end{bmatrix}. \end{aligned}$$

Thus, using (6.8) and (6.14), we have

$$\begin{aligned} \varvec{c}(\tilde{\tau },u)=\left( \varvec{1}\tilde{\varPhi }_{XX}(\tilde{\tau },u)+b(\tilde{\tau })\varvec{p}\tilde{\varPhi }_{YX}(\tilde{\tau },u), b(\tilde{\tau })\varvec{p}\tilde{\varPhi }_{YY}(\tilde{\tau },u), b(\tilde{\tau })\varvec{p}\tilde{\varPhi }_{YZ}(\tilde{\tau },u)\right) . \end{aligned}$$

We show in Appendix D that

$$\begin{aligned} \left( \varvec{1}\tilde{\varPhi }_{XX}(\tilde{\tau },u)\right) _j&=\psi (\tilde{\tau }-u)^j \qquad (j=0,1,\ldots ),\\ \left( \varvec{p}\,\tilde{\varPhi }_{YX}(\tilde{\tau },u)\right) _j&=\mathrm{e}^{-(\beta +\omega )(\tilde{\tau }-u)}j\left[ \frac{(\beta +\omega +\gamma ) \mathrm{e}^{-(\beta +\omega )(\tilde{\tau }-u)}-\gamma }{\beta +\omega }\right] \\&\qquad -\mathrm{e}^{-(\beta +\omega )(\tilde{\tau }-u)} j \psi (\tilde{\tau }-u)^{j-1}\qquad (j=0,1,\ldots ),\\ \varvec{p}\,\tilde{\varPhi }_{YY}(\tilde{\tau },u)&=\left( \frac{\beta +\omega +\gamma }{\beta +\omega }\mathrm{e}^{-2(\beta +\omega )(\tilde{\tau }-u)}- \frac{\gamma }{\beta +\omega }\mathrm{e}^{-(\beta +\omega )(\tilde{\tau }-u)}\right) \varvec{p},\\ \varvec{p}\,\tilde{\varPhi }_{YZ}(\tilde{\tau },u)&=-\frac{\beta +\omega +\gamma }{\beta +\omega }\mathrm{e}^{-(\beta +\omega )(\tilde{\tau }-u)}\left( 1-\mathrm{e}^{-(\beta +\omega )(\tilde{\tau }-u)}\right) , \end{aligned}$$

see (D.5), (D.26), (D.15) and (D.14), respectively. Hence,

$$\begin{aligned} \varvec{c}(\tilde{\tau },u)=\left( \varvec{c}_S(\tilde{\tau },u), h_I(\tilde{\tau },u) \varvec{p}, h_R(\tilde{\tau },u)\right) , \end{aligned}$$

(6.17)

where

$$\begin{aligned} h_I(\tilde{\tau },u)&=-\frac{b(\tilde{\tau })}{\beta +\omega }\mathrm{e}^{-(\beta +\omega )(\tilde{\tau }-u)}\left[ \gamma -(\beta +\omega +\gamma )\mathrm{e}^{-(\beta +\omega )(\tilde{\tau }-u)}\right] , \end{aligned}$$

(6.18)

$$\begin{aligned} h_R(\tilde{\tau },u)&=h_I(\tilde{\tau },u)-b(\tilde{\tau })\mathrm{e}^{-(\beta +\omega )(\tilde{\tau }-u)}, \end{aligned}$$

(6.19)

$$\begin{aligned} \varvec{c}_S(\tilde{\tau },u)&=(\tilde{c}_0(\tilde{\tau },u), \tilde{c}_1(\tilde{\tau },u), \ldots )+h_I(\tilde{\tau },u)\varvec{p}, \end{aligned}$$

(6.20)

with

$$\begin{aligned} \tilde{c}_j(\tilde{\tau },u)=\psi (\tilde{\tau }-u)^j-b(\tilde{\tau })\mathrm{e}^{-(\beta +\omega )(\tilde{\tau }-u)}j\psi (\tilde{\tau }-u)^{j-1} \qquad (j=0,1,\ldots ). \end{aligned}$$

We can now calculate $\sigma ^2_i$$(i=1,2,\ldots ,5)$ using (6.11), (6.17) and (4.1), and hence obtain $\sigma ^2_{\mathrm{MR}}(\beta ,\omega ,\gamma )$ using (6.10). The details are lengthy and are given in Appendix E. $\square $

Recall from Sect. 5.3 that if $\varepsilon _E>0$ then $\rho =1-f_{D_{\varepsilon }}\left( \widetilde{\psi }(z)\right) $, where z is the unique solution in [0, 1) of (5.25), and if $\varepsilon _E=0$ and $R_0>1$ then $\rho =1-f_D\left( \widetilde{\psi }(z)\right) $, where z is the unique solution in [0, 1) of (5.25) with $f_{D_{\varepsilon }}'$ replaced by $f_D'$; cf. Proposition 5.1.

Conjecture 6.1

(CLT for of final size of epidemic on MR graph with dropping)

(a)
Suppose that $\varepsilon _E>0, d_{\max }<\infty $ and $z>0$. Then,
$$\begin{aligned} \sqrt{N}\left( N^{-1}T^N_{\mathrm{MR}}- \rho \right) {\mathop {\longrightarrow }\limits ^{\mathrm{D}}}N(0,\sigma ^2_{\mathrm{MR}}(\beta ,\omega ,\gamma )) \quad \text{ as } N \rightarrow \infty , \end{aligned}$$
(6.21)
where $\sigma ^2_{\mathrm{MR}}(\beta ,\omega ,\gamma )$ is given by Proposition 6.1.
(b)
Suppose that $\varepsilon _E=0, d_{\max }<\infty , R_0>1$ and $z>0$. Then, in the event of a major outbreak, (6.21) holds with $D_{\varepsilon }$ replaced by D in (6.3)–(6.7).

Remark 6.1

(Proving Conjecture 6.1) Part (a) of Conjecture 6.1 follows immediately from Conjecture 4.1 and Proposition 6.1; see Remark 4.1 for how Conjecture 4.1 might be proved. Part (b) of Conjecture 6.1 is concerned with epidemics started by a trace of infection, i.e. with $\varepsilon _E=0$. Similar CLTs for the final size of a wide range of SIR epidemics (e.g. von Bahr and Martin-Löf (1980), Scalia-Tomba (1985) and Ball and Neal (2003)) suggest that letting $\varepsilon _E\downarrow 0$ in the CLT with $\varepsilon _E>0$ yields the correct CLT when $\varepsilon _E=0$ for epidemics that become established and lead to a major outbreak. This is proved for the SIR epidemic without dropping of edges on configuration model networks in Ball (2018); see Remark 4.1. A similar proof should hold for the present model with dropping of edges.

Remark 6.2

(The condition$z>0$) The condition $z>0$ in Proposition 6.1 and Conjecture 6.1 is required to ensure that $\tilde{\tau }<\infty $; recall from Sect. 5.3 that $z=\mathrm{e}^{-(\beta +\omega )\tilde{\tau }}$. Note from (5.28) that $z>0$ implies $\rho <1$, so the LLN and functional CLT in Ethier and Kurtz (1986), Chapter 11, hold for both the original and random time-scale transformed processes $\{\varvec{W}^N(t)\}$ and $\{\tilde{\varvec{W}}^N(t)\}$ provided there is a maximum degree; see Appendix B. Further, as explained in Appendix C, if $\varepsilon _E>0$ then $z=0$ if and only if $\gamma =\omega =f_{D_{\varepsilon }}'(0)=0$. Now $f_{D_{\varepsilon }}'(0)=0$ if and only if $p_1-\varepsilon _1=0$. Thus $z>0$ unless there is no recovery of infectives, no droping of edges and the limiting fraction of degree-1 susceptibles is 0. The same conclusion holds when $\varepsilon _E=0$.

7 Extension to iid degrees: epidemics on an NSW random graph

In this section we assume that the underlying network is constructed from a sequence $D_1,D_2,\ldots $ of independent and identically distributed copies of the random variable D, which describes the degree of a typical individual. The random variables $D_1,D_2,\ldots ,D_N$ are used to construct a network of N individuals, yielding a realisation of NSW random graph. The almost sure convergence results described in Theorem 3.1 (and the corresponding time-transformed almost sure convergence result of Sect. 4) still hold for the present model, as noted previously, but the functional CLT and the CLT for the final size (Theorem 3.2 and Conjecture 6.1) need modifying, as the variability in the empirical degree distribution of the random network (and hence in the initial conditions for the effective degree process $\{\varvec{W}^N(t)\}$) is of the same order of magnitude as that of the process itself. The modified results for epidemics on an NSW random graph are presented in Theorem 7.2 and Conjecture 7.1. In order to prove and motivate, respectively, these results we need a version of the functional CLT (Theorem 11.2.3) in Ethier and Kurtz (1986) that allows for asymptotically random initial conditions; see Theorem 7.1 below, which may be of more general interest beyond the present paper. Like the above-mentioned Theorem 11.2.3, Theorem 7.1 assumes a finite-dimensional state space, which for our application amounts to assuming that $d_{\max }< \infty $.

The limiting Gaussian process $\{\varvec{V}(t)\}$ in Theorem 3.2 admits the Itô integral representation

$$\begin{aligned} \varvec{V}(t)=\varPhi (t,0)\varvec{V}(0)+\int _0^t \varPhi (t,s) \,\mathrm{d}\varvec{U}(s) \qquad (t \ge 0), \end{aligned}$$

(7.1)

where $\{\varvec{U}(t)\}$ is a time-inhomogeneous Brownian motion (see Ethier and Kurtz (1986), Theorem 11.2.3, page 458) and $\varvec{V}(0)=\lim _{N \rightarrow \infty } \sqrt{N}\left( \varvec{W}^N(0)-\varvec{w}(0)\right) $. (To aid connection with Ethier and Kurtz (1986), $\varvec{V}(t)$ and $\varvec{U}(t)$ are now column vectors.) In Ethier and Kurtz (1986), Theorem 11.2.3, $\varvec{V}(0)$ is nonrandom. In Theorem 7.1 below, we allow $\varvec{V}(0)$ to be random.

Theorem 7.1

(Functional CLT for process with asymptotically random initial conditions) Suppose that the conditions of Ethier and Kurtz (1986), Theorem 11.2.3, are satisfied except that $\sqrt{N}\left( N^{-1}\varvec{W}^N(0)-\varvec{w}(0)\right) {\mathop {\longrightarrow }\limits ^{\mathrm{D}}}\varvec{V}(0)$ as $N \rightarrow \infty $, where $\varvec{V}(0) \sim N(\varvec{0},\varSigma _0)$. Then

$$\begin{aligned} \sqrt{N}\left( \{N^{-1}\varvec{W}^N(t)\}-\{\varvec{w}(t)\}\right) \Rightarrow \{\varvec{V}(t)\} \quad \text{ as } N \rightarrow \infty , \end{aligned}$$

(7.2)

where $\{\varvec{V}(t)\}=\{\varvec{V}(t):t \ge 0\}$ is a zero-mean Gaussian process with covariance function given, for $t_1,t_2 \ge 0$, by

$$\begin{aligned} \mathrm{cov}\left( \varvec{V}(t_1), \varvec{V}(t_2)\right) =\varPhi (t_1,0)\varSigma _0 \varPhi (t_2,0)^{\top } + \int _0^{\min (t_1,t_2)}\varPhi (t_1,u) G(\varvec{w}(u))\varPhi (t_2,u) ^{\top } \,\mathrm{d}u. \end{aligned}$$

(7.3)

Proof

It is easily seen that the proof of Ethier and Kurtz (1986), Theorem 11.2.3, continues to hold in this more general setting. In particular, the limiting process satisfies (7.1), where now $\varvec{V}(0) \sim N(\varvec{0},\varSigma _0)$, so $\{\varvec{V}(t)\}$ is a zero-mean Gaussian process. Further, the time-inhomogeneous Brownian motion $\{\varvec{U}(t)\}$ arises as the weak limit, as $N \rightarrow \infty $, of the (suitably centred and scaled) Poisson processes used to construct realisations of $\{\varvec{W}^N(t)\}$$(N=1,2,\ldots )$, and hence is independent of $\varvec{V}(0)$. The covariance function in (7.3) then follows immediately from (7.1). $\square $

Remark 7.1

(Computing the asymptotic variance) Setting $t_1=t_2=t$ in (7.3) and differentiating as in Remark 3.4 shows that $\varSigma (t) = \mathrm{var}(\varvec{V}(t))$ satisfies the ODE (3.17) but now with initial condition $\varSigma (0)=\varSigma _0$.

Remark 7.2

(Non-Gaussian limiting initial conditions) The covariance function (7.3) also holds when $\varvec{V}(0)$ is non-Gaussian, provided $\mathrm{E}[\varvec{V}(0)]=0$ and $\mathrm{var}(\varvec{V}(0))=\varSigma _0$, though of course $\{\varvec{V}(t)\}$ is no longer Gaussian.

Theorem 7.2

(Functional CLT for epidemic on NSW graph with dropping)

Suppose that as $N \rightarrow \infty , \left( N^{-1}(\varvec{X}^N(0),\varvec{Y}^N(0),Z^N_E(0))-(\varvec{x}(0),\varvec{y}(0),z_E(0))\right) {\mathop {\longrightarrow }\limits ^{\mathrm{D}}} N(\varvec{0},\varSigma _0)$. Then the same functional CLT holds as in the MR graph situation (Theorem 3.2), but with the covariance function of $\{\varvec{V}(t)\}$ changed in accordance with Eq. (7.3) and Remark 7.1 to reflect the randomness in the initial conditions.

Proof

The details of the proof, applying Theorem 7.1, are exactly the same as those in Appendix B where Theorem 11.2.3 of Ethier and Kurtz (1986) is applied to prove Theorem 3.2.

Remark 7.3

(The asymptotic variance matrix$\varSigma _0$) Note that $\varSigma _0$ in Theorem 7.2 depends on how the initial infectives are chosen from the population. An example and some discussion can be found in Sect. 10.1. Also note that Theorem 7.2 as presented allows for the possibility of some initially recovered individuals in the population. This is to simplify the presentation of the theorem; the assumption of no initially recovered individuals implies that $Z^N_E(0)=0$, from which it follows that $z_E(0)=0$ and the last row and column of $\varSigma _0$ have all entries 0.

Next, we use Theorem 7.1 to conjecture a CLT for the final size of the epidemic on an NSW random graph. For $N=1,2,\ldots $, let $D^{(N)}$ denote a random variable with distribution given by the empirical distribution of $D_1,D_2,\ldots ,D_N$, so

$$\begin{aligned} \mathrm{P}\left( D^{(N)}=k\right) =N^{-1}\sum _{i=1}^N 1_{\{D_i=k\}}\qquad (k=0,1,\ldots ). \end{aligned}$$

(7.4)

For $N=1,2,\ldots $, let $T^N_{\mathrm{NSW}}$ be the final size of the epidemic on an NSW configuration model random graph having N vertices. We consider epidemics initiated by a trace of infection and assume that the variability in the initial conditions is owing entirely to the variability in $D^{(N)}$.

Conjecture 7.1

(CLT for final size of epidemic on NSW graph with dropping)

Suppose that $\varepsilon _E=0$, $d_{\max }<\infty $, $R_0>1$ and $z>0$. Then, in the event of a major outbreak,

$$\begin{aligned} \sqrt{N}\left( N^{-1}T^N_{\mathrm{NSW}}- \rho \right) {\mathop {\longrightarrow }\limits ^{\mathrm{D}}}N(0,\sigma ^2_{\mathrm{NSW}}(\beta ,\omega ,\gamma )) \quad \text{ as } N \rightarrow \infty , \end{aligned}$$

(7.5)

where

$$\begin{aligned} \sigma _\mathrm{NSW}^2(\beta , \omega , \gamma )=\sigma _\mathrm{MR}^2(\beta , \omega , \gamma )+\sigma _0^2(\beta , \omega , \gamma ), \end{aligned}$$

(7.6)

with $\sigma _\mathrm{MR}^2(\beta , \omega , \gamma )$ given by (6.2) (replacing $D_{\varepsilon }$ by D in (6.3)–(6.7)) and

$$\begin{aligned}&\sigma _0^2(\beta , \omega , \gamma )\nonumber \\&\quad =f_D\left( \widetilde{\psi }(z)^2\right) -(1-\rho )^2+\tilde{b}(z)^2 \widetilde{\psi }(z)^2 z^2 f_D''\left( \widetilde{\psi }(z)^2\right) \nonumber \\&\qquad +\tilde{b}(z)f_D'\left( \widetilde{\psi }(z)^2\right) z\left[ z\tilde{b}(z)-2\widetilde{\psi }(z)\right] \nonumber \\&\qquad +\tilde{b}(z)^2z^2\left( \frac{(\beta +\omega +\gamma )z-\gamma }{\beta +\omega }\right) ^2\left( \sigma _D^2+\mu _D^2\right) \nonumber \\&\qquad -2\tilde{b}(z)^2 z^2 \mu _D\left( \frac{(\beta +\omega +\gamma )z-\gamma }{\beta +\omega }\right) \left[ \frac{(\beta +\omega +\gamma )z-\gamma }{\beta +\omega }+\frac{(\beta +\omega +\gamma )}{\beta }\widetilde{\psi }(z)\right] . \end{aligned}$$

(7.7)

We now give the argument leading to this conjecture. Suppose, for the time being, that $\varepsilon _E>0$ and consider the random time-scale transformed process $\{\tilde{\varvec{W}}^N(t)\}$, defined in Sect. 4, but now for the epidemic on an NSW network. Using (4.6) and Theorem 7.1, for any $t_0 \in [0,\tilde{\tau })$,

$$\begin{aligned} \sqrt{N}\left( \{N^{-1}\tilde{\varvec{W}}^N(t):0 \le t \le t_0\}-\{\tilde{\varvec{w}}(t):0 \le t \le t_0\}\right) \Rightarrow \{\tilde{\varvec{V}}_\mathrm{NSW}(t)\} \quad \text{ as } N \rightarrow \infty , \end{aligned}$$

where $\{\tilde{\varvec{V}}_\mathrm{NSW}(t):0 \le t \le t_0\}$ is a zero-mean Gaussian process with variance–covariance matrix at time t given by

$$\begin{aligned} \tilde{\varSigma }_\mathrm{NSW}(t)=\tilde{\varSigma }_\mathrm{MR}(t)+\tilde{\varSigma }^0(t); \end{aligned}$$

(7.8)

$\tilde{\varSigma }_\mathrm{MR}(t)$ is given by (4.7) and $\tilde{\varSigma }^0(t)=\varPhi (t,0)\varSigma _0 \varPhi (t,0)^{\top }$, with $\varSigma _0$ being defined as in Theorem 7.1. Then arguing as in the derivation of Proposition 4.1 yields, for any $\delta \in (0,y_E(0))$,

$$\begin{aligned} \sqrt{N}\left( N^{-1} \varvec{W}^N(\tau ^N_{\delta })-\varvec{w}(\tau _{\delta })\right) {\mathop {\longrightarrow }\limits ^{\mathrm{D}}}N\left( \varvec{0}, \varSigma _{\mathrm{NSW},\delta } \right) , \quad \text{ as } N \rightarrow \infty , \end{aligned}$$

(7.9)

where

$$\begin{aligned} \varSigma _{\mathrm{NSW},\delta } =B_{\delta }\tilde{\varSigma }_\mathrm{NSW}(\tilde{\tau }_{\delta }) B_{\delta }^{\top }. \end{aligned}$$

(7.10)

We now assume that (7.9) extends to the case $\delta =0$, so (7.5) holds with

$$\begin{aligned} \sigma ^2_{\mathrm{NSW}}(\beta ,\omega ,\gamma )=(\varvec{1}, \varvec{0},0)\varSigma _{\mathrm{NSW},0}(\varvec{1}, \varvec{0},0)^{\top }; \end{aligned}$$

cf. (6.1). Thus, using (7.8) and (7.10),

$$\begin{aligned} \sigma _\mathrm{NSW}^2(\beta , \omega , \gamma )=\sigma _\mathrm{MR}^2(\beta , \omega , \gamma )+\sigma _0^2(\beta , \omega , \gamma ), \end{aligned}$$

(7.11)

where

$$\begin{aligned} \sigma _0^2(\beta , \omega , \gamma )= & {} (\varvec{1}, \varvec{0},0)B \tilde{\varSigma }^0(\tilde{\tau }) B^{\top } (\varvec{1}, \varvec{0},0)^{\top }\nonumber \\= & {} (\varvec{1}, b(\tilde{\tau }) \varvec{p},0)\tilde{\varSigma }^0(\tilde{\tau })(\varvec{1}, b(\tilde{\tau }) \varvec{p},0)^{\top }, \end{aligned}$$

(7.12)

using (6.14).

We now assume that the above extends in the obvious way to $\varepsilon _E=0$ and calculate the resulting asymptotic variance $\sigma _\mathrm{NSW}^2(\beta , \omega , \gamma )$. Write

$$\begin{aligned} \tilde{\varSigma }^0(\tilde{\tau })= \begin{bmatrix} \tilde{\varSigma }^0_{XX}(\tilde{\tau })&\tilde{\varSigma }^0_{XY}(\tilde{\tau })&\tilde{\varSigma }^0_{XZ}(\tilde{\tau })\\ \tilde{\varSigma }^0_{YX}(\tilde{\tau })&\tilde{\varSigma }^0_{YY}(\tilde{\tau })&\tilde{\varSigma }^0_{YZ}(\tilde{\tau })\\ \tilde{\varSigma }^0_{ZX}(\tilde{\tau })&\tilde{\varSigma }^0_{ZY}(\tilde{\tau })&\tilde{\varSigma }^0_{ZZ}(\tilde{\tau }) \end{bmatrix}. \end{aligned}$$

(7.13)

Then

$$\begin{aligned} \sigma _0^2(\beta , \omega , \gamma )&=\varvec{1}\tilde{\varSigma }^0_{XX}(\tilde{\tau }) \varvec{1}^{\top } +2b(\tilde{\tau }) \varvec{p}\tilde{\varSigma }^0_{YX}(\tilde{\tau }) \varvec{1}^{\top } +b(\tilde{\tau })^2 \varvec{p}\tilde{\varSigma }^0_{YY}(\tilde{\tau }) \varvec{p}^{\top }\nonumber \\&=\lim _{N \rightarrow \infty } N\left[ \mathrm{var}\left( \tilde{x}^N(\tilde{\tau })\right) +2b(\tilde{\tau })\mathrm{cov}\left( \tilde{x}^N(\tilde{\tau }), \tilde{y}_E^N(\tilde{\tau })\right) +b(\tilde{\tau })^2 \mathrm{var}\left( \tilde{y}_E^N(\tilde{\tau })\right) \right] , \end{aligned}$$

(7.14)

where $\tilde{x}^N(\tilde{\tau })$ and $\tilde{y}_E^N(\tilde{\tau })$ are the deterministic ‘number’ of susceptible individuals and infectious half-edges, given by (5.26) and (5.11), respectively, but with (random) initial conditions induced by the NSW random graph on N vertices.

Recall the function $\psi $ and the random variable $D^{(N)}$, defined at (5.5) and (7.4), respectively. It follows from (5.26) that

$$\begin{aligned} \tilde{x}^N(\tilde{\tau })=f_{D^{(N)}}\left( \psi (\tilde{\tau })\right) \end{aligned}$$

(7.15)

and, from (5.11), that

$$\begin{aligned} \tilde{y}_E^N(\tilde{\tau })=\frac{\beta +\omega +\gamma }{\beta +\omega }\mu _{D^{(N)}} \mathrm{e}^{-2(\beta +\omega )\tilde{\tau }}-\frac{\gamma }{\beta +\omega }\mu _{D^{(N)}}\mathrm{e}^{-(\beta +\omega )\tilde{\tau }}-\mathrm{e}^{-(\beta +\omega )\tilde{\tau }}f_{D^{(N)}}'\left( \psi (\tilde{\tau })\right) . \end{aligned}$$

(7.16)

Let $\theta \in [0,1]$. Note, for example, that $f_{D^{(N)}}(\theta )=N^{-1}\sum _{i=1}^N \theta ^{D_i}$, so $\mathrm{var}\left( f_{D^{(N)}}(\theta )\right) =N^{-1}\left[ f_D(\theta ^2)-f_D(\theta )^2\right] $ and $f_{D^{(N)}}(\theta )$ is asymptotically normally distributed by the CLT for independent and identically distributed random variables. This and similar elementary calculations show that

$$\begin{aligned} \lim _{N \rightarrow \infty } N \mathrm{var}\left( f_{D^{(N)}}(\theta )\right)= & {} f_D(\theta ^2)-f_D(\theta )^2, \end{aligned}$$

(7.17)

$$\begin{aligned} \lim _{N \rightarrow \infty } N \mathrm{var}\left( \mu _{D^{(N)}}\right)= & {} \sigma ^2_D (=\mathrm{var}(D)),\end{aligned}$$

(7.18)

$$\begin{aligned} \lim _{N \rightarrow \infty } N \mathrm{var}\left( f_{D^{(N)}}'(\theta )\right)= & {} \theta ^2f_D''(\theta ^2)+f_D'(\theta ^2)-f_D'(\theta )^2,\end{aligned}$$

(7.19)

$$\begin{aligned} \lim _{N \rightarrow \infty } N \mathrm{cov}\left( \mu _{D^{(N)}}, f_{D^{(N)}}(\theta )\right)= & {} \theta f_D'(\theta )-\mu _Df_D(\theta ),\end{aligned}$$

(7.20)

$$\begin{aligned} \lim _{N \rightarrow \infty } N \mathrm{cov}\left( \mu _{D^{(N)}}, f_{D^{(N)}}'(\theta )\right)= & {} \theta f_D''(\theta )+f_D'(\theta )-\mu _Df_D'(\theta ),\end{aligned}$$

(7.21)

$$\begin{aligned} \lim _{N \rightarrow \infty } N \mathrm{cov}\left( f_{D^{(N)}}(\theta ), f_{D^{(N)}}'(\theta )\right)= & {} \theta f_D'(\theta ^2)-f_D(\theta )f_D'(\theta ). \end{aligned}$$

(7.22)

Recall that $z=\mathrm{e}^{-(\beta +\omega )\tilde{\tau }}$, $\widetilde{\psi }(z)=p_{\omega }+(1-p_{\omega })z$ and $\rho =1-f_D\left( \widetilde{\psi }(z)\right) $ (see (5.25) and Proposition 5.1(b)). Setting $\delta =0$ in (5.27) then gives (cf. (5.25))

$$\begin{aligned} f_D'\left( \widetilde{\psi }(z)\right) =\left[ \frac{(\beta +\omega +\gamma )z-\gamma }{\beta +\omega }\right] \mu _D. \end{aligned}$$

(7.23)

Then, using (7.15) and (7.17),

$$\begin{aligned} \lim _{N \rightarrow \infty } N \mathrm{var}\left( \tilde{x}^N(\tilde{\tau })\right) =f_D\left( \widetilde{\psi }(z)^2\right) -(1-\rho )^2, \end{aligned}$$

(7.24)

using (7.15), (7.16), (7.20) and (7.22)

$$\begin{aligned} \lim _{N \rightarrow \infty } N\mathrm{cov}\left( \tilde{x}^N(\tilde{\tau }), \tilde{y}_E^N(\tilde{\tau })\right) = z \widetilde{\psi }(z)\left[ \left( z+\frac{\gamma }{\beta +\omega }(z-1)\right) ^2\mu _D-f_D'\left( \widetilde{\psi }(z)^2\right) \right] \end{aligned}$$

(7.25)

and

$$\begin{aligned} \lim _{N \rightarrow \infty } N \mathrm{var}\left( \tilde{y}_E^N(\tilde{\tau })\right)&= z^2\left[ \left( z+\frac{\gamma }{\beta +\omega }(z-1)\right) ^2\left( \sigma _D^2+\mu _D^2-2\mu _D\right) \right. \nonumber \\&\quad +\,\widetilde{\psi }(z)^2f_D''\left( \widetilde{\psi }(z)^2\right) +f_D'\left( \widetilde{\psi }(z)^2\right) \nonumber \\&\quad \left. -\,2\left( z+\frac{\gamma }{\beta +\omega }(z-1)\right) \widetilde{\psi }(z)f_D''\left( \widetilde{\psi }(z)\right) \right] . \end{aligned}$$

(7.26)

It follows from (5.4), (6.13), (6.15) (all with $D_{\varepsilon }$ replaced by D) and (7.23), that

$$\begin{aligned} b(\tilde{\tau })=\frac{\beta \left[ \frac{(\beta +\omega +\gamma )z-\gamma }{\beta +\omega }\right] \mu _D}{z\left[ \beta f_D''\left( \widetilde{\psi }(z)\right) -(\beta +\omega +\gamma )\mu _D\right] }, \end{aligned}$$

(7.27)

so

$$\begin{aligned} b(\tilde{\tau })z f_D''\left( \widetilde{\psi }(z)\right) =\left[ (\beta +\omega +\gamma )\left( \frac{1}{\beta +\omega } +\frac{b(\tilde{\tau })}{\beta }\right) z-\frac{\gamma }{\beta +\omega }\right] \mu _D\ \end{aligned}$$

(7.28)

Note that $b(\tilde{\tau })=\tilde{b}(z)$, where $\tilde{b}(z)$ is given by (6.3) with $D_{\varepsilon }$ replaced by D. Substituting (7.24), (7.25) and (7.26) into (7.14), and invoking (7.23) and (7.28), yields (7.7) after a little algebra.

Remark 7.4

(Proving Conjecture 7.1) The two remaining steps required to prove Conjecture 7.1 are to justify (i) that (7.9) holds when $\delta =0$ and (ii) letting $\varepsilon _E\downarrow 0$ to obtain a CLT in the event of a major outbreak; cf. Remarks 4.1 and 6.1 which discuss these steps, respectively, for an epidemic on a MR random graph. As for epidemics on MR random graphs, the proofs in Ball (2018) for the SIR epidemic without dropping of edges on an NSW random graph should extend to the model with dropping of edges.

Remark 7.5

(Conjecture 7.1with$\varepsilon _E>0$) It is possible to extend Conjecture 7.1 to consider also the case $\varepsilon _E>0$ and obtain an analogous result to Conjecture 6.1(a). The asymptotic variance $\sigma _\mathrm{NSW}^2(\beta , \omega , \gamma )$ is given by (7.11) and (7.12) but now $\tilde{\varSigma }^0(\tilde{\tau })$ depends on how the initial infectives are chosen.

8 Increased recovery rate instead of dropping edges

Recall the equivalent formulation of the model with dropping in which an infectious individual sends out warnings to each neighbour independently at rate $\omega $, and susceptible individuals who receive such a warning immediately drop the corresponding edge. Consider a different but related model where, instead of sending out warnings to each neighbour at rate $\omega $independently, one single warning (at rate $\omega $) is used for all neighbours simultaneously (and all of them immediately drop the edges). The effect of this change is that edge droppings become dependent. However, from the point of view of a given susceptible neighbour the probability that it drops its edge to a given infective is unchanged. Thus, for a given susceptible, such a warning (where all susceptible neighbours drop their edges) has the same effect as if its infective neighbour recovered. Hence, we consider a model without dropping, but with recovery rate $\gamma +\omega $ instead of $\gamma $. We use $(\gamma ,\omega )$ and $(\gamma +\omega ,0)$ to refer to the two models, where the first component refers to the recovery rate and the second component to the dropping rate.

The above reasoning suggests that the dropping model $(\gamma ,\omega )$ should in some ways resemble this modified $(\gamma +\omega ,0)$ model. In fact, we have seen already in Sect. 5.3 (Remark 5.3) that, as $N\rightarrow \infty $, the scaled process of susceptibles in the two epidemics converge to the same LLN limit, and the same LLN holds for the final fraction getting infected. However, the two models are stochastically different, even for the process of susceptibles. The underlying reason for this difference is that independent warning signals makes the total number of infections less variable compared to having one warning signal to all susceptible neighbours. Consequently, the probability of a major outbreak is greater in the dropping model $(\gamma ,\omega )$ than in the modified $(\gamma +\omega ,0)$ model, as we prove in Theorem 8.1 below. Furthermore, we expect that the decrease in variability of the number of infections made by an infective decreases the limiting variance of both the whole process of susceptibles and the final size in the event of a major outbreak compared to the modified $(\gamma +\omega ,0)$ model. This is illustrated by the numerical results in Sect. 10.6.

Consider the beginning of an outbreak and an infectious individual having k susceptible neighbours. Let $Y_k^{(\gamma ,\omega )}$ be the number of these k neighbours that the infectious individual infects in the dropping model and define $Y_k^{(\gamma +\omega ,0)}$ similarly for the modified model. We compute the distributions of these two offspring random variables.

In the $(\gamma ,\omega )$ model we first condition on the infectious period I, which has an $\mathrm{Exp}(\gamma )$ distribution, i.e. an exponential distribution with rate $\gamma $ and hence mean $\gamma ^{-1}$. Given the duration of the infectious period $I=t$, the infectious individual infects each of its k susceptible neighbours independently, and a given neighbour is infected if and only if there is an infectious contact before t and the edge has not been dropped before then. Thus, conditional upon $I=t$, the probability that the given neighbour is infected is

$$\begin{aligned} \int _0^t \beta \mathrm{e}^{-(\beta +\omega )s}\,\mathrm{d}s = \frac{\beta }{\beta +\omega }\left( 1-\mathrm{e}^{-(\beta +\omega )t}\right) . \end{aligned}$$

Given $I=t$, the number of neighbours infected follows a binomial distribution with parameters k and the probability above. Hence, if we relax the conditioning, it follows that $Y_k^{(\gamma ,\omega )}$ has the mixed-Binomial distribution

$$\begin{aligned} Y_k^{(\gamma ,\omega )}\sim \mathrm{MixBin}\left( k,\ \frac{\beta }{\beta +\omega }\left( 1-\mathrm{e}^{-(\beta +\omega )I}\right) \right) , \text { where }I\sim \mathrm{Exp}(\gamma ). \end{aligned}$$

(8.1)

Setting $\gamma =\gamma +\omega $ and $\omega =0$ yields immediately that

$$\begin{aligned} Y_k^{(\gamma +\omega ,0)}\sim \mathrm{MixBin}\left( k,\ 1-\mathrm{e}^{-\beta I^*} \right) , \text { where }I^*\sim \mathrm{Exp}(\gamma +\omega ). \end{aligned}$$

(8.2)

It is not hard to show that

$$\begin{aligned} \mathrm{E}\left[ Y_k^{(\gamma ,\omega )}\right] = \mathrm{E}\left[ Y_k^{(\gamma +\omega ,0)}\right] = k\frac{\beta }{\beta +\gamma +\omega }, \end{aligned}$$

(8.3)

and that $\mathrm{var}\left( Y_k^{(\gamma ,\omega )}\right) < \mathrm{var}\left( Y_k^{(\gamma +\omega ,0)}\right) $.

Suppose that the epidemic is initiated by a single individual, chosen uniformly at random from the entire population, becoming infective. Then the number of susceptible neighbours of the initial infective is distributed according to D and, during the early stages of an outbreak in a large population, the number of susceptible neighbours of a subsequently infected individual is distributed as $\tilde{D}-1$ (see Sect. 2). These results hold for both models. It follows that the early stages of the dropping model in a large population can be approximated, on a generation basis, by a Galton–Watson branching process having offspring distribution that is a mixture of $Y_k^{(\gamma ,\omega )}$, $k=0,1,\ldots $, with mixing probabilities $p_k,$$k=0,1,\ldots ,$ in the initial generation and mixing probabilities $\tilde{p}_k,$$k=0,1,\ldots ,$ in all subsequent generations, where $\tilde{p}_k=\mu _D^{-1}(k+1)p_{k+1}$. (Note that $\tilde{p}_k$, $k=0,1,\ldots $, is the probability mass function of $\tilde{D}-1$.) A similar approximation holds for the modified model, except $Y_k^{(\gamma ,\omega )}$ is replaced by $Y_k^{(\gamma +\omega ,0)}$. These approximations can be made rigorous in the limit as the population size $N \rightarrow \infty $ by using a coupling argument, as in e.g. Ball and Sirl (2012). In the limit as $N \rightarrow \infty $, the probability of a major outbreak in the epidemic model is given by the probability that the corresponding approximating branching process does not go extinct.

The following lemma, proved in Appendix F, is required for the proof of Theorem 8.1 below, which shows that the probability of a major outbreak is greater in the dropping model than in the corresponding modified model. First, some more notation is required. For $k=1,2,\ldots $ let $f_k^{(\gamma ,\omega )}(s)=\mathrm{E}\left[ s^{Y_k^{(\gamma ,\omega )}}\right] $, $s \in \mathbb {R}$, denote the probability-generating function (PGF) of $Y_k^{(\gamma ,\omega )}$, the number of neighbours that an infectious individual with k susceptible neighbours infects in the early stages of the $(\gamma ,\omega )$ dropping model, and define $f_k^{(\gamma +\omega )}(s)$ similarly for the $(\gamma +\omega ,0)$ modified model. Let $f_0^{(\gamma ,\omega )}(s)=f_0^{(\gamma +\omega )}(s)=1$$(s \in \mathbb {R})$. Then, for the dropping model, the approximating branching process has offspring PGF $f^{(\gamma ,\omega )}(s)= \sum _{k=0}^{\infty } p_k f_k^{(\gamma ,\omega )}(s)$ in the first generation and offspring PGF $\tilde{f}^{(\gamma ,\omega )}(s)= \sum _{k=0}^{\infty } \tilde{p}_k f_k^{(\gamma ,\omega )}(s)$ in all subsequent generations, with analogous results holding for the $(\gamma +\omega ,0)$ model.

Lemma 1

Suppose that $\beta >0$ and $\gamma >0$. Then, for $k=0,1,\ldots $,

$$\begin{aligned} f_k^{(\gamma ,\omega )}(s) \le f_k^{(\gamma +\omega )}(s) \qquad (0 \le s \le 1), \end{aligned}$$

(8.4)

with strict inequality for all $s \in [0,1)$ when $k \ge 2$.

Theorem 8.1

(Probability of a major outbreak)

(a)
The basic reproduction number $R_0$ for both the dropping and modified models is given by (2.1).
(b)
Suppose that $R_0>1$ and the epidemic is initiated by a single infective individual, chosen uniformly at random from the population. Then the probability of a major outbreak $p_\mathrm{maj}^{(\gamma ,\omega )}$ for the $(\gamma ,\omega )$ dropping model is strictly greater than the probability of a major outbreak $p_\mathrm{maj}^{(\gamma +\omega ,0)}$ for the modified $(\gamma +\omega ,0)$ model, i.e.
$$\begin{aligned} p_\mathrm{maj}^{(\gamma ,\omega )}> p_\mathrm{maj}^{(\gamma +\omega ,0)}. \end{aligned}$$
(8.5)

Proof

The basic reproduction number is given by the offspring mean of a typical (i.e. non-initial generation) infective, so for both models, using (8.3),

$$\begin{aligned} R_0=\sum _{i=1}^{\infty }\tilde{p}_k k\frac{\beta }{\beta +\gamma +\omega }=\frac{\beta }{\beta +\gamma +\omega }\left( \mu _D+\frac{\sigma _D^2}{\mu _D}-1\right) , \end{aligned}$$

which proves part (a).

Turning to part (b), suppose that $R_0>1$. Then, using standard branching process theory gives that, for the dropping model, the probability of a major outbreak is given by $p_\mathrm{maj}^{(\gamma ,\omega )}=1-f^{(\gamma ,\omega )}(\sigma ^{(\gamma ,\omega )})$, where $\sigma ^{(\gamma ,\omega )}$ is the unique solution in [0, 1) of $\tilde{f}^{(\gamma ,\omega )}(s)=s$; cf. Kenah and Robins (2007) and Ball and Sirl (2013). Analogously, for the modified model, $p_\mathrm{maj}^{(\gamma +\omega ,0)}=1-f^{(\gamma +\omega ,0)}(\sigma ^{(\gamma +\omega ,0)})$, where $\sigma ^{(\gamma +\omega ,0)}$ is the unique solution in [0, 1) of $\tilde{f}^{(\gamma +\omega ,0)}(s)=s$.

Note that if $\mathrm{P}(D \ge 3)=0$ then $R_0 \le 1$, so $R_0>1$ implies that $\mathrm{P}(D \ge 3)>0$. It then follows immediately from Lemma 1 that $f^{(\gamma ,\omega )}(s)< f^{(\gamma +\omega ,0)}(s)$ and $\tilde{f}^{(\gamma ,\omega )}(s)< \tilde{f}^{(\gamma +\omega ,0)}(s)$ for all $s \in [0,1)$. Hence, since $\tilde{f}^{(\gamma ,\omega )}(1)=\tilde{f}^{(\gamma +\omega ,0)}(1)=1$ and the derivative of both $\tilde{f}^{(\gamma ,\omega )}$ and $\tilde{f}^{(\gamma +\omega ,0)}$ at $s=1$ is $R_0>1$ , it follows that $\sigma ^{(\gamma ,\omega )}<\sigma ^{(\gamma +\omega ,0)}$, whence $f^{(\gamma ,\omega )}(\sigma ^{(\gamma ,\omega )})< f^{(\gamma +\omega ,0)}(\sigma ^{(\gamma ,\omega )})< f^{(\gamma +\omega ,0)}(\sigma ^{(\gamma +\omega ,0)})$, as $f^{(\gamma +\omega ,0)}$ is strictly increasing on [0, 1]. Thus we obtain our statement (8.5). $\square $

Remark 8.1

(Other choices for initial infectives) Theorem 8.1 is easily extended to other assumptions concerning initial infectives; for example, to an epidemic initiated by $k>1$ infective individuals chosen uniformly at random from the population, or to an epidemic initiated by an infective of a specified degree.

9 No dropping of edges

We use the results from this paper to analyse the Markovian SIR epidemic on a configuration model network in Sect. 9.1 and the giant component of a configuration model network in Sect. 9.2. Note that in the case that there is no dropping of edges, i.e. $\omega =0$, we are in the setting of a Markovian SIR epidemic on a configuration model network. We treat the asymptotic variance of the final size for this model in Conjecture 9.1. If additionally, there is no recovery, i.e. $\omega =0=\gamma $, then in the event of a major outbreak, all individuals in the giant component eventually get infected. By using this we can apply the results from this paper to make statements about the size of the giant component in configuration model random graphs, see Conjecture 9.2.

9.1 SIR epidemic on configuration network

When $\omega =0$, the model reduces to the Markov SIR epidemic on a configuration model network. The formulae for the asymptotic variance of the final size for the epidemic on MR and NSW random networks simplify and become fully explicit given z, defined below.

Recall that $\varepsilon _E=\sum _{i=1}^{\infty } i\varepsilon _i$. If $\varepsilon _E>0$, then setting $\omega =0$ in Proposition 5.1(a) shows that $\rho =1-f_{D_{\varepsilon }}(z)$, where z is the unique solution in [0, 1) of

$$\begin{aligned} (\beta +\gamma )z-\gamma =\beta \mu _D^{-1}f_{D_{\varepsilon }}'(z). \end{aligned}$$

(9.1)

If $\varepsilon _E=0$, so the epidemic is started by a trace of infection, and $R_0>1$ then, using Proposition 5.1(b), $\rho =1-f_D(z)$, where z is the unique solution in [0, 1) of (9.1) with $f_{D_{\varepsilon }}'$ replaced by $f_D'$.

Let $T^N_{\mathrm{MRND}}$ and $T^N_{\mathrm{NSWND}}$ denote the final size of the epidemic, with no dropping of edges, on an MR and NSW configuration model random network, respectively, each having N vertices. Let $\sigma ^2_{\mathrm{MRND}}(\beta ,\gamma )=\sigma ^2_{\mathrm{MR}}(\beta , 0, \gamma )$ and $\sigma ^2_{\mathrm{NSWND}}(\beta ,\gamma )=\sigma ^2_{\mathrm{NSW}}(\beta , 0, \gamma )$ denote the asymptotic variance of the final size for the epidemic on an MR and an NSW configuration model random network, respectively. The following conjecture gives fully explicit formulae for $\sigma ^2_{\mathrm{MRND}}(\beta ,\gamma )$ and $\sigma ^2_{\mathrm{NSWND}}(\beta ,\gamma )$ as functions of z, which are derived in Appendix G.

Conjecture 9.1

(CLT for final size of epidemic on configuration model networks)

(a)
For the SIR epidemic on an MR random network,
1. (i)
  if $\varepsilon _E>0, d_{\max }< \infty $ and $z>0$, then,
  $$\begin{aligned} \sqrt{N}\left( N^{-1}T^N_{\mathrm{MRND}}- \rho \right) {\mathop {\longrightarrow }\limits ^{\mathrm{D}}}N(0,\sigma ^2_{\mathrm{MRND}}(\beta ,\gamma )) \quad \text{ as } N \rightarrow \infty , \end{aligned}$$
  (9.2)
  where
  (9.3)
  with
  $$\begin{aligned} h(\beta ,\gamma ,z)=\frac{\gamma -(\beta +\gamma )z}{\beta +\gamma -\beta \mu _D^{-1}f_{D_{\varepsilon }}''(z)}; \end{aligned}$$
  (9.4)
2. (ii)
  if $\varepsilon _E=0$, $d_{\max }<\infty $, $R_0>1$ and $z>0$, then, in the event of a major outbreak, (9.2) holds with $\varepsilon =0$ and $D_{\varepsilon }$ replaced by D in (9.3) and (9.4).
(b)
For the epidemic on an NSW network, suppose that $\varepsilon _E=0$, $d_{\max }<\infty $, $R_0>1$ and $z>0$. Then, in the event of a major outbreak,
$$\begin{aligned} \sqrt{N}\left( N^{-1}T^N_{\mathrm{NSWND}}- \rho \right) {\mathop {\longrightarrow }\limits ^{\mathrm{D}}}N(0,\sigma ^2_{\mathrm{NSWND}}(\beta ,\gamma )) \quad \text{ as } N \rightarrow \infty , \end{aligned}$$
(9.5)
where
$$\begin{aligned} \sigma ^2_{\mathrm{NSWND}}(\beta ,\gamma )&=\rho (1-\rho )+2h(\beta ,\gamma ,z)\left( \frac{\gamma -(\beta +\gamma )z}{\beta }\right) \left( \frac{\beta +\gamma }{2\beta +\gamma }\right) \mu _D\nonumber \\&\quad +\,h(\beta ,\gamma ,z)^2\left[ \frac{\gamma }{2\beta +\gamma } +\left( \frac{\gamma -(\beta +\gamma )z}{\beta }\right) ^2\right] (\sigma _D^2+\mu _D^2)\nonumber \\&\quad +\,2h(\beta ,\gamma ,z)^2\frac{(\beta +\gamma )[\gamma -(\beta +\gamma )z]}{\beta ^2}z\mu _D, \end{aligned}$$
(9.6)
and $h(\beta ,\gamma ,z)$ is given by (9.4), with $D_{\varepsilon }$ replaced by D.

Remark 9.1

(Proof of Conjecture 9.1) Although only conjectured here, Conjecture 9.1 (and hence also Conjecture 9.2 below) follow as a special case of Ball (2018), Theorems 2.1 and 2.2.

Remark 9.2

(Epidemics on NSW random network with$\varepsilon _E>0$) As for the model with dropping, Conjecture 9.1(b) can be extended to include the case $\varepsilon _E>0$; the asymptotic variance $\sigma ^2_{\mathrm{NSWND}}(\beta ,\gamma )$ then depends on how the initial infectives are chosen (cf. Remark 7.5).

9.2 Configuration model giant component

If $\omega =\gamma =0$ then the epidemic ultimately infects all individuals in all components of the random network that contain at least one initial infective. Thus, under suitable conditions, in the limit as $\varepsilon \downarrow 0 $, setting $\gamma =0$ in Conjecture 9.1(a)(ii) and (b) leads to CLTs for the size of the largest connected (i.e. giant) component in MR and NSW configuration model random graphs, respectively.

Let $\kappa =\mathrm{E}[D(D-2)]=\sigma _D^2+\mu _D^2-2\mu _D$ and note that, setting $\omega =\gamma =0$ in the formula for $R_0$, $\kappa >0$ if and only if $R_0>1$. The above configuration model random graphs possess a giant component if and only if $\kappa >0$, see e.g. Durrett (2007), Theorem 3.1.3. Suppose that $\kappa >0$. Setting $\gamma =0$ and $D_{\varepsilon }=D$ in (9.1) shows that z is now given by the unique solution in [0, 1) of

$$\begin{aligned} \mu _Dz=f_D'(z). \end{aligned}$$

(9.7)

and the asymptotic fraction of vertices in the giant components of the above configuration model random graphs is given by $\rho =1-f_D(z)$.

Let $R^N_{\mathrm{MR}}$ and $R^N_{\mathrm{NSW}}$ denote respectively the size of the giant component in an MR and an NSW random graph on N vertices. Setting $\gamma =0$ in Conjecture 9.1 (a)(ii) and (b) yields the following conjecture.

Conjecture 9.2

(CLT for the size of the giant component)

Suppose that $\kappa >0$, $d_{\max }<\infty $ and $p_1>0$. Then,

(a)
for an MR random graph,
$$\begin{aligned} \sqrt{N}\left( N^{-1}R^N_{\mathrm{MR}}-\rho \right) {\mathop {\longrightarrow }\limits ^{\mathrm{D}}}N(0,\sigma ^2_{\mathrm{MRGC}})\quad \text{ as } N \rightarrow \infty , \end{aligned}$$
(9.8)
where
$$\begin{aligned} \sigma ^2_{\mathrm{MRGC}}=&1-\rho -f_D(z^2)-\frac{z^2}{\left[ 1-\mu _D^{-1}f_D''(z)\right] }\left[ 2f_D'(z^2)-\mu _D\right] \nonumber \\&-\frac{z^2}{\left[ 1-\mu _D^{-1}f_D''(z)\right] ^2}\left[ f_D'(z^2)+z^2f_D''(z^2)-2\mu _Dz^2\right] ; \end{aligned}$$
(9.9)
(b)
for an NSW random graph,
$$\begin{aligned} \sqrt{N}\left( N^{-1}R^N_{\mathrm{NSW}}-\rho \right) {\mathop {\longrightarrow }\limits ^{\mathrm{D}}}N(0,\sigma ^2_{\mathrm{NSWGC}})\quad \text{ as } N \rightarrow \infty , \end{aligned}$$
(9.10)
where
$$\begin{aligned} \sigma ^2_{\mathrm{NSWGC}}&=\rho (1-\rho )+\frac{z^2}{\left[ 1-\mu _D^{-1}f_D''(z)\right] }\mu _D\nonumber \\&\quad +\frac{z^4}{\left[ 1-\mu _D^{-1}f_D''(z)\right] ^2}\left( \sigma _D^2+\mu _D^2-2\mu _D\right) . \end{aligned}$$
(9.11)

It is easily verified that the expressions (9.9) and (9.11) for the asymptotic variances $\sigma ^{2}_{\mathrm{MRGC}}$ and $\sigma ^{2}_{\mathrm{NSWGC}}$ coincide with those first obtained by Ball and Neal (2017) using a completely different method; a CLT was conjectured in that paper and subsequently proved for an MR random graph in Barbour and Röllin (2019), and for both MR and NSW random graphs by Janson (2018). The results proved in these three papers allow for unbounded degrees under suitable conditions.

10 Numerical examples

In this section we give numerical results which exemplify some of the limit theorems and support some of the conjectures presented in the paper and give examples of using those limiting results for approximation. Such approximations follow from our asymptotic results in exactly the same way as the approximate distribution of the sum of independent and identically distributed random variables follows from the classical CLT. For example, we can use Eq. (3.15) in Theorem 3.2 to say that, for large N, the distribution of $\varvec{W}^N(t)$ is approximately that of $N \varvec{w}(t) + \sqrt{N}\varvec{V}(t)$, from which approximations for the mean and variance of $\varvec{W}^N(t)$ follow immediately from the corresponding properties of the Gaussian process $\varvec{V}(t)$. We also explore numerically some aspects of the behaviour of the model we have analysed, using the asymptotic results we have derived. In our numerical examples relating to the temporal evolution of the epidemic we look only at the mean and variance of the number of infective individuals in the population, we do not investigate any other quantities of interest or explicitly investigate the covariance/correlation structure in any way.

In this section we use the notation $D\sim \text{ Poi }(\lambda )$ or $D\sim \text{ Geo }(p)$ to denote that the network degree distribution follows a standard Poisson or Geometric distribution with mass functions $p_k=\mathrm{e}^{-\lambda }\lambda ^k/k!$ ($k=0,1,\ldots $) or $p_k=p(1-p)^k$ ($k=0,1,\ldots $), respectively. In particular we shall use repeatedly in our examples the distributions $D\sim \text{ Poi }(5)$ and $D\sim \text{ Geo }(1/6)$. These distributions both have mean 5 and their standard deviations are $\sqrt{5}\approx 2.2$ and $\sqrt{30}\approx 5.5$ respectively.

First, however, we discuss some of the issues that arise in relation to the numerical implementation of our analytical results.

10.1 Implementation

The numerical implementation of our asymptotic results concerning epidemic final size (the formulae laid out in Propositions 5.1 and 6.1 and Conjecture 7.1) is straightforward, involving root-finding, numerical integration and derivatives up to order 3 of the degree distribution PGF $f_D$. For the degree distributions we use, we have $f_D^{(i)}(s) = \lambda ^i e^{-\lambda (1-s)}$ when $D\sim \text{ Poi }(\lambda )$ and $f_D^{(i)}(s) = \frac{i! p (1-p)^i}{(1-(1-p)s)^{i+1}}$ when $D\sim \text{ Geo }(p)$, with both formulae being valid for $i=0,1,\ldots $. In the final size examples we always use the version of these results with $\varepsilon _E=0$, i.e. we work under the asymptotic regime where the epidemic starts with a trace of infection. The results concerning the evolution of the epidemic through time (Theorems 3.1, 3.2 and 7.2) warrant discussion of some issues that arise.

An obvious first issue is initial conditions $(\varvec{x}(0),\varvec{y}(0),z_E(0))$ and $\varSigma (0)$ for the system of ODEs given by (3.9)–(3.11) together with the variance/covariance-related matrix ODE (3.17) (see also Remark 7.3). In an MR network we take the initial infectives to comprise a fixed number of individuals, with numbers of individuals of the various degrees chosen in the same proportions as they are present in the whole population. In an NSW network we choose the required number of initial infectives uniformly at random from the population. Ideally we might want the initial conditions to represent a large outbreak initiated by few initial infectives; this is a rather more complex situation and could be addressed using the results of Ball and House (2017).

Let $\varepsilon $ be the proportion of individuals initially infected in the limit as $N\rightarrow \infty $. It is straightforward to show that $x_i(0) = \lim _{N\rightarrow \infty } N^{-1} E[X_i^N(0)] = p_i (1-\varepsilon )$ and similarly that $y_i(0) = p_i \varepsilon $ and $z_E(0) = 0$ (cf. the paragraph immediately before Theorem 3.1; with a NSW network these limits hold almost surely). Turning to $\varSigma (0)$, in the case of an MR network we have chosen the initial conditions so that there is no variability; i.e. all elements of $\varSigma _\mathrm{MR}(0)$ are zero. With an NSW network there is variability in the initial conditions; to characterise it we let $i_0^N = [\varepsilon N]$ be the number of initially infected individuals (or assume that $i_0^N$ is a function of N such that $\lim _{N\rightarrow \infty } N^{-1}i_0^N = \varepsilon $) and use the notation $\sigma _{x_i,x_j}(0)$ for the (i, j)-th element of the submatrix of $\varSigma _\mathrm{NSW}(0)$ corresponding to the susceptible elements (cf. the partitioning in (7.13)), so for example $\sigma _{x_i,y_j}(0) = \lim _{N\rightarrow \infty } N^{-1} \mathrm{cov}(X_i^N(0),Y_j^N(0))$. We find that the following elements of $\varSigma _\mathrm{NSW}(0)$ are non-zero: for all i, $\sigma _{x_i,x_i}(0) = p_i (1-p_i)(1-\varepsilon )$ and $\sigma _{y_i,y_i}(0) = p_i (1-p_i) \varepsilon $; and for all $i\ne j$, $\sigma _{x_i,x_j}(0) = -p_ip_j(1-\varepsilon )$ and $\sigma _{y_i,y_j}(0) = -p_ip_j\varepsilon $. Derivations can be found in Appendix H.

After solving the ODE systems numerically we can calculate the asymptotic means and variances for other quantities of interest, for example to approximate $I^N(t)$, the number of infected individuals at time t, we use

$$\begin{aligned} \lim _{N\rightarrow \infty } N^{-1}E[I^N(t)] = \sum _{i=0}^\infty y_i(t) \quad \text{ and } \quad \lim _{N\rightarrow \infty } N^{-1/2} \mathrm{var}[I^N(t)] = \sum _{i=0}^{\infty } \sum _{j=0}^{\infty } \sigma _{y_i,y_j}(t). \end{aligned}$$

The final ODE-related issue is choosing the value of M, the maximum degree, to use when the degree distribution does not have finite support. (This amounts to setting $x_i(t)=y_i(t)=0$ for all $t\ge 0$ and $i=M+1,M+2,\ldots $.) The upper bound M needs to be large enough that the approximation is accurate but not so large that the systems of ODEs are impractical to solve numerically (the number of ODEs grows like $M^2$). To decide on an appropriate value for M we compare plots of the asymptotic means and variances of I(t) (i.e. the solid lines in the lower plots of Fig. 1), increasing M until there is no observable difference in these plots. We also compare the predicted relative ‘final’ size $x(0)-x(t_\mathrm{end})$ from the numerical ODE solution to the asymptotic final size predicted by Proposition 5.1. For the degree distributions we find that $M=15$ is sufficient when $D\sim \text{ Poi }(5)$ and $M=50$ when $D\sim \text{ Geo }(1/6)$.

Simulation of the epidemic process is relatively straightforward. Given a sequence of degrees (either [MR] a specified sequence or [NSW] independent realisations from the distribution $\{p_k\}$) we (i) generate the network, (ii) choose initial infectives, (iii) spread the epidemic on the network. There is therefore randomness in each simulation deriving not just from the evolution of the epidemic, but also the graph construction and, in the case of an NSW graph, the degree sequence. When we calculate confidence intervals (CIs) for quantities associated with simulations of the temporal evolution of the epidemic they are calculated independently for each time point; i.e. they are not confidence bands for the process. Endpoints of CIs for standard deviations are calculated as the square roots of the endpoints of standard symmetric (in terms of probability) CIs for the variance.

10.2 Convergence and approximation of temporal properties

First we demonstrate numerically some of the limit theorems from earlier sections, showing both how the convergence is realised and thus how these limit theorems can be used for approximation. We give examples only with an NSW graph construction, but much the same observations apply in the MR graph scenario.

In Fig. 1 we demonstrate using Theorem 7.2 for approximation of the temporal evolution of the epidemic, comparing simulated trajectories of the prevalence $I^N(t)$ (for $N=1000$) versus time t of the model with predictions from the functional central limit theorem, for a Poisson and a Geometric degree distribution. The upper plots show the simulated trajectories together with the mean and a central 95% probability band predicted by the CLT; they suggest that the approximation is fairly good. The lower plots compare the mean and standard deviation of the prevalence through time with the LLN and CLT based asymptotic predictions.

In Fig. 2 we investigate the convergence of the distribution of $I^N(t)$ to its $N\rightarrow \infty $ limit at three time points $t_1$, $t_2$ and $t_3$. The times are chosen so that $t_2$ is close to the time of peak prevalence and $t_1$ and $t_3$ are when prevalence is increasing and decreasing, respectively, at a level roughly half that of the peak prevalence. (Effectively we are examining the upper-right plot of Fig. 1 in detail at these three time points.) In this figure we have used a geometric degree distribution, but very similar conclusions are obtained using different distributions. This convergence is further investigated/demonstrated in Fig. 3, where, separately for each of the same three time points, we plot the Kolmogorov distance between the empirical and asymptotic distributions of the number of infectives against population size N.

Broadly speaking, Fig. 1 and similar plots for other population sizes, together with Figs. 2 and 3 and similar plots for other degree distributions, show that the predicted convergence is apparent, but seems slower for the later times. Even for quite small population sizes in the low hundreds, the asymptotic approximation to the mean behaviour of the epidemic is excellent. With smaller population sizes of a few hundred the approximation of the variability seems quite good in the early phase of epidemic growth, begins to worsen at or slightly before the time of peak prevalence and consistently underestimates the variability of $I^N(t)$ after that. As the population size increases, the approximation for the standard deviation improves but not as quickly as one might hope: the agreement between asymptotic and empirical distributions seems to improve fairly slowly as N increases from 200 to 5000. Thus we can be very confident in using an LLN-based approximation for nearly any population size; but CLT-based approximations must be used with some caution, particularly at and after the time of peak prevalence. For these later times, a CLT-based approximation seems to systematically underestimate the variability in the number of infectives in the population. On a slightly more theoretical note, the plausibly linear (though also decidely noisy) behaviour of the plots in Fig. 3 is consistent with these Kolmogorov distances tending to 0 as $N\rightarrow \infty $. Consistent with the observations above, this convergence is at roughly the same rate for the time points in the early growth phase and near peak prevalence but much more slowly for the later time point $t=t_3$ in the phase where the infection is dying out.

10.3 Approximation of epidemic final size

In Fig. 4 we demonstrate approximation results for the final size of major outbreaks in our epidemic model on an NSW graph (Conjecture 7.1). Again we see that the approximation is quite reasonable for relatively small population sizes in the low hundreds and becomes very good indeed for population sizes in the thousands.

10.4 The effect of dropping

Next we investigate the behaviour of our model in respect of the introduction of the dropping mechanism. Starting with an epidemic without dropping we examine the behaviour of $R_0$ and $\rho $ (the fraction of the population that is ultimately infected in the limiting determinstic model—see Sect. 5.3) as the dropping rate $\omega $ is increased from 0 (no dropping) to a value which brings the model below threshold. Figure 5 does this for two ‘starting’ models, one with a Poisson and one with a geometric degree distribution, both well above threshold with with $\rho $ comfortably above 0.5. (Recall that $R_0$ and $\rho $ are independent of whether the network is MR or NSW.) In both cases we see that increasing $\omega $ reduces the virulence and severity of the epidemic as measured by $R_0$ and $\rho $. Perhaps noteworthy is that one of the plots of the mean final size $\rho $ is concave and the other convex.

10.5 The effect of random graph model on variances

We now demonstrate the effect of the random graph model (MR or NSW) on the variability of the final size of large outbreaks in our epidemic model. Figure 6 compares how the asymptotic scaled standard deviations for the final size of a major outbreak (i.e. $\sigma _\mathrm{MR}(\beta ,\gamma ,\omega )$ and $\sigma _\mathrm{NSW}(\beta ,\gamma ,\omega )$ in Proposition 6.1 and Conjecture 7.1 ) behave as dropping is included into a baseline model with no dropping. The upper plots show that these standard deviations can change quite dramatically with $\omega $; the lower plots show that the extra variability in the NSW network model can result in substantially more variability in the epidemic final size. As might be anticipated, this effect is more pronounced for the geometric compared to the Poisson case, i.e. when the degree distribution is more variable.

10.6 Increased recovery rate instead of dropping

Lastly we investigate the relationship between our model and the related model with increased recovery rate instead of dropping, as discussed in Sect. 8. We focus mainly on the claims about relative variability in the two models $E(\omega ,\gamma )$ with dropping and $E(0,\gamma +\omega )$ with increased recovery rate, though the results we present also illustrate Theorem 8.1, which gives an ordering of the major outbreak probabilities in the two models. Again we focus on the NSW graph model; similar conclusions (with less variability) are obtained with the MR graph model.

Figure 7 compares the final size distribution of the model with dropping to that of the model with increased recovery rate; again for two different degree distributions. The histograms and the normal approximation of the distribution of the size of a major outbreak confirm that the model with dropping does have a smaller variance in the size of major outbreaks and a larger chance of a major outbreak. Table 1 summarises the plots in Fig. 7. Here we see quite clearly that the major outbreak probabilities and the variances of the final size distributions are ordered as predicted by Theorem 8.1 and the argument involving differing dependence structures in Sect. 8. Differences between the two degree distributions are not very marked.

Figure 8 shows how the discrepancy in these variabilities generally increases with the dropping rate. Interestingly, we see that with the (more variable) geometric degree distribution the relative discrepancy increases with $\omega $ for most values of $\omega $; but decreases slightly with $\omega $ when $\omega $ is sufficiently large that the size of large outbreaks gets close to zero and the variability is quite large.

Figure 9 shows how the asymptotic quantities relating to the mean and standard deviation of $S^N(t)$ and $I^N(t)$ compare through time for these models. In the model with dropping we denote the asymptotic mean proportion infected by $\mu ^I(t; \beta ,\omega ,\gamma )$ and the asymptotic scaled standard deviation of $I^N(t)$ by $\sigma ^I_\mathrm{NSW}(t;\beta ,\omega ,\gamma )$; we let $\mu ^S(t; \beta ,\omega ,\gamma )$ and $\sigma ^S_\mathrm{NSW}(t;\beta ,\omega ,\gamma )$ denote the corresponding quantities for the number of susceptibles $S^N(t)$. Note that the absolute scale of the standard deviations here is not directly meaningful (to approximate the standard deviation in a population of size N these limiting quantities should be multiplied by $\sqrt{N}$); it is the relative values that are of interest here. Firstly, the upper plots confirm our assertions about the relative numbers of susceptibles in the two models: that the mean (LLN) behaviour of the two models is the same but the model with dropping exhibits less variability (cf. the final size behaviour in Fig. 7 and Table 1). In the lower plots the behaviour of the individual models $E(\omega ,\gamma )$ and $E(0,\gamma +\omega )$ is broadly in keeping with that observed in Fig. 1, however the differences between the two models are quite stark. Even though the two models have the same final size they achieve this through very different temporal behaviour: in the $E(0,\gamma +\omega )$ model individuals are infectious for less time but during that time infect others at a higher rate.

Table 1 Numerical summary of Fig. 7, using a final size of 0.15N to separate minor from major outbreaks

Full size table

11 Concluding comments

The current paper is concerned with a model for an epidemic taking place on a network in which susceptible individuals may drop their connections to infectious individuals as a preventive measure. A consequence of the behavioural dynamics is that the network changes in time, and the way the network changes depends on the epidemic process taking place on it (sometimes referred to as an adaptive network). We derive limiting properties of the epidemic process assuming a large outbreak in a large community: the LLN and functional CLT for the epidemic process, as well as conjecture a LLN and CLT for the final number getting infected. We also give a version of the functional CLT in Ethier and Kurtz (1986), Chapter 11, which allows for asymptotically random initial conditions (Theorem 7.1). Although it is a simple extension of Ethier and Kurtz (1986), Theorem 11.2.3, we have not seen the result previously in the literature and it (especially the covariance formula (7.3)) clearly has interest and applications well beyond the present setting. Furthermore, from the analysis of the dropping model we also obtain results for the Markovian SIR epidemic on a configuration model and for the configuration model giant component. In particular, we conjecture CLTs, with essentially fully explicit expressions for the asymptotic variances, for the final size of such epidemics on both MR and NSW random graphs, and for the size of the giant components of those graphs.

The above LLN and functional CLT are proved under the assumption of bounded degrees. As noted in Remark 4.1, the arguments in Ball (2018) should yield proofs of the final-size LLN and CLTs under this assumption. Rigorous extension of these results to networks with unbounded degrees is a natural mathematical next step, though bounded degrees are clearly sufficient for most biological purposes.

The simulations in Sects. 10.2 and 10.3 show that the limiting approximations kick in for moderate population sizes. Further, from the numerical investigations, dropping of edges seems to have the greatest preventive effect when the basic reproduction number $R_0$ is not too large, more specifically when it is close to the epidemic threshold value of one. In fact, if $R_0$ is moderate in the absence of dropping of edges, a fairly small dropping rate can make the epidemic sub-critical implying that large outbreaks are no longer possible in the large population limit.

This paper is inspired by the model in Britton et al. (2016), who study only the initial stages of an outbreak. In the current paper, in order to make progress in the analysis of the complete outbreak, we assume that edges can only be dropped, in contrast to Britton et al. (2016), which allows for some of the dropped edges to rewire to other individuals. It would of course be of interest to study limiting properties of this more general dropping/rewiring model. However, the effective degree approach does not apply immediately in a rigorous fashion to this setting, and rigorous analysis of the non-initial stages of the model including rewiring is left as an open problem. The model with rewiring is considered further in Leung et al. (2018), where it is demonstrated that such rewiring of edges, although always beneficial to the susceptible individual, can have an adverse effect at the population level. Other possible forms of social distancing include reducing contacts rather than dropping edges completely (e.g. Viljoen et al. (2014) and Zhang et al. (2014)) or only temporarily dropping the edge (e.g. Althouse and Hébert-Dufresne (2014)).

Another extension of the current model would be to allow the network to change in time also for reasons other than the epidemic process. One could for example consider some type of dynamic network model as the base network model (e.g. one of the dynamic network models of Leung and Diekmann (2016)), and increase the dropping rate indirectly by decreasing the rate of creation of new edges and/or increasing the rewiring rate between susceptible-infectious pairs of individuals, see e.g. Reniers and Armbruster (2012) for a simulation study where partnership dissolution rates depend on the HIV status of the couple. Obviously, rigorous analysis of such models will be appreciably harder, if indeed possible.

Finally, we note that we have restricted ourselves to the Markovian setting throughout this paper. As always, this assumption is not realistic and is made for mathematical convenience. In the setting of this paper, it is possible to generalize some of our results to include non-exponentially distributed infectious periods. Using a susceptibility set argument, as in e.g. Ball and Sirl (2013), Section 2.1.2, we can prove results for the deterministic final size similar to Proposition 5.1(b). Specifically, if the infectious period follows a random variable I, the deterministic final size is the same as that for a standard SIR epidemic on a configuration model network in which the infectious period is distributed as $I'=\min (I,W)$, where W is independent of I and has an exponential distribution with rate $\omega $. Recently, Sherborne et al. (2018) have extended edge-based compartmental models of epidemics on networks to allow for non-Markovian transmission and recovery processes, and that methodology should enable the limiting deterministic model for our model with dropping of edges and non-exponentially distributed infectious periods to be determined, as can be done using the binding site formulation of Leung and Diekmann (2016). It seems likely that our effective degree approach, together with LLN and functional CLT theorems in Wang (1975, 1977) for age and density dependent population processes, can be used to put such deterministic models in a fully rigorous asymptotic framework and provide an associated functional CLT.

References

Althouse B, Hébert-Dufresne L (2014) Epidemic cycles driven by host behaviour. J R Soc Interface 11:20140,575
Article Google Scholar
Ball F (2018) Central limit theorems for SIR epidemics and percolation on configuration model random graphs. arXiv:1812.03105v1
Ball F, House T (2017) Heterogeneous network epidemics: real-time growth, variance and extinction of infection. J Math Biol 75(3):577–619
Article MathSciNet MATH Google Scholar
Ball F, Neal P (2003) The great circle epidemic model. Stoch Process Appl 107(2):233–268
Article MathSciNet MATH Google Scholar
Ball F, Neal P (2008) Network epidemic models with two levels of mixing. Math Biosci 212(1):69–87
Article MathSciNet MATH Google Scholar
Ball F, Neal P (2017) The asymptotic variance of the giant component of configuration model random graphs. Ann Appl Probab 27(2):1057–1092
Article MathSciNet MATH Google Scholar
Ball F, Sirl D (2012) An SIR epidemic model on a population with random network and household structure, and several types of individuals. Adv Appl Probab 44(1):63–86
Article MathSciNet MATH Google Scholar
Ball F, Sirl D (2013) Acquaintance vaccination in an epidemic on a random graph with specified degree distribution. J Appl Probab 50(4):1147–1168
Article MathSciNet MATH Google Scholar
Barbour A, Reinert G (2013) Approximating the epidemic curve. Electron J Probab 18(54):1–30
MathSciNet MATH Google Scholar
Barbour AD, Luczak MJ (2012a) A law of large numbers approximation for Markov population processes with countably many types. Probab Theory Relat Fields 153(3–4):725–757
MathSciNet MATH Google Scholar
Barbour AD, Luczak MJ (2012b) Central limit approximations for Markov population processes with countably many types. Electron J Probab 17(90):1–16
MathSciNet MATH Google Scholar
Barbour AD, Röllin A (2019) Central limit theorems in the configuration model. Ann Appl Probab 29(2):1046–1069
Article MathSciNet MATH Google Scholar
Bohman T, Picollelli M (2012) SIR epidemics on random graphs with a fixed degree sequence. Random Struct Algorithms 41(2):179–214
Article MathSciNet MATH Google Scholar
Bollobás B (1980) A probabilistic proof of an asymptotic formula for the number of labelled regular graphs. Eur J Comb 1(4):311–316
Article MathSciNet MATH Google Scholar
Bollobás B (2001) Random graphs. Academic Press, New York
Book MATH Google Scholar
Britton T, Juher D, Saldaña J (2016) A network epidemic model with preventive rewiring: comparative analysis of the initial phase. Bull Math Biol 78(12):2427–2454
Article MathSciNet MATH Google Scholar
Decreusefond L, Dhersin JS, Moyal P, Tran VC (2012) Large graph limit for an SIR process in random network with heterogeneous connectivity. Ann Appl Probab 22(2):541–575
Article MathSciNet MATH Google Scholar
Durrett R (2007) Random graph dynamics. Cambridge University Press, Cambridge
MATH Google Scholar
Ethier SN, Kurtz TG (1986) Markov processes: characterization and convergence. Wiley series in probability and mathematical statistics. Wiley, Hoboken
Book MATH Google Scholar
Gross T, D’Lima CJD, Blasius B (2006) Epidemic dynamics on an adaptive network. Phys Rev Lett 96:208,701
Article Google Scholar
Jacobsen KA, Burch MG, Tien JH, Rempala GA (2018) The large graph limit of a stochastic epidemic on a dynamic multilayer network. J Biol Dyn 12(1):746–788
Article MathSciNet Google Scholar
Janson S (2018) Asymptotic normality in random graphs with given vertex degrees. arXiv:1812.08063v1
Janson S, Luczak M, Windridge P (2014) Law of large numbers for the SIR epidemic on a random graph with given degrees. Random Struct Algorithms 45(4):726–763
Article MathSciNet MATH Google Scholar
Kenah E, Robins JM (2007) Second look at the spread of epidemics on networks. Phys Rev E 76:036,113
Article MathSciNet Google Scholar
KhudaBukhsh WR, Woroszylo C, Rempala GA, Koeppl H (2017) Functional central limit theorem for susceptible-infected process on configuration model graphs. arXiv:1703.06328v1
Kiss IZ, Miller JC, Simon P (2017) Mathematics of epidemics on networks: from exact to approximate models. Springer, Berlin
Book MATH Google Scholar
Kurtz TG (1970) Solutions of ordinary differential equations as limits of pure jump Markov processes. J Appl Probab 7(1):49–58
Article MathSciNet MATH Google Scholar
Kurtz TG (1971) Limit theorems for sequences of jump Markov processes approximating ordinary differential equations. J Appl Probab 8(2):344–356
Article MATH Google Scholar
Leung K, Ball F, Sirl D, Britton T (2018) Individual preventive social distancing during an epidemic may have negative population-level outcomes. J R Soc Interface 15:20180,296
Article Google Scholar
Leung K, Diekmann O (2016) Dangerous connections: on binding site models of infectious disease dynamics. J Math Biol 74(3):619–671
Article MathSciNet MATH Google Scholar
Lindquist J, Ma J, van den Driessche P, Willeboordse FH (2011) Effective degree network disease models. J Math Biol 62(2):143–164
Article MathSciNet MATH Google Scholar
Martin-Löf A (1986) Symmetric sampling procedures, general epidemic processes and their threshold limit theorems. J Appl Probab 23(2):265–282
Article MathSciNet MATH Google Scholar
Miller JC (2011) A note on a paper by Erik Volz: SIR dynamics in random networks. J Math Biol 62(3):349–358
Article MathSciNet MATH Google Scholar
Molloy M, Reed B (1995) A critical point for random graphs with a given degree sequence. Random Struct Algorithms 6(2–3):161–179
Article MathSciNet MATH Google Scholar
Newman M (2002) Spread of epidemics on networks. Phys Rev E 66:016,128
Article MathSciNet Google Scholar
Newman M, Strogratz S, Watts D (2001) Random graphs with arbitrary degree distributions and their applications. Phys Rev E 64:026,118
Article Google Scholar
Reniers G, Armbruster B (2012) HIV status awareness, partnership dissolution and HIV transmission in generalized epidemics. PLoS ONE 7(12):e50,669
Article Google Scholar
Scalia-Tomba GP (1985) Asymptotic final size distribution for some chain-binomial models. Adv Appl Probab 17(3):477–495
Article MathSciNet MATH Google Scholar
Shaw L, Schwartz IB (2008) Fluctuating epidemics on adaptive networks. Phys Rev E 77:066,101
Article MathSciNet Google Scholar
Sherborne N, Miller JC, Blyuss KB, Kiss IZ (2018) Mean-field models for non-Markovian epidemics on networks. J Math Biol 76(3):755–778
Article MathSciNet MATH Google Scholar
Tunc I, Shaw LB (2014) Effects of community structure on epidemic spread in an adaptive network. Phys Rev E 90:022,801
Article Google Scholar
Viljoen T, Spoelstra J, Hemerik L, Molenaar J (2014) Modelling the impact of HIV on the populations of South Africa and Botswana. Acta Biotheoret 62(1):91–108
Article Google Scholar
Volz EM (2008) SIR dynamics in random networks with heterogeneous connectivity. J Math Biol 56(3):293–310
Article MathSciNet MATH Google Scholar
von Bahr B, Martin-Löf A (1980) Threshold limit theorems for some epidemic processes. Adv Appl Probab 12(2):319–349
Article MathSciNet MATH Google Scholar
Wang FJS (1975) Limit theorems for age and density dependent stochastic population processes. J Math Biol 2(4):373–400
Article MathSciNet MATH Google Scholar
Wang FJS (1977) A central limit theorem for age- and density-dependent population processes. Stoch Process Appl 5(2):173–193
Article MathSciNet MATH Google Scholar
Zanette DH, Risau-Gusmán S (2008) Infection spreading in a population with evolving contacts. J Biol Phys 34(1–2):135–148
Article Google Scholar
Zhang HF, Xie JR, Tang M, Lai YC (2014) Suppression of epidemic spreading in complex networks by local information based behavioral responses. Chaos Interdiscip J Nonlinear Sci 24:043,106
Article MathSciNet MATH Google Scholar

Download references

Acknowledgements

This work was partially supported by a grant from the Simons Foundation and was carried out as a result of the authors’ visit to the Isaac Newton Institute for Mathematical Sciences during the programme Theoretical Foundations for Statistical Network Analysis in 2016 (EPSRC Grant Number EP/K032208/1). KYL was supported by the Swedish Research Council (VR) Grant Number 2015-05015. This work was also supported by a Grant from the Knut and Alice Wallenberg Foundation, which enabled FB to be a guest professor at the Department of Mathematics, Stockholm University. We thank Phil Pollett for some helpful discussions relating to Theorem 7.1 and the reviewers for their constructive comments which have improved the presentation of the paper.

Author information

Authors and Affiliations

School of Mathematical Sciences, University of Nottingham, University Park, Nottingham, NG7 2RD, UK
Frank Ball & David Sirl
Department of Mathematics, Stockholm University, 106 91, Stockholm, Sweden
Tom Britton & Ka Yin Leung

Authors

Frank Ball
View author publications
You can also search for this author in PubMed Google Scholar
Tom Britton
View author publications
You can also search for this author in PubMed Google Scholar
Ka Yin Leung
View author publications
You can also search for this author in PubMed Google Scholar
David Sirl
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Frank Ball.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendices

A Derivation of drift function $F(\varvec{x},\varvec{y},z_E)$

In this appendix we derive the expression (3.8) for $F(\varvec{x},\varvec{y},z_E)$. First note that (3.1) and (3.7) yield

$$\begin{aligned}&\sum _{\varvec{l}\in \varDelta _1}\varvec{l}\beta _{\varvec{l}}(\varvec{x},\varvec{y},z_E)\nonumber \\&\quad =\sum _{i=1}^{\infty }\sum _{j=1}^{\infty } \frac{\beta i y_i j x_j}{\eta _E}(-\varvec{e}^\mathrm{I}_i+\varvec{e}^\mathrm{I}_{i-1}-\varvec{e}^\mathrm{S}_j+\varvec{e}^\mathrm{I}_{j-1}) \nonumber \\&\quad =\frac{\beta }{\eta _E}\left[ x_E\sum _{i=1}^{\infty }i y_i (-\varvec{e}^\mathrm{I}_i+\varvec{e}^\mathrm{I}_{i-1}) +y_E\sum _{j=1}^{\infty } j x_j (-\varvec{e}^\mathrm{S}_j+\varvec{e}^\mathrm{I}_{j-1})\right] \nonumber \\&\quad = \frac{\beta }{\eta _E}\sum _{i=0}^{\infty }\left\{ x_E\left[ (i+1)y_{i+1}-iy_i\right] \varvec{e}^\mathrm{I}_i +y_E \left[ -i x_i \varvec{e}^\mathrm{S}_i+(i+1)x_{i+1}\varvec{e}^\mathrm{I}_i\right] \right\} , \end{aligned}$$

(A.1)

(3.2) and (3.7) yield

$$\begin{aligned} \sum _{\varvec{l}\in \varDelta _2}\varvec{l}\beta _{\varvec{l}}(\varvec{x},\varvec{y},z_E)= & {} \sum _{i=1}^{\infty }\sum _{j=1}^{\infty } \frac{(\beta +\omega )i y_i j y_j}{\eta _E}(-\varvec{e}^\mathrm{I}_i+\varvec{e}^\mathrm{I}_{i-1}-\varvec{e}^\mathrm{I}_j+\varvec{e}^\mathrm{I}_{j-1}) \nonumber \\= & {} 2\frac{(\beta +\omega )}{\eta _E} \sum _{i=1}^{\infty }\sum _{j=1}^{\infty } i y_i j y_j (-\varvec{e}^\mathrm{I}_i+\varvec{e}^\mathrm{I}_{i-1}) \nonumber \\= & {} 2\frac{(\beta +\omega )y_E}{\eta _E} \sum _{i=0}^{\infty }[-iy_i+(i+1)y_{i+1}] \varvec{e}^\mathrm{I}_i, \end{aligned}$$

(A.2)

and (3.3) and (3.7) yield

$$\begin{aligned}&\sum _{\varvec{l}\in \varDelta _3}\varvec{l}\beta _{\varvec{l}}(\varvec{x},\varvec{y},z_E)\nonumber \\&\quad =\sum _{i=1}^{\infty }\sum _{j=1}^{\infty } \frac{\omega i y_i j x_j}{\eta _E}(-\varvec{e}^\mathrm{I}_i+\varvec{e}^\mathrm{I}_{i-1}-\varvec{e}^\mathrm{S}_j+\varvec{e}^\mathrm{S}_{j-1}) \nonumber \\&\quad =\frac{\omega }{\eta _E}\left[ x_E\sum _{i=1}^{\infty } i y_i (-\varvec{e}^\mathrm{I}_i+\varvec{e}^\mathrm{I}_{i-1}) +y_E\sum _{j=1}^{\infty } j x_j (\varvec{e}^\mathrm{S}_j+\varvec{e}^\mathrm{S}_{j-1})\right] \nonumber \\&\quad =\frac{\omega }{\eta _E}\sum _{i=0}^{\infty }\left\{ x_E [(i+1)y_{i+1}-iy_i] \varvec{e}^\mathrm{I}_i+ y_E[(i+1)x_{i+1}-ix_i] \varvec{e}^\mathrm{S}_i \right\} . \end{aligned}$$

(A.3)

Similarly, (3.4) and (3.7) yield

$$\begin{aligned}&\sum _{\varvec{l}\in \varDelta _4}\varvec{l}\beta _{\varvec{l}}(\varvec{x},\varvec{y},z_E)\nonumber \\&\quad = \sum _{i=1}^{\infty } \frac{(\beta +\omega )i y_i z_E}{\eta _E}(-\varvec{e}^\mathrm{I}_i+\varvec{e}^\mathrm{I}_{i-1}-\varvec{e}^\mathrm{R}) \nonumber \\&\quad =-\frac{(\beta +\omega )y_Ez_E}{\eta _E}\varvec{e}^\mathrm{R}+\frac{(\beta +\omega )z_E}{\eta _E} \sum _{i=0}^{\infty } [(i+1) y_{i+1}-i y_i]\varvec{e}^\mathrm{I}_i, \end{aligned}$$

(A.4)

and (3.5) and (3.7) yield

$$\begin{aligned} \sum _{\varvec{l}\in \varDelta _5}\varvec{l}\beta _{\varvec{l}}(\varvec{x},\varvec{y},z_E)= & {} \sum _{i=0}^{\infty } \gamma y_i (-\varvec{e}^\mathrm{I}_i+i\varvec{e}^\mathrm{R}) \nonumber \\= & {} \gamma y_E \varvec{e}^\mathrm{R}- \gamma \sum _{i=0}^{\infty } y_i \varvec{e}^\mathrm{I}_i. \end{aligned}$$

(A.5)

Adding (A.1) to (A.5) and recalling that $\eta _E=x_E+y_E+z_E$ gives (3.8).

B Application of theorems for density dependent population processes

In this appendix we show that the conditions of the Theorems 11.2.2 and 11.2.3 in Ethier and Kurtz (1986), Chapter 11, concerning density dependent population processes are satisfied when there is a maximum degree, $d_{\max }$ say, and $\rho <1$. (Recall that $\rho $ is the fraction of the population that is ultimately infected by the limiting deterministic model.) Thus, for $t \ge 0$,

$$\begin{aligned} \varvec{W}^N(t)=\left( X_0^N(t), X_1^N(t),\ldots ,X_{d_{\max }}^N(t),Y_0^N(t), Y_1^N(t),\ldots ,Y_{d_{\max }}^N(t),Z_E^N(t)\right) , \end{aligned}$$

so $\{\varvec{W}^N(t)\}$ has dimension $d=2(d_{\max }+1)+1$. The limiting deterministic process is $\{\varvec{w}(t)\}$, where, for $t \ge 0$,

$$\begin{aligned} \varvec{w}(t)&=(x_0(t), x_1(t),\ldots ,x_{d_{\max }}(t), y_0(t), y_1(t),\ldots ,y_{d_{\max }}(t), z_E(t))\\&=(w_1(t),w_2(t),\ldots ,w_d(t)). \end{aligned}$$

The domain of the intensity functions $\beta _{\varvec{l}}(\varvec{w})$$(\varvec{l}\in \varDelta )$ is

$$\begin{aligned} H_*=\left\{ \varvec{w}:w_i \ge 0 \;(i=1,2,\ldots ,d),\sum _{i=1}^d w_i \le 1\right\} . \end{aligned}$$

The proofs of the theorems in Ethier and Kurtz (1986), Chapter 11, make it clear that the conditions need only hold in some small neighbourhood of $\{\varvec{w}(t)\}$. Thus, since $\rho <1$, there exists $\varepsilon >0$, so that $H_*$ can be replaced by $H_*(\varepsilon )= \left\{ \varvec{w}\in H_*: x_E \ge \varepsilon \right\} $, where $x_E=\sum _{i=1}^{d_{\max }} ix_i$. It follows that the density dependent condition (3.6) is satisfied for all sample paths of $\{\varvec{W}^N(t)\}$ such that $N^{-1}\varvec{W}^N(t)$ remains within $H_*(\varepsilon )$, which is sufficient for the proofs in Ethier and Kurtz (1986).

Considering first the LLN for $\{\varvec{W}^N(t)\}$, the conditions of Ethier and Kurtz (1986), Theorem 11.2.1, are satisfied if (i) $\sum _{\varvec{l}\in \varDelta }|\varvec{l}|\sup _{\varvec{w}\in H_*(\varepsilon )}\beta _{\varvec{l}}(\varvec{w})<\infty $; (ii) the drift function F is Lipschitz continuous on $H_*(\varepsilon )$; and (iii) $\lim _{N\rightarrow \infty } N^{-1} \varvec{W}^N(0)=\varvec{w}(0) \ne \varvec{0}$. It is easily seen from (3.7) that (i) is satisfied, since $\varDelta $ is finite and $\eta _E \ge x_E \ge \varepsilon >0$ for all $\varvec{w}\in H_*(\varepsilon )$. It follows from (3.8) that the partial derivatives $\partial _j F_i(\varvec{w})$$(i,j=1,2,\ldots ,d)$ are uniformly bounded on $H_*(\varepsilon )$, since $\eta _E \ge \varepsilon $ for all $\varvec{w}\in H_*(\varepsilon )$, so (ii) is satisfied. Finally, it is easily seen from the proof in Ethier and Kurtz (1986) that the result still holds if the convergence in (iii) holds almost surely, thus the LLN for $\{\varvec{W}^N(t)\}$, stated in Sect. 3, holds for epidemics on both MR and NSW random graphs.

Turning to the functional CLT (3.15), where to be more explict $\Rightarrow $ denotes weak convergence in the space of right-continuous functions $f:[0,\infty ) \rightarrow \mathbb {R}^d$ having limits from the left (i.e. càdlàg functions), endowed with the Skorohod metric, the conditions of Ethier and Kurtz (1986), Theorem 11.2.3, are satisfied if, in addition to (i)–(iii), (iv) $\sum {\varvec{l}\in \varDelta }|\varvec{l}|^2\sup _{\varvec{w}\in H_*(\varepsilon )}\beta _{\varvec{l}}(\varvec{w})<\infty $; (v) the intensity functions $\beta _{\varvec{l}}(\varvec{w})$$(\varvec{l}\in \varDelta )$ and the partial derivatives $\partial _j F_i(\varvec{w})$$(i,j=1,2,\ldots ,d)$ are continuous on $H_*(\varepsilon )$; and (vi) $\lim _{N \rightarrow \infty } \sqrt{N}\left( N^{-1} \varvec{W}^N(0)-\varvec{w}(0)\right) =\varvec{V}(0)$, where $\varvec{V}(0)$ is constant. Now (iv) is satisfied, for similar reasons to (i). It is easily seen from (3.7) and (3.8) that (v) is satisified, and (vi) follows from (3.14). Thus (3.15) is proved.

Consider now the random time-scale transformed process $\{\tilde{\varvec{W}}^N(t)\}$ introduced in Sect. 4. The limiting determinstic process is now $\{\tilde{\varvec{w}}(t): t \ge 0\}$, where

$$\begin{aligned} \tilde{\varvec{w}}(t)=(\tilde{x}_0(t),\tilde{x}_1(t),\ldots ,\tilde{x}_{d_{\max }}(t),\tilde{y}_0(t),\tilde{y}_1(t),\ldots ,\tilde{y}_{d_{\max }}(t),\tilde{z}_E(t)). \end{aligned}$$

For any $t_0 \in (0,\tilde{\tau })$, there exists $\varepsilon '>0$ such that $\tilde{y}_E(t)=\sum _{i=1}^{d_{\max }} i \tilde{y}_i(t) \ge \varepsilon '$ for all $0 \le t \le t_0$. Let $\tilde{H}_*(\varepsilon ')= \left\{ \tilde{\varvec{w}}\in H_*: \tilde{y}_E\ge \varepsilon \right\} $. The proofs that the conditions of Ethier and Kurtz (1986), Theorems 11.2.1 and 11.2.3, are satisfied for the transformed process $\{\tilde{\varvec{W}}^N(t): 0 \le t \le t_0\}$ are analagous to those above, except $H_*(\varepsilon )$ is replaced by $\tilde{H}_*(\varepsilon ')$. Note that the denominator in the intensity functions $\tilde{\beta }_{\varvec{l}}(\varvec{w})$$(\varvec{l}\in \varDelta )$ given at (4.1) (and hence in the drift function $\tilde{F}$ given at (4.2)) is $\tilde{y}_E$, where for the untransformed process it is $\eta _E$.

C Properties of $\tilde{\tau }_{\delta }$

In this appendix we prove that (i) $\tilde{\tau }<\infty $ and (ii) (4.12) holds for all $\delta \in [0,y_E(0))$. Recalling the definition of $\tilde{\tau }_{\delta }$ at (4.10), it follows that $\tilde{\tau }_{\delta }$ is the smallest positive solution of $\tilde{y}_E(t)=\delta $ with ${\tilde{y}}_E(t)$ given by (5.11) (Clearly, $\tilde{\tau }_{\delta }=0$ for $\delta >y_E(0)$.) Also, it follows from (5.9), (6.12), and $\tilde{y}_E(\tilde{\tau }_{\delta })=\delta $ that

$$\begin{aligned}&\nabla \varphi (\tilde{\varvec{w}}(\tilde{\tau }_{\delta })) \cdot \tilde{F}(\tilde{\varvec{w}}(\tilde{\tau }_{\delta }))\\&\quad =-(\beta +\omega )\delta + \mathrm{e}^{-2(\beta +\omega )\tilde{\tau }_{\delta }}\left[ \beta f_{D_{\varepsilon }}''\left( \psi (\tilde{\tau }_{\delta })\right) -(\beta +\omega +\gamma )\mu _D\right] . \end{aligned}$$

Let $z_{\delta }=\mathrm{e}^{-(\beta +\omega )\tilde{\tau }_{\delta }}$ and recall that $\psi (\tilde{\tau }_{\delta })=p_{\omega }+(1-p_{\omega })\mathrm{e}^{-(\beta +\omega )\tilde{\tau }_{\delta }}=\widetilde{\psi }(z_{\delta })$. Then,

$$\begin{aligned} \nabla \varphi (\tilde{\varvec{w}}(\tilde{\tau }_{\delta })) \cdot \tilde{F}(\tilde{\varvec{w}}(\tilde{\tau }_{\delta }))=-(\beta +\omega )\delta +z_{\delta }^2 \left[ \beta f_{D_{\varepsilon }}''\left( \widetilde{\psi }(z_{\delta })\right) -(\beta +\omega +\gamma )\mu _D\right] . \end{aligned}$$

(C.1)

and from (5.11), if follows that $z_{\delta }$ satisfies

$$\begin{aligned} f_{D_{\varepsilon }}'\left( \widetilde{\psi }(z_{\delta })\right) -\frac{[(\beta +\omega +\gamma )z_{\delta }-\gamma ]}{\beta +\omega }\mu _D=-\frac{\delta }{z_{\delta }}. \end{aligned}$$

(C.2)

For $z \in [0,1]$, let

$$\begin{aligned} A(z)=f_{D_{\varepsilon }}'\left( \widetilde{\psi }(z)\right) -\frac{[(\beta +\omega +\gamma )z-\gamma ]}{\beta +\omega }\mu _D, \end{aligned}$$

(C.3)

so $z_0=\mathrm{e}^{-(\beta +\omega )\tilde{\tau }}=\mathrm{e}^{-(\beta +\omega )\tilde{\tau }_0}$ satisfies $A(z_0)=0$. Now $A(0)= f_{D_{\varepsilon }}'(p_{\omega })+\gamma /(\beta +\omega )$ and $A(1)=f_{D_{\varepsilon }}'(1)-\mu _D=-y_E(0)<0$. (Recall the definition of $f_{D_{\varepsilon }}$ at (5.3).) Further, unless $p_{\omega }=\gamma =f_{D_{\varepsilon }}'(0)=0$, then $A(0)>0$, so since A(z) is continuous, $z_0 \in (0,1)$ and $\tilde{\tau }$ (and hence also $\tilde{\tau }_{\delta }$) is finite. For $\delta \in (0,y_E(0))$, note that $z_{\delta }$ satisfies $A(z_{\delta })+\frac{\delta }{z_{\delta }}=0$. Thus, $A(1)+\frac{\delta }{1}=\delta -y_E(0)<0$ and $A(z)+\frac{\delta }{z} \rightarrow \infty $ as $z \downarrow 0$, so $z_{\delta }\in (0,1)$ and $\tilde{\tau }_{\delta }< \infty $. If $p_{\omega }=\gamma =f_{D_{\varepsilon }}'(0)=0$ and $\delta =0$, then it is easily verified using the convexity of $f_{D_{\varepsilon }}'$ that $z_0=0$, so $\tilde{\tau }=\infty $.

We show now that $\nabla \varphi (\tilde{\varvec{w}}(\tilde{\tau }_{\delta })) \cdot \tilde{F}(\tilde{\varvec{w}}(\tilde{\tau }_{\delta }))<0$ for $\delta \in [0,y_E(0))$. Differentiating (C.3) and recalling that $p_{\omega }=\frac{\omega }{\beta +\omega }$ yields

$$\begin{aligned} A'(z)=\frac{1}{\beta +\omega }\left[ \beta f_{D_{\varepsilon }}''\left( \widetilde{\psi }(z)\right) -(\beta +\omega +\gamma )\mu _D\right] . \end{aligned}$$

Suppose, for contradiction, that $\nabla \varphi (\tilde{\varvec{w}}(\tilde{\tau }_{\delta }))\cdot \tilde{F}(\tilde{\varvec{w}}(\tilde{\tau }_{\delta })) \ge 0$. Then, recalling (C.1), $A'(z_{\delta }) \ge \frac{\delta }{z_{\delta }^2}$, whence $A'(z) > \frac{\delta }{z^2}$ for $z \in [z_{\delta },1]$, since $A'$ is increasing on [0, 1]. It follows from (C.2) that $A(z_{\delta })=-\frac{\delta }{z_{\delta }}$. Thus,

$$\begin{aligned} A(1) > A(z_{\delta })+ \int _{z_{\delta }}^1 \frac{\delta }{z^2}\,\mathrm{d}z = -\delta . \end{aligned}$$

But $A(1)=f_{D_{\varepsilon }}'(1)-\mu _D=-y_E(0)$, since, using (5.3), $f_{D_{\varepsilon }}'(1)=\sum _{k=1}^{\infty }k(p_k - \varepsilon _k)= \mu _D-y_E(0)$. Thus, $y_E(0) < \delta $, which is a contradiction as $\delta \in [0,y_E(0))$. Hence $\nabla \varphi (\tilde{\varvec{w}}(\tilde{\tau }_{\delta })) \cdot \tilde{F}(\tilde{\varvec{w}}(\tilde{\tau }_{\delta }))<0$, as required.

Finally, suppose that the epidemic is started by a trace of infection, so $y_E=0$, and that $\delta =0$. Then, Proposition 5.1(b) shows that (C.2) (with $D_{\varepsilon }$ replaced by D and $\delta =0$) has a (unique) solution, $z_0$, in [0, 1) if and only if $R_0>1$. Moreover, $z_0>0$ unless $p_{\omega }=\gamma =f_D'(0)=0$.

Further, the above proof is easily modified to show that $\nabla \varphi (\tilde{\varvec{w}}(\tilde{\tau })) \cdot \tilde{F}(\tilde{\varvec{w}}(\tilde{\tau }))<0$.

D Calculations pertaining to $\tilde{\varPhi }(t,u)$

Expanding (4.9) in partitioned form yields, using (6.16),

$$\begin{aligned} \dfrac{\partial }{\partial t}\tilde{\varPhi }_{XX}(t,u)=\partial \tilde{F}_{XX}(\tilde{\varvec{w}}(t))\tilde{\varPhi }_{XX}(t,u), \end{aligned}$$

(D.1)

and, for $A=Y, Z$ and $B=X, Y, Z$,

$$\begin{aligned}&\dfrac{\partial }{\partial t}\tilde{\varPhi }_{AB}(t,u)\nonumber \\&\quad = \partial \tilde{F}_{AX}(\tilde{\varvec{w}}(t))\tilde{\varPhi }_{XB}(t,u)+ \partial \tilde{F}_{AY}(\tilde{\varvec{w}}(t))\tilde{\varPhi }_{YB}(t,u)+ \partial \tilde{F}_{AZ}(\tilde{\varvec{w}}(t))\tilde{\varPhi }_{ZB}(t,u), \end{aligned}$$

(D.2)

where $\tilde{\varPhi }_{XY}(t,u)=0$ and $\tilde{\varPhi }_{XZ}(t,u)=\varvec{0}^\top $.

It follows from (4.2) that

$$\begin{aligned} \left( \partial \tilde{F}_{XX}(\tilde{\varvec{w}}(t))\right) _{ij}=-\beta i \delta _{i,j}+\omega \left[ -i \delta _{i,j}+(i+1)\delta _{i+1,j}\right] . \end{aligned}$$

Thus, letting $\tilde{\phi }_{ij}(t,u)$ denote the (i, j)th element of $\tilde{\varPhi }_{XX}(t,u)$, it follows from (D.1) that, for $t \ge u$,

$$\begin{aligned} \dfrac{\partial }{\partial t}\tilde{\phi }_{ij}=-(\beta +\omega )i \tilde{\phi }_{ij}+\omega (i+1)\tilde{\phi }_{i+1,j}\qquad (i=0,1,\ldots ), \end{aligned}$$

(D.3)

with the initial condition $\tilde{\phi }_{ij}(u,u)=\delta _{i,j}$. For fixed j, apart from the initial condition, $\tilde{\phi }_{ij}(t,u)$$(i=0,1,\ldots )$ satisfies the same system of ODEs as that given at (4.3) for $\tilde{x}_i$$(i=0,1,\ldots )$, and it follows from (5.1) that, for $t \ge u$,

$$\begin{aligned} \tilde{\phi }_{ij}(t,u)={\left\{ \begin{array}{ll} \left( {\begin{array}{c}j\\ i\end{array}}\right) \mathrm{e}^{-(\beta +\omega )i(t-u)}\left( 1-\mathrm{e}^{-(\beta +\omega )(t-u)t}\right) ^{j-i} p_{\omega }^{j-i} &{} \text { for } j \ge i,\\ 0&{} \text { for } j < i, \end{array}\right. } \end{aligned}$$

(D.4)

so

$$\begin{aligned} \left( \varvec{1}\tilde{\varPhi }_{XX}(t,u)\right) _j= & {} \sum _{i=0}^j \left( {\begin{array}{c}j\\ i\end{array}}\right) \mathrm{e}^{-(\beta +\omega )i(t-u)}\left( 1-\mathrm{e}^{-(\beta +\omega )(t-u)t}\right) ^{j-i} p_{\omega }^{j-i} \nonumber \\= & {} \left( p_{\omega }+(1-p_{\omega })\mathrm{e}^{-(\beta +\omega )(t-u)}\right) ^j\nonumber \\= & {} \psi (t-u)^j \qquad (j=0,1,\ldots ), \end{aligned}$$

(D.5)

where $\psi (t)$ is defined at (5.5).

From (4.2), the coefficient of $\varvec{e}^\mathrm{I}_i$ in $\tilde{F}(\varvec{x},\varvec{y},z_E)$ is

$$\begin{aligned} (\beta +\omega )[-i y_i+(i+1) y_{i+1}]\left( 1+\frac{\eta _E}{y_E}\right) +\beta (i+1)x_{i+1}-\gamma y_i\frac{\eta _E}{y_E}, \end{aligned}$$

so

$$\begin{aligned} \left( \partial \tilde{F}_{YX}(\tilde{\varvec{w}}(t))\right) _{ij}= & {} (\beta +\omega )[-i \tilde{y}_i(t)+(i+1) \tilde{y}_{i+1}(t)]\frac{j}{\tilde{y}_E(t)}\\&+\,\beta (i+1)\delta _{i+1,j}-\gamma \frac{j \tilde{y}_i(t)}{\tilde{y}_E(t)}. \end{aligned}$$

Hence

$$\begin{aligned} \sum _{i=1}^{\infty } i \left( \partial \tilde{F}_{YX}(\tilde{\varvec{w}}(t))\right) _{ij}=-(\beta +\omega +\gamma )j+\beta j(j-1) \qquad (j=0,1,\ldots ), \end{aligned}$$

so

$$\begin{aligned} \varvec{p}\, \partial \tilde{F}_{YX}(\tilde{\varvec{w}}(t))= & {} -(\beta +\omega +\gamma )\varvec{p}+\beta \varvec{p}_{[2]}, \end{aligned}$$

(D.6)

where $\varvec{p}_{[2]}=(p_{[2],0}, p_{[2],1}, \ldots )$ with $p_{[2],i}=i(i-1)$$(i=0,1,\ldots )$. Similar calculations show that

$$\begin{aligned} \varvec{p}\, \partial \tilde{F}_{YY}(\tilde{\varvec{w}}(t))= & {} -[2(\beta +\omega )+\gamma ]\varvec{p}, \end{aligned}$$

(D.7)

$$\begin{aligned} \varvec{p}\, \partial \tilde{F}_{YZ}(\tilde{\varvec{w}}(t))= & {} -(\beta +\omega +\gamma ),\end{aligned}$$

(D.8)

$$\begin{aligned} \partial \tilde{F}_{ZX}(\tilde{\varvec{w}}(t))= & {} \gamma \varvec{p},\end{aligned}$$

(D.9)

$$\begin{aligned} \partial \tilde{F}_{ZY}(\tilde{\varvec{w}}(t))= & {} \gamma \varvec{p},\end{aligned}$$

(D.10)

$$\begin{aligned} \partial \tilde{F}_{ZZ}(\tilde{\varvec{w}}(t))= & {} \gamma -\beta -\omega . \end{aligned}$$

(D.11)

Setting $A=Y$ in (D.2) and using (D.6)–(D.8) yields, for $B=X,Y,Z$,

$$\begin{aligned} \dfrac{\partial }{\partial t}\varvec{p}\,\tilde{\varPhi }_{YB}(t,u)=&-(\beta +\omega +\gamma )\varvec{p}\, \tilde{\varPhi }_{XB}(t,u) +\beta \varvec{p}_{[2]}\, \tilde{\varPhi }_{XB}(t,u)\nonumber \\&-[2(\beta +\omega )+\gamma ]\varvec{p}\, \tilde{\varPhi }_{YB}(t,u)-(\beta +\omega +\gamma ) \tilde{\varPhi }_{ZB}(t,u). \end{aligned}$$

(D.12)

Setting $A=Z$ in (D.2) and using (D.9)–(D.11) yields, for $B=X,Y,Z$,

$$\begin{aligned} \dfrac{\partial }{\partial t} \tilde{\varPhi }_{ZB}(t,u)= \gamma \varvec{p}\, \tilde{\varPhi }_{XB}(t,u) +\gamma \varvec{p}\, \tilde{\varPhi }_{YB}(t,u)+(\gamma -\beta -\omega ) \tilde{\varPhi }_{ZB}(t,u). \end{aligned}$$

(D.13)

Setting $B=Z$ in (D.12) and (D.13), and recalling that $\tilde{\varPhi }_{XY}(t,u)$ and $\tilde{\varPhi }_{XZ}(t,u)$ are both identically zero, yields

$$\begin{aligned} \dfrac{\partial }{\partial t}\varvec{p}\,\tilde{\varPhi }_{YZ}(t,u)= & {} -[2(\beta +\omega )+\gamma ]\varvec{p}\, \tilde{\varPhi }_{YZ}(t,u) -(\beta +\omega +\gamma ) \tilde{\varPhi }_{ZZ}(t,u),\\ \dfrac{\partial }{\partial t} \tilde{\varPhi }_{ZZ}(t,u)= & {} \gamma \varvec{p}\, \tilde{\varPhi }_{YZ}(t,u)+(\gamma -\beta -\omega ) \tilde{\varPhi }_{ZZ}(t,u), \end{aligned}$$

with initial condition

$$\begin{aligned} \varvec{p}\,\tilde{\varPhi }_{YZ}(u,u)=0 \qquad \text{ and } \qquad \tilde{\varPhi }_{ZZ}(u,u)=1. \end{aligned}$$

This linear system of two ODEs has solution, for $t \ge u$,

$$\begin{aligned} \varvec{p}\,\tilde{\varPhi }_{YZ}(t,u)= & {} -\frac{\beta +\omega +\gamma }{\beta +\omega }\mathrm{e}^{-(\beta +\omega )(t-u)}\left( 1-\mathrm{e}^{-(\beta +\omega )(t-u)}\right) , \nonumber \\ \tilde{\varPhi }_{ZZ}(t,u)= & {} \frac{\beta +\omega +\gamma }{\beta +\omega }\mathrm{e}^{-(\beta +\omega )(t-u)}-\frac{\gamma }{\beta +\omega }\mathrm{e}^{-2(\beta +\omega )(t-u)}. \end{aligned}$$

(D.14)

Similarly, setting $B=Y$ in (D.12) and (D.13) yields

$$\begin{aligned} \dfrac{\partial }{\partial t}\varvec{p}\,\tilde{\varPhi }_{YY}(t,u)= & {} -[2(\beta +\omega )+\gamma ]\varvec{p}\, \tilde{\varPhi }_{YY}(t,u) -(\beta +\omega +\gamma ) \tilde{\varPhi }_{ZY}(t,u),\\ \dfrac{\partial }{\partial t} \tilde{\varPhi }_{ZY}(t,u)= & {} \gamma \varvec{p}\, \tilde{\varPhi }_{YY}(t,u)+(\gamma -\beta -\omega ) \tilde{\varPhi }_{ZY}(t,u), \end{aligned}$$

with initial condition

$$\begin{aligned} \varvec{p}\,\tilde{\varPhi }_{YY}(u,u)=\varvec{p}\qquad \text{ and } \qquad \tilde{\varPhi }_{ZY}(u,u)=\varvec{0}\end{aligned}$$

and solution, for $t \ge u$,

$$\begin{aligned} \varvec{p}\,\tilde{\varPhi }_{YY}(t,u)= & {} \left( \frac{\beta +\omega +\gamma }{\beta +\omega }\mathrm{e}^{-2(\beta +\omega )(t-u)}- \frac{\gamma }{\beta +\omega }\mathrm{e}^{-(\beta +\omega )(t-u)}\right) \varvec{p},\nonumber \\ \tilde{\varPhi }_{ZY}(t,u)= & {} \frac{\gamma }{\beta +\omega }\mathrm{e}^{-(\beta +\omega )(t-u)}\left( 1-\mathrm{e}^{-(\beta +\omega )(t-u)}\right) \varvec{p}. \end{aligned}$$

(D.15)

Setting $B=X$ in (D.12) and (D.13) yields

$$\begin{aligned} \dfrac{\partial }{\partial t}\varvec{p}\,\tilde{\varPhi }_{YX}(t,u)&=-(\beta +\omega +\gamma )\varvec{p}\, \tilde{\varPhi }_{XX}(t,u)+\beta \varvec{p}_{[2]}\, \tilde{\varPhi }_{XX}(t,u) \nonumber \\&\qquad -[2(\beta +\omega )+\gamma ]\varvec{p}\, \tilde{\varPhi }_{YX}(t,u) -(\beta +\omega +\gamma ) \tilde{\varPhi }_{ZX}(t,u), \end{aligned}$$

(D.16)

$$\begin{aligned} \dfrac{\partial }{\partial t} \tilde{\varPhi }_{ZX}(t,u)&=\gamma \varvec{p}\, \tilde{\varPhi }_{XX}(t,u)+\gamma \varvec{p}\, \tilde{\varPhi }_{YX}(t,u)+(\gamma -\beta -\omega ) \tilde{\varPhi }_{ZX}(t,u), \end{aligned}$$

(D.17)

with initial condition

$$\begin{aligned} \varvec{p}\,\tilde{\varPhi }_{YX}(u,u)=\varvec{0}\qquad \text{ and } \qquad \tilde{\varPhi }_{ZX}(u,u)=\varvec{0}. \end{aligned}$$

(D.18)

Further, using (D.4), for $j=0,1,\ldots $,

$$\begin{aligned} \left( \varvec{p}\tilde{\varPhi }_{XX}(t,u)\right) _j= & {} j \mathrm{e}^{-(\beta +\omega )(t-u)}\psi (t-u)^{j-1}, \end{aligned}$$

(D.19)

$$\begin{aligned} \left( \varvec{p}_{[2]}\tilde{\varPhi }_{XX}(t,u)\right) _j= & {} j(j-1) \mathrm{e}^{-2(\beta +\omega )(t-u)}\psi (t-u)^{j-2}. \end{aligned}$$

(D.20)

Note that (D.16)–(D.20) imply that, for $0 \le u \le t$,

$$\begin{aligned} \varvec{p}\,\tilde{\varPhi }_{YX}(t,u)=\varvec{p}\,\tilde{\varPhi }_{YX}(t-u,0) \qquad \text{ and } \qquad \tilde{\varPhi }_{ZX}(t,u)=\tilde{\varPhi }_{ZX}(t-u,0), \end{aligned}$$

(D.21)

so we consider the case when $u=0$.

Let

$$\begin{aligned} D=\begin{bmatrix} -2(\beta +\omega )-\gamma&\quad -(\beta +\omega +\gamma ) \\ \gamma&\quad \gamma -\beta -\omega \end{bmatrix}. \end{aligned}$$

Then,

$$\begin{aligned}&\begin{pmatrix} \varvec{p}\, \tilde{\varPhi }_{YX}(t,0)\\ \tilde{\varPhi }_{ZX}(t,0) \end{pmatrix}\nonumber \\&\quad = \int _0^t \mathrm{e}^{-D(t-s)} \begin{pmatrix} -(\beta +\omega +\gamma )\varvec{p}\, \tilde{\varPhi }_{XX}(s,0)+\beta \varvec{p}_{[2]}\, \tilde{\varPhi }_{XX}(s,0)\\ \gamma \varvec{p}\, \tilde{\varPhi }_{XX}(s,0) \end{pmatrix} \,\mathrm{d}s, \end{aligned}$$

(D.22)

with

$$\begin{aligned} \mathrm{e}^{-Dt}&=\frac{1}{\beta +\omega }\mathrm{e}^{-2(\beta +\omega )t} \begin{bmatrix} \beta +\omega +\gamma&\quad \beta +\omega +\gamma \\ -\gamma&\quad -\gamma \end{bmatrix}\nonumber \\&\phantom {=\ }+\frac{1}{\beta +\omega }\mathrm{e}^{-(\beta +\omega )t} \begin{bmatrix} -\gamma&\quad -(\beta +\omega +\gamma )\\ \gamma&\quad \beta +\omega +\gamma \end{bmatrix}. \end{aligned}$$

(D.23)

Substituting (D.23) into (D.22) yields, after using (D.19) and (D.20), that, for $j=0,1,\ldots $,

$$\begin{aligned} \left( \varvec{p}\, \tilde{\varPhi }_{YX}(t,0)\right) _j= I_j^{(1)}(t)+I_j^{(2)}(t)+I_j^{(3)}(t), \end{aligned}$$

(D.24)

where

$$\begin{aligned} I_j^{(1)}(t)= & {} -(\beta +\omega +\gamma )\mathrm{e}^{-2(\beta +\omega )t}\int _0^t j\mathrm{e}^{(\beta +\omega )s}\psi (s)^{j-1}\,\mathrm{d}s,\\ I_j^{(2)}(t)= & {} \frac{\beta (\beta +\omega +\gamma )}{\beta +\omega }\mathrm{e}^{-2(\beta +\omega )t}\int _0^t j(j-1)\psi (s)^{j-2}\,\mathrm{d}s,\\ I_j^{(3)}(t)= & {} -\frac{\beta \gamma }{\beta +\omega }\mathrm{e}^{-(\beta +\omega )t}\int _0^t j(j-1)\mathrm{e}^{-(\beta +\omega )s}\psi (s)^{j-2}\,\mathrm{d}s \end{aligned}$$

and, recalling (5.5), $\psi (s)= p_{\omega }+(1-p_{\omega })\mathrm{e}^{-(\beta +\omega )s}$. Integrating by parts,

$$\begin{aligned} \int _0^t&j(j-1)\psi (s)^{j-2}\,\mathrm{d}s\\&=\left[ -\frac{1}{\beta }\mathrm{e}^{(\beta +\omega )s}j\psi (s)^{j-1}\right] _0^t+ \int _0^t \frac{\beta +\omega }{\beta } j \mathrm{e}^{(\beta +\omega )s}\psi (s)^{j-1}\,\mathrm{d}s, \end{aligned}$$

so

$$\begin{aligned} I_j^{(2)}(t)=\frac{\beta +\omega +\gamma }{\beta +\omega }\mathrm{e}^{-2(\beta +\omega )t}j\left[ 1-\mathrm{e}^{(\beta +\omega )t}\psi (t)^{j-1}\right] -I_j^{(1)}(t). \end{aligned}$$

Also,

$$\begin{aligned} I_j^{(3)}(t)= & {} -\frac{\beta \gamma }{\beta +\omega }\mathrm{e}^{-(\beta +\omega )t}j \left[ -\frac{1}{(\beta +\omega )(1-p_{\omega })}\psi (s)^{j-1}\right] _0^t\\= & {} \frac{\gamma }{\beta +\omega }\mathrm{e}^{-(\beta +\omega )t}j\left[ \psi (t)^{j-1}-1\right] . \end{aligned}$$

It then follows using (D.24) and (D.21) that, for $j=0,1,\ldots $,

$$\begin{aligned} \left( \varvec{p}\,\tilde{\varPhi }_{YX}(t,u)\right) _j&=I_j^{(1)}(t-u)+I_j^{(2)}(t-u)+I_j^{(3)}(t-u) \end{aligned}$$

(D.25)

$$\begin{aligned}&=\mathrm{e}^{-(\beta +\omega )(t-u)}\frac{\left( (\beta +\omega +\gamma ) \mathrm{e}^{-(\beta +\omega )(t-u)}-\gamma \right) }{\beta +\omega }j\nonumber \\&\quad -\,\mathrm{e}^{-(\beta +\omega )(t-u)}\psi (t-u)^{j-1}j. \end{aligned}$$

(D.26)

E Calculation of $\sigma ^2_{\mathrm{MR}}(\beta ,\omega ,\gamma )$

Recall (6.10) for $\sigma ^2_{\mathrm{MR}}(\beta ,\omega ,\gamma )$, where $\sigma ^2_1,\sigma ^2_2,\ldots ,\sigma ^2_5$ are given by (6.11). We first obtain closed-form expressions for the integrands in the definitions of $\sigma ^2_1,\sigma ^2_2,\ldots ,\sigma ^2_5$, then evaluate the integrals as a function of $\tilde{\tau }$ and finally show that the expression for $\sigma ^2_{\mathrm{MR}}(\beta ,\omega ,\gamma )$ reduces to that given in Proposition 6.1.

1.1 E.1 Integrands

We determine the integrands for $\sigma ^2_1,\sigma ^2_2,\ldots ,\sigma ^2_5$ in reverse order.

1.1.1 E.1.1 Integral for $\sigma ^2_5$

For $i=0,1,\ldots $, it follows from (3.5), (6.17) and (6.19) that

$$\begin{aligned} \varvec{c}(\tilde{\tau },u) (\varvec{l}_i^{(5)})^{\top }=i[h_R(\tilde{\tau },u)-h_I(\tilde{\tau },u)]=-ib(\tilde{\tau })\mathrm{e}^{-(\beta +\omega )(\tilde{\tau }-u)}, \end{aligned}$$

so, using (4.1) and recalling (5.9) for $\tilde{\eta }_E(t)$,

$$\begin{aligned} \sum _{i=0}^{\infty } \left( \varvec{c}(\tilde{\tau },u) (\varvec{l}_i^{(5)}){^{\top }}\right) ^2 \tilde{\beta }_{\varvec{l}}^{(5)}(\tilde{\varvec{w}}(u))= & {} \sum _{i=0}^{\infty } i^{2} b(\tilde{\tau })^2 \mathrm{e}^{-2(\beta +\omega )(\tilde{\tau }-u)}\gamma \tilde{y}_i(u)\frac{\tilde{\eta }_E(u)}{\tilde{y}_E(u)}\\= & {} \gamma \mu _Db(\tilde{\tau })^2\mathrm{e}^{-2(\beta +\omega )\tilde{\tau }}\frac{\tilde{y}_E^{(2)}(u)}{\tilde{y}_E(u)}, \end{aligned}$$

where $\tilde{y}_E^{(2)}(u)=\sum _{i=1}^{\infty } i^2 \tilde{y}_i(u)$. Thus, using (6.11),

$$\begin{aligned} \sigma ^2_5=\gamma \mu _Db(\tilde{\tau })^2\mathrm{e}^{-2(\beta +\omega )\tilde{\tau }} \int _0^{\tilde{\tau }} \frac{\tilde{y}_E^{(2)}(u)}{\tilde{y}_E(u)}\,\mathrm{d}u. \end{aligned}$$

(E.1)

1.1.2 E.1.2 Integral for $\sigma ^2_4$

For $i=1,2,\ldots $, it follows from (3.4), (6.17) and (6.19) that

$$\begin{aligned} \varvec{c}(\tilde{\tau },u) (\varvec{l}_i^{(4)})^{\top }=-h_I(\tilde{\tau },u)-h_R(\tilde{\tau },u)=b(\tilde{\tau })\mathrm{e}^{-(\beta +\omega )(\tilde{\tau }-u)}-2h_I(\tilde{\tau },u), \end{aligned}$$

so, using (4.1),

$$\begin{aligned}&\sum _{i=1}^{\infty } {\left( \varvec{c}(\tilde{\tau },u) ({\varvec{l}_{i^{(4)}})^{\top }}\right) }^{2} \tilde{\beta }_{\varvec{l}}^{(4)}(\tilde{\varvec{w}}(u))\\&\quad =\sum _{i=1}^{\infty } \left( b(\tilde{\tau })\mathrm{e}^{-(\beta +\omega )(\tilde{\tau }-u)}-2h_I(\tilde{\tau },u)\right) ^2(\beta +\omega )i\tilde{y}_i(u)\frac{\tilde{z}_E(u)}{\tilde{y}_E(u)}\\&\quad = (\beta +\omega )\left( b(\tilde{\tau })\mathrm{e}^{-(\beta +\omega )(\tilde{\tau }-u)}-2h_I(\tilde{\tau },u)\right) ^2 \tilde{z}_E(u). \end{aligned}$$

Thus, using (6.11),

$$\begin{aligned} \sigma ^2_4=\int _0^{\tilde{\tau }} (\beta +\omega )\left( b(\tilde{\tau })\mathrm{e}^{-(\beta +\omega )(\tilde{\tau }-u)}-2h_I(\tilde{\tau },u)\right) ^2 \tilde{z}_E(u) \,\mathrm{d}u. \end{aligned}$$

(E.2)

1.1.3 E.1.3 Integral for $\sigma ^2_3$

For $i,j=1,2,\ldots $, it follows from (3.3), (6.17) and (6.20) that

$$\begin{aligned} \varvec{c}(\tilde{\tau },u) (\varvec{l}_{ij}^{(3)})^{\top }=-[2h_I(\tilde{\tau },u)+\hat{c}_j(\tilde{\tau },u)], \end{aligned}$$

where $\hat{c}_j(\tilde{\tau },u)=\tilde{c}_j(\tilde{\tau },u)-\tilde{c}_{j-1}(\tilde{\tau },u)$. Hence, using (4.1),

$$\begin{aligned} \sum _{i=1}^{\infty }\sum _{j=1}^{\infty } {\left( \varvec{c}(\tilde{\tau },u) ({\varvec{l}{_{ij}^{(3)}})^{\top }}\right) }^{2} \tilde{\beta }_{\varvec{l}}^{(3)}(\tilde{\varvec{w}}(u)) = \omega \sum _{j=1}^{\infty } \left( 2h_I(\tilde{\tau },u)+\hat{c}_j(\tilde{\tau },u)\right) ^2 j \tilde{x}_j(u) , \end{aligned}$$

and, using (6.11),

$$\begin{aligned} \sigma ^2_3=\omega \int _0^{\tilde{\tau }} \sum _{j=1}^{\infty } \left( 2h_I(\tilde{\tau },u)+\hat{c}_j(\tilde{\tau },u)\right) ^2 j \tilde{x}_j(u)\,\mathrm{d}u. \end{aligned}$$

(E.3)

1.1.4 E.1.4 Integral for $\sigma ^2_2$

For $i,j=1,2,\ldots $, it follows from (3.2) and (6.17) that

$$\begin{aligned} \varvec{c}(\tilde{\tau },u) (\varvec{l}_{ij}^{(2)})^{\top }=-2h_I(\tilde{\tau },u), \end{aligned}$$

so, using (4.1),

$$\begin{aligned} \sum _{i=1}^{\infty }\sum _{j=1}^{\infty } \left( \varvec{c}(\tilde{\tau },u) ({\varvec{l}{_{ij}^{(2)}})^{\top }}\right) ^{2} \tilde{\beta }_{\varvec{l}}^{(2)}(\tilde{\varvec{w}}(u)) =4 h_I(\tilde{\tau },u)^2 (\beta +\omega ) \tilde{y}_E(u), \end{aligned}$$

and, using (6.11),

$$\begin{aligned} \sigma ^2_2=4(\beta +\omega ) \int _0^{\tilde{\tau }} h_I(\tilde{\tau },u)^2 \tilde{y}_E(u) \,\mathrm{d}u. \end{aligned}$$

(E.4)

1.1.5 E.1.5 Integral for $\sigma ^2_1$

For $i,j=1,2,\ldots $, it follows from (3.1), (6.17) and (6.20) that

$$\begin{aligned} \varvec{c}(\tilde{\tau },u) (\varvec{l}_{ij}^{(1)})^{\top }=-[2h_I(\tilde{\tau },u)+\tilde{c}_j(\tilde{\tau },u)], \end{aligned}$$

so, using (4.1),

$$\begin{aligned} \sum _{i=1}^{\infty }\sum _{j=1}^{\infty } \left( \varvec{c}(\tilde{\tau },u) ({\varvec{l}{_{ij}^{(1)}})^{\top }}\right) ^{2} \tilde{\beta }_{\varvec{l}}^{(1)}(\tilde{\varvec{w}}(u)) = \beta \sum _{j=1}^{\infty } \left( 2h_I(\tilde{\tau },u)+\tilde{c}_j(\tilde{\tau },u)\right) ^2 j \tilde{x}_j(u) , \end{aligned}$$

and, using (6.11),

$$\begin{aligned} \sigma ^2_1=\beta \int _0^{\tilde{\tau }} \sum _{j=1}^{\infty } \left( 2h_I(\tilde{\tau },u)+\tilde{c}_j(\tilde{\tau },u)\right) ^2 j \tilde{x}_j(u)\,\mathrm{d}u. \end{aligned}$$

(E.5)

1.2 E.2 Evaluation of integrals

Recall that $\tilde{\eta }_E(u)=\tilde{x}_E(u)+\tilde{y}_E(u)+\tilde{z}_E(u)$. Then adding (E.2)–(E.5) gives,

$$\begin{aligned} \sum _{i=1}^4 \sigma _i^2=\sum _{i=1}^7 I_i, \end{aligned}$$

(E.6)

where

$$\begin{aligned} I_1= & {} 4(\beta +\omega )\int _0^{\tilde{\tau }} h_I(\tilde{\tau },u)^2 \tilde{\eta }_E(u) \,\mathrm{d}u, \end{aligned}$$

(E.7)

$$\begin{aligned} I_2= & {} -4(\beta +\omega )b(\tilde{\tau })\int _0^{\tilde{\tau }} h_I(\tilde{\tau },u) \mathrm{e}^{-(\beta +\omega )(\tilde{\tau }-u)} \tilde{z}_E(u) \,\mathrm{d}u,\end{aligned}$$

(E.8)

$$\begin{aligned} I_3= & {} (\beta +\omega )b(\tilde{\tau })^2\int _0^{\tilde{\tau }} \mathrm{e}^{-2(\beta +\omega )(\tilde{\tau }-u)} \tilde{z}_E(u) \,\mathrm{d}u,\end{aligned}$$

(E.9)

$$\begin{aligned} I_4= & {} 4 \omega \int _0^{\tilde{\tau }} h_I(\tilde{\tau },u)\sum _{j=1}^{\infty } \hat{c}_j(\tilde{\tau },u)j \tilde{x}_j(u) \,\mathrm{d}u,\end{aligned}$$

(E.10)

$$\begin{aligned} I_5= & {} \omega \int _0^{\tilde{\tau }} \sum _{j=1}^{\infty } \hat{c}_j(\tilde{\tau },u)^2j \tilde{x}_j(u) \,\mathrm{d}u,\end{aligned}$$

(E.11)

$$\begin{aligned} I_6= & {} 4 \beta \int _0^{\tilde{\tau }} h_I(\tilde{\tau },u)\sum _{j=1}^{\infty } \tilde{c}_j(\tilde{\tau },u)j \tilde{x}_j(u) \,\mathrm{d}u,\end{aligned}$$

(E.12)

$$\begin{aligned} I_7= & {} \beta \int _0^{\tilde{\tau }} \sum _{j=1}^{\infty } \tilde{c}_j(\tilde{\tau },u)^2j \tilde{x}_j(u) \,\mathrm{d}u. \end{aligned}$$

(E.13)

Recalling (5.9), (5.10) and (6.18) allows us to evaluate immediately $I_1, I_2$ and $I_3$:

$$\begin{aligned} I_1=&\frac{4\mu _Db(\tilde{\tau })^2 \mathrm{e}^{-2(\beta +\omega )\tilde{\tau }}}{\beta +\omega }\left[ \gamma ^2 \tilde{\tau }-\frac{2\gamma (\beta +\omega +\gamma )}{\beta +\omega }\left( 1-\mathrm{e}^{-(\beta +\omega )\tilde{\tau }}\right) \right. \nonumber \\&\qquad \qquad \qquad \qquad \qquad \qquad \left. +\frac{(\beta +\omega +\gamma )^2}{2(\beta +\omega )}\left( 1-\mathrm{e}^{-2(\beta +\omega )\tilde{\tau }}\right) \right] , \end{aligned}$$

(E.14)

$$\begin{aligned} I_2=&-4\frac{\gamma \mu _Db(\tilde{\tau })^2\mathrm{e}^{-(\beta +\omega )\tilde{\tau }}}{\beta +\omega } \left[ \gamma \tilde{\tau }\mathrm{e}^{-(\beta +\omega )\tilde{\tau }}\right. \nonumber \\&\qquad \qquad \qquad \qquad \qquad \qquad \quad \left. -\frac{(\beta +\omega +\gamma )\mathrm{e}^{-(\beta +\omega )\tilde{\tau }}+\gamma }{\beta +\omega } \left( 1-\mathrm{e}^{-(\beta +\omega )\tilde{\tau }}\right) \right. \nonumber \\&\qquad \qquad \qquad \qquad \qquad \qquad \quad \left. +\frac{\beta +\omega +\gamma }{2(\beta +\omega )}\left( 1-\mathrm{e}^{-2(\beta +\omega )\tilde{\tau }}\right) \right] ,\end{aligned}$$

(E.15)

$$\begin{aligned} I_3=&\frac{\gamma \mu _Db(\tilde{\tau })^2\mathrm{e}^{-(\beta +\omega )\tilde{\tau }}}{\beta +\omega } \left\{ 1-\mathrm{e}^{-(\beta +\omega )\tilde{\tau }}\left[ 1+(\beta +\omega )\tilde{\tau }\right] \right\} . \end{aligned}$$

(E.16)

For $j,k=0,1,\ldots $, let $j_{[k]}=j(j-1)\ldots (j-k+1)$ denote a falling factorial, with the convention that $j_{[0]}=1$. To calculate $I_4, I_5, I_6$ and $I_7$, observe first using (5.2) that, for $\theta \in [0,1]$ and $k=1,2,\ldots $,

$$\begin{aligned} \sum _{j=1}^{\infty } j_{[k]} \tilde{x}_j(u) \theta ^{j-k}= & {} \sum _{j=k}^{\infty } \frac{j!}{(j-k)!}\theta ^{j-k}\frac{ \mathrm{e}^{-(\beta +\omega )ju}}{j!}f_{D_{\varepsilon }}^{(j)}\left( p_{\omega }\left[ 1-\mathrm{e}^{-(\beta +\omega )u}\right] \right) \nonumber \\= & {} \mathrm{e}^{-(\beta +\omega )ku}\sum _{j=k}^{\infty }\frac{\left[ \theta \mathrm{e}^{-(\beta +\omega )u}\right] ^{j-k}}{(j-k)!}f_{D_{\varepsilon }}^{(j)}\left( p_{\omega }\left[ 1-\mathrm{e}^{-(\beta +\omega )u}\right] \right) \nonumber \\= & {} \mathrm{e}^{-k(\beta +\omega )u}f_{D_{\varepsilon }}^{(k)}\left( \theta \mathrm{e}^{-(\beta +\omega )u}+p_{\omega }\left[ 1-\mathrm{e}^{-(\beta +\omega )u}\right] \right) , \end{aligned}$$

(E.17)

and that

$$\begin{aligned} \mathrm{e}^{-(\beta +\omega )u}\psi (\tilde{\tau }-u)+p_{\omega }\left[ 1-\mathrm{e}^{-(\beta +\omega )u}\right] =\psi (\tilde{\tau }). \end{aligned}$$

Thus, using (E.17) with $\theta =\psi (\tilde{\tau }-u)$ and $k=1,2$,

and

$$\begin{aligned}&\sum _{j=1}^{\infty } \tilde{c}_{j-1}(\tilde{\tau },u)j \tilde{x}_j(u)\\&\quad =\sum _{j=1}^{\infty } j \tilde{x}_j(u)\psi (\tilde{\tau }-u)^{j-1}-b(\tilde{\tau })\mathrm{e}^{-(\beta +\omega )(\tilde{\tau }-u)}\sum _{j=2}^{\infty }j(j-1)\tilde{x}_j(u)\psi (\tilde{\tau }-u)^{j-2}\\&\quad =\mathrm{e}^{-(\beta +\omega )u}\left[ f_{D_{\varepsilon }}'(\psi (\tilde{\tau }))-b(\tilde{\tau })\mathrm{e}^{-(\beta +\omega )\tilde{\tau }}f_{D_{\varepsilon }}^{(2)}(\psi (\tilde{\tau }))\right] . \end{aligned}$$

Hence, recalling (6.18),

$$\begin{aligned} I_4+I_6=&4\left\{ \beta \left[ f_{D_{\varepsilon }}'(\psi (\tilde{\tau }))-b(\tilde{\tau })\mathrm{e}^{-(\beta +\omega )\tilde{\tau }}f_{D_{\varepsilon }}^{(2)}(\psi (\tilde{\tau }))\right] \mathrm{e}^{-(\beta +\omega )\tilde{\tau }}\right. \nonumber \\&\qquad \qquad \left. -4 (\beta +\omega ) b(\tilde{\tau })\mathrm{e}^{-(\beta +\omega )\tilde{\tau }}f_{D_{\varepsilon }}'(\psi (\tilde{\tau }))\right\} I_8, \end{aligned}$$

(E.18)

where

$$\begin{aligned} I_8&=\int _0^{\tilde{\tau }} h_I(\tilde{\tau },u) \,\mathrm{d}u \nonumber \\&= -\frac{b(\tilde{\tau })}{\beta +\omega }\left[ \frac{\gamma \left( 1-\mathrm{e}^{-(\beta +\omega )\tilde{\tau }}\right) }{\beta +\omega }-\frac{(\beta +\omega +\gamma )\left( 1-\mathrm{e}^{-2(\beta +\omega )\tilde{\tau }}\right) }{2(\beta +\omega )}\right] . \end{aligned}$$

(E.19)

Turning to $I_5$ and $I_7$, note that

$$\begin{aligned} \tilde{c}_j(\tilde{\tau },u)=\psi (\tilde{\tau }-u)^{j-1}\left( \psi (\tilde{\tau }-u)-b(\tilde{\tau })j\mathrm{e}^{-(\beta +\omega )(\tilde{\tau }-u)}\right) , \end{aligned}$$

(E.20)

so

$$\begin{aligned} \sum _{j=1}^{\infty } \tilde{c}_j(\tilde{\tau },u)^2j \tilde{x}_j(u)&=\psi (\tilde{\tau }-u)^2 S_1(\tilde{\tau },u) -2b(\tilde{\tau })\psi (\tilde{\tau }-u)\mathrm{e}^{-(\beta +\omega )(\tilde{\tau }-u)}S_2(\tilde{\tau },u)\\&\quad +\,b(\tilde{\tau })^2 \mathrm{e}^{-2(\beta +\omega )(\tilde{\tau }-u)}S_3(\tilde{\tau },u), \end{aligned}$$

where

$$\begin{aligned} S_k(\tilde{\tau },u)=\sum _{j=1}^{\infty }\psi (\tilde{\tau }-u)^{2(j-1)} j^k \tilde{x}_j(u) \qquad (k=1,2,3). \end{aligned}$$

Let

$$\begin{aligned} \psi _2(\tilde{\tau },u)=\mathrm{e}^{-(\beta +\omega )u}\psi (\tilde{\tau }-u)^2 +p_{\omega }\left( 1-\mathrm{e}^{-(\beta +\omega )u}\right) . \end{aligned}$$

(E.21)

Then, since $j^2=j_{[2]}+j$ and $j^3=j_{[3]}+3j_{[2]}+j$, it follows using (E.17) that

$$\begin{aligned} S_1(\tilde{\tau },u)&= \mathrm{e}^{-(\beta +\omega )u}f_{D_{\varepsilon }}'(\psi _2(\tilde{\tau },u)),\\ S_2(\tilde{\tau },u)&= \psi (\tilde{\tau }-u)^2 \mathrm{e}^{-2(\beta +\omega )u}f_{D_{\varepsilon }}^{(2)}(\psi _2(\tilde{\tau },u))+ \mathrm{e}^{-(\beta +\omega )u}f_{D_{\varepsilon }}'(\psi _2(\tilde{\tau },u)),\\ S_3(\tilde{\tau },u)&= \psi (\tilde{\tau }-u)^4 \mathrm{e}^{-3(\beta +\omega )u}f_{D_{\varepsilon }}^{(3)}(\psi _2(\tilde{\tau },u))\\&\quad +\,3\psi (\tilde{\tau }-u)^2 \mathrm{e}^{-2(\beta +\omega )u}f_{D_{\varepsilon }}^{(2)}(\psi _2(\tilde{\tau },u))+\mathrm{e}^{-(\beta +\omega )u}f_{D_{\varepsilon }}'(\psi _2(\tilde{\tau },u)), \end{aligned}$$

whence

$$\begin{aligned}&\sum _{j=1}^{\infty } \tilde{c}_j(\tilde{\tau },u)^2j \tilde{x}_j(u)\nonumber \\&\quad =\left[ \psi (\tilde{\tau }-u)-b(\tilde{\tau })\mathrm{e}^{-(\beta +\omega )(\tilde{\tau }-u)}\right] ^2 \mathrm{e}^{-(\beta +\omega )u} f_{D_{\varepsilon }}'(\psi _2(\tilde{\tau },u))\nonumber \\&\qquad +b(\tilde{\tau }) \psi (\tilde{\tau }-u)^2 \mathrm{e}^{-(\beta +\omega )(\tilde{\tau }+u)}\left[ 3b(\tilde{\tau })\mathrm{e}^{-(\beta +\omega )(\tilde{\tau }-u)}-2\psi (\tilde{\tau }-u)\right] f_{D_{\varepsilon }}^{(2)}(\psi _2(\tilde{\tau },u))\nonumber \\&\qquad +\,b(\tilde{\tau })^2 \psi (\tilde{\tau }-u)^4 \mathrm{e}^{-(\beta +\omega )(2\tilde{\tau }+u)} f_{D_{\varepsilon }}^{(3)}(\psi _2(\tilde{\tau },u)). \end{aligned}$$

(E.22)

Further, (E.20) implies

$$\begin{aligned} \hat{c}_j(\tilde{\tau },u)&=\psi (\tilde{\tau }-u)^{j-2}\left\{ \psi (\tilde{\tau }-u)\left[ \psi (\tilde{\tau }-u)-1-b(\tilde{\tau })\mathrm{e}^{-(\beta +\omega )(\tilde{\tau }-u)}\right] \right. \\&\left. \qquad \qquad \qquad \qquad \quad -(j-1)b(\tilde{\tau })(\psi (\tilde{\tau }-u)-1)\mathrm{e}^{-(\beta +\omega )(\tilde{\tau }-u)}\right\} , \end{aligned}$$

so

(E.23)

To calculate the integral in (E.1) for $\sigma ^2_5$, let

$$\begin{aligned} \tilde{y}_E^{[2]}(t)=\sum _{i=2}^{\infty } i(i-1)\tilde{y}_i(t) \end{aligned}$$

(E.24)

and note that

$$\begin{aligned} \sum _{i=2}^{\infty } \left[ (i+1)i(i-1)\tilde{y}_{i+1}(t)-i^2(i-1)\tilde{y}_i(t)\right] =-2\tilde{y}_E^{[2]}(t). \end{aligned}$$

Multiplying (4.4) by $i(i-1)$ and summing over $i=2,3,\ldots $ yields, after recalling (5.9) and invoking (E.17) with $\theta =1$ and $k=3$, that

$$\begin{aligned} \dfrac{d\tilde{y}_E^{[2]}}{dt}+2(\beta +\omega )\tilde{y}_E^{[2]}=-\mu _D\mathrm{e}^{-2(\beta +\omega )t}[2(\beta +\omega )+\gamma ] \frac{\tilde{y}_E^{[2]}}{\tilde{y}_E}+\beta \mathrm{e}^{-3(\beta +\omega )t}f_{D_{\varepsilon }}^{(3)}\left( \psi (t)\right) , \end{aligned}$$

so

$$\begin{aligned} \dfrac{d}{dt}\left( \mathrm{e}^{2(\beta +\omega )t}\tilde{y}_E^{[2]}(t)\right)= & {} -\mu _D[2(\beta +\omega )+\gamma ]\frac{\tilde{y}_E^{[2]}(t)}{\tilde{y}_E(t)}+\beta \mathrm{e}^{-(\beta +\omega )t} f_{D_{\varepsilon }}^{(3)}\left( \psi (t)\right) \\= & {} -\mu _D[2(\beta +\omega )+\gamma ]\frac{\tilde{y}_E^{[2]}(t)}{\tilde{y}_E(t)} -\dfrac{d}{dt}\left[ f_{D_{\varepsilon }}^{(2)}\left( \psi (t)\right) \right] , \end{aligned}$$

since $p_{\omega }=\frac{\omega }{\beta +\omega }$. Thus,

$$\begin{aligned} \int _0^{\tilde{\tau }} \frac{\tilde{y}_E^{[2]}(u)}{\tilde{y}_E(u)}\,\mathrm{d}u= & {} \left[ -\frac{1}{\mu _D[2(\beta +\omega )+\gamma ]}\left( \mathrm{e}^{2(\beta +\omega )u}\tilde{y}_E^{[2]}(u)+ f_{D_{\varepsilon }}^{(2)}\left( \psi (u)\right) \right) \right] _0^{\tilde{\tau }}\\= & {} \frac{1}{\mu _D[2(\beta +\omega )+\gamma ]}\left[ \tilde{y}_E^{[2]}(0)+f_{D_{\varepsilon }}^{(2)}(1)- f_{D_{\varepsilon }}^{(2)}\left( \psi (\tilde{\tau })\right) \right] , \end{aligned}$$

as $\tilde{y}_i(\tilde{\tau })=0$$(i=1,2,\ldots )$. Using (E.24) gives $\tilde{y}_E^{[2]}(0)=\sum _{i=2}^{\infty } i(i-1)\varepsilon _i$ and differentiating (5.3) twice yields $f_{D_{\varepsilon }}^{(2)}(1)=\sum _{i=2}^{\infty }i(i-1)(p_i-\varepsilon _i)$. Thus, $\tilde{y}_E^{[2]}(0)+f_{D_{\varepsilon }}^{(2)}(1)=\sum _{i=2}^{\infty }i(i-1)p_i=f_D''(1)$. Further, $\tilde{y}_E^{(2)}(u)=\tilde{y}_E^{[2]}(u)+\tilde{y}_E(u)$, so $\int _0^{\tilde{\tau }} \frac{\tilde{y}_E^{(2])}(u)}{\tilde{y}_E(u)}\,\mathrm{d}u =\tilde{\tau }+\int _0^{\tilde{\tau }} \frac{\tilde{y}_E^{[2]}(u)}{\tilde{y}_E(u)}\,\mathrm{d}u$ and using (E.1),

$$\begin{aligned} \sigma ^2_5=\gamma \mu _Db(\tilde{\tau })^2\mathrm{e}^{-2(\beta +\omega )\tilde{\tau }}\left\{ \tilde{\tau }+ \frac{1}{\mu _D[2(\beta +\omega )+\gamma ]}\left[ f_D''(1)- f_{D_{\varepsilon }}^{(2)}\left( \psi (\tilde{\tau })\right) \right] \right\} . \end{aligned}$$

(E.25)

1.3 E.3 Expression for $\sigma ^2_{\mathrm{MR}}(\beta ,\omega ,\gamma )$

We now use (6.10), (E.6) and (E.25) to obtain an expression for $\sigma ^2_{\mathrm{MR}}(\beta ,\omega ,\gamma )$.

Let $z=\mathrm{e}^{-(\beta +\omega )\tilde{\tau }}$. Then, since $\tilde{\tau }$ is the unique solution in $(0,\infty )$ of (5.24), z is the unique solution in [0, 1) of (5.25). Recall that $\widetilde{\psi }(z)=p_{\omega }+(1-p_{\omega })z$. It then follows from (6.13) that

$$\begin{aligned} a(\tilde{\tau })=z^2\left[ \beta f_{D_{\varepsilon }}''\left( \widetilde{\psi }(z)\right) -(\beta +\omega +\gamma )\mu _D\right] , \end{aligned}$$

whence, using $b(\tilde{\tau })=a(\tilde{\tau })^{-1}\beta \tilde{x}_E(\tilde{\tau })$ and (5.4)

$$\begin{aligned} b(\tilde{\tau })= & {} \frac{\beta zf_{D_{\varepsilon }}'\left( \widetilde{\psi }(z)\right) }{z^2 \left[ \beta f_{D_{\varepsilon }}''\left( \widetilde{\psi }(z)\right) -(\beta +\omega +\gamma )\mu _D\right] }\nonumber \\= & {} \frac{\beta \left[ \frac{(\beta +\omega +\gamma )z-\gamma }{\beta +\omega }\right] \mu _D}{z\left[ \beta f_{D_{\varepsilon }}''\left( \widetilde{\psi }(z)\right) -(\beta +\omega +\gamma )\mu _D\right] }, \end{aligned}$$

(E.26)

using (5.25), so $b(\tilde{\tau })=\tilde{b}(z)$ defined at (6.3). For future reference, note that (E.26) implies

$$\begin{aligned} b(\tilde{\tau })zf_{D_{\varepsilon }}''\left( \widetilde{\psi }(z)\right)= & {} f_{D_{\varepsilon }}'\left( \widetilde{\psi }(z)\right) +\frac{(\beta +\omega +\gamma )}{\beta }zb(\tilde{\tau })\mu _D\end{aligned}$$

(E.27)

$$\begin{aligned}= & {} \left[ (\beta +\omega +\gamma )\left( \frac{1}{\beta +\omega } +\frac{b(\tilde{\tau })}{\beta }\right) z-\frac{\gamma }{\beta +\omega }\right] \mu _D. \end{aligned}$$

(E.28)

Adding (E.14) and (E.15) yields, after substituting $z=\mathrm{e}^{-(\beta +\omega )\tilde{\tau }}$,

$$\begin{aligned} I_1+I_2=&2\frac{\mu _Db(\tilde{\tau })^2 z(1-z)}{(\beta +\omega )^2} \Big [\gamma (\gamma -\beta -\omega )\nonumber \\&\qquad \qquad \qquad \qquad \qquad +(\beta +\omega +\gamma )(\beta +\omega -2\gamma )z+(\beta +\omega +\gamma )^2 z^2\Big ]. \end{aligned}$$

(E.29)

Using (E.18) and (E.27),

$$\begin{aligned} I_4+I_6= & {} -4\left[ (\beta +\omega +\gamma )z^2 b(\tilde{\tau })\mu _D-4(\beta +\omega )z b(\tilde{\tau })f_{D_{\varepsilon }}'\left( \widetilde{\psi }(z)\right) \right] I_8\\= & {} -4z b(\tilde{\tau })\mu _D\left[ (2(\beta +\omega +\gamma )z-\gamma \right] I_8, \end{aligned}$$

using (5.25). Substituting for $I_8$ from (E.19) and rearranging then gives

$$\begin{aligned} I_4+I_6=2\frac{b(\tilde{\tau })^2z(1-z)}{(\beta +\omega )^2}\left[ 2(\beta +\omega +\gamma )z-\gamma \right] \left[ \gamma -\beta -\omega -(\beta +\omega +\gamma )z\right] \mu _D. \end{aligned}$$

(E.30)

Adding (E.29) and (E.30) yields after a little algebra that

$$\begin{aligned}&I_1+I_2+I_4+I_6\nonumber \\&\quad =2\frac{(\beta +\omega +\gamma )[\gamma -\beta -\omega -(\beta +\omega +\gamma )z]}{(\beta +\omega )^2}\mu _Db(\tilde{\tau })^2 z^2(1-z). \end{aligned}$$

(E.31)

Adding (E.16) and (E.25) yields

$$\begin{aligned} I_3+\sigma ^2_5=\frac{\gamma }{\beta +\omega }\mu _Db(\tilde{\tau })^2z(1-z)+\frac{\gamma }{2(\beta +\omega )+\gamma }b(\tilde{\tau })^2 z^2\left[ f_D''(1)-f_{D_{\varepsilon }}''\left( \widetilde{\psi }(z)\right) \right] . \end{aligned}$$

Substituting from (E.28) and noting that $f_D''(1)=\sigma _D^2+\mu _D^2-\mu _D$ yields, after a little algebra, that

$$\begin{aligned} I_3+\sigma ^2_5=&\frac{\gamma }{\beta (\beta +\omega )}\mu _Db(\tilde{\tau })^2 z\left[ \beta -(2\beta +\omega )z\right] \nonumber \\&+ \frac{\gamma }{\beta [2(\beta +\omega )+\gamma ]}b(\tilde{\tau })^2 z^2\left[ \beta (\sigma _D^2+\mu _D^2)+\omega \mu _D\right] \nonumber \\&-\frac{\gamma [(\beta +\omega +\gamma )z-\gamma ]z}{[2(\beta +\omega )+\gamma ](\beta +\omega )} \mu _Db(\tilde{\tau }). \end{aligned}$$

(E.32)

Adding (E.31) and (E.32), and comparing with (6.2), shows that

$$\begin{aligned} I_1+I_2+I_3+I_4+I_6+\sigma ^2_5=\sigma ^2_{\mathrm{MR}}(\beta ,\omega ,\gamma )-I_A-I_B-I_C-I_D, \end{aligned}$$

(E.33)

with $I_A, I_B, I_C$ and $I_D$ given by (6.4)–(6.7). Making the substitution $v=\mathrm{e}^{-(\beta +\omega )u}$ in the integrals in (E.11) and (E.13), using the expressions (E.22) and (E.23) for the respective integrands, shows that

$$\begin{aligned} I_5+I_7=I_A+I_B+I_C+I_D. \end{aligned}$$

(E.34)

The expression (6.2) for $=\sigma ^2_{\mathrm{MR}}(\beta ,\omega ,\gamma )$ then follows using (6.10) and (E.6).

F Proof of Lemma 1

For $k=1,2,\ldots $, let $X_k^{(\gamma ,\omega )}=k-Y_k^{(\gamma ,\omega )}$ and $X_k^{(\gamma +\omega ,0)}=k-Y_k^{(\gamma +\omega ,0)}$. Thus, for example, $X_k^{(\gamma ,\omega )}$ is the number of neighbours an infective, $i^*$ say, with k susceptible neighbours fails to infect in the dropping model. For $k,r \in \mathbb {Z}_+$, let $k_{[r]}=k(k-1)\ldots (k-r+1)$ denote a falling factorial, with the convention that $k_{[0]}=1$. Further let $\mu _{k,[r]}^{(\gamma ,\omega )}=\mathrm{E}\left[ X_{k,[r]}^{(\gamma ,\omega )}\right] $, where $X_{k,[r]}^{(\gamma ,\omega )}=X_k^{(\gamma ,\omega )}(X_k^{(\gamma ,\omega )}-1)\ldots (X_k^{(\gamma ,\omega )}-r+1)$, be the rth factorial moment of $X_k^{(\gamma ,\omega )}$ and define $\mu _{k,[r]}^{(\gamma +\omega ,0)}$ analogously for the modified model. Note that $\mu _{k,[r]}^{(\gamma ,\omega )}= \mu _{k,[r]}^{(\gamma +\omega ,0)}=0$ for all $r>k$. We prove first that

$$\begin{aligned} \mu _{k,[r]}^{(\gamma ,\omega )} \le \mu _{k,[r]}^{(\gamma +\omega ,0)}\quad \text{ for } \text{ all } k,r, \end{aligned}$$

(F.1)

with strict inequality for $2 \le r \le k$, and then consider the Taylor expansions of $f_k^{(\gamma ,\omega )}(s)$ and $f_k^{(\gamma +\omega ,0)}(s)$ about $s=1$ to prove Lemma 1.

To determine the factorial moment $\mu _{k,[r]}^{(\gamma ,\omega )}$, fix $k \ge 1$, give the k neighbours of $i^*$ the labels $1,2,\ldots ,k$ and let $A_k^{(\gamma ,\omega )}$ be the set of neighbours that are not infected by $i^*$. Then, for any $B \subseteq \{1,2,\ldots ,k\}$, $\mathrm{P}\left( A_k^{(\gamma ,\omega )}=B\right) $ depends on B only through its size |B|, so $A_k^{(\gamma ,\omega )}$ is a symmetric sampling procedure (Martin-Löf 1986). It follows from Lemma 1 in that paper that $\mu _{k,[r]}^{(\gamma ,\omega )}= k_{[r]} P_{k,r}^{(\gamma ,\omega )}$$(r=0,1,\ldots ,k)$, where $P_{k,r}^{(\gamma ,\omega )}$ is the probability that no one in any fixed set of r neighbours of $i^*$ is infected by $i^*$, with $P_{k,0}^{(\gamma ,\omega )}=1$. Similarly, in an obvious notation, $\mu _{k,[r]}^{(\gamma +\omega ,0)}= k_{[r]} P_{k,r}^{(\gamma +\omega ,0)}$$(r=0,1,\ldots ,k)$. To prove (F.1), we assume without loss of generality that $\gamma =1$, since otherwise time can be rescaled linearly so that $\gamma =1$. Note that $P_{k,r}^{(1,\omega )}= P_{r,r}^{(1,\omega )}= \mathrm{P}\left( Y_r^{(1,\omega )}=0\right) $, so using (8.1),

$$\begin{aligned} P_{k,r}^{(1,\omega )}= & {} \mathrm{E}\left[ \left( 1-\frac{\beta }{\beta +\omega }\left( 1-\mathrm{e}^{-(\beta +\omega )I}\right) \right) ^r\right] \\= & {} \mathrm{E}\left[ \left( \frac{\omega }{\beta +\omega }+\frac{\beta }{\beta +\omega }\mathrm{e}^{-(\beta +\omega )I}\right) ^r\right] \\= & {} \sum _{i=0}^r \left( {\begin{array}{c}r\\ i\end{array}}\right) \left( \frac{\omega }{\beta +\omega }\right) ^{r-i} \left( \frac{\beta }{\beta +\omega }\right) ^i \mathrm{E}\left[ \mathrm{e}^{-i(\beta +\omega )I}\right] \\= & {} \sum _{i=0}^r \left( {\begin{array}{c}r\\ i\end{array}}\right) \left( \frac{\omega }{\beta +\omega }\right) ^{r-i} \left( \frac{\beta }{\beta +\omega }\right) ^i \frac{1}{1+i(\beta +\omega )}, \end{aligned}$$

since $I \sim \mathrm{Exp}(1)$. A similar but simpler argument using (8.2) yields

$$\begin{aligned} P_{k,r}^{(1+\omega ,0)}=\frac{1+\omega }{1+\omega +r\beta }. \end{aligned}$$

Thus $\mu _{k,[r]}^{(1,\omega )} \le \mu _{k,[r]}^{(1+\omega ,0)}$ for all k, r if and only if

$$\begin{aligned} \sum _{i=0}^r \left( {\begin{array}{c}r\\ i\end{array}}\right) \left( \frac{\omega }{\beta +\omega }\right) ^{r-i} \left( \frac{\beta }{\beta +\omega }\right) ^i \frac{1}{1+i(\beta +\omega )} \le \frac{1+\omega }{1+\omega +r\beta } , \end{aligned}$$

(F.2)

$(r=0,1,\ldots )$, which we now show.

First note that both sides of (F.2) equal 1 when $r=0$. Suppose $r>0$. Then

$$\begin{aligned} \sum _{i=0}^r&\left( {\begin{array}{c}r\\ i\end{array}}\right) \left( \frac{\omega }{\beta +\omega }\right) ^{r-i} \left( \frac{\beta }{\beta +\omega }\right) ^i \frac{1}{1+i(\beta +\omega )} \le \frac{1+\omega }{1+\omega +r\beta }\\&\iff \sum _{i=0}^r \left( {\begin{array}{c}r\\ i\end{array}}\right) \left( \frac{\omega }{\beta +\omega }\right) ^{r-i} \left( \frac{\beta }{\beta +\omega }\right) ^i \left[ 1-\frac{i(\beta +\omega )}{1+i(\beta +\omega )}\right] \le 1-\frac{r\beta }{1+\omega +r\beta }\\&\iff \sum _{i=0}^r \left( {\begin{array}{c}r\\ i\end{array}}\right) \left( \frac{\omega }{\beta +\omega }\right) ^{r-i} \left( \frac{\beta }{\beta +\omega }\right) ^i \frac{i(\beta +\omega )}{1+i(\beta +\omega )} \ge \frac{r\beta }{1+\omega +r\beta }\\&\iff r \beta \sum _{i=1}^r \left( {\begin{array}{c}r-1\\ i-1\end{array}}\right) \left( \frac{\omega }{\beta +\omega }\right) ^{r-i}\left( \frac{\beta }{\beta +\omega }\right) ^{i-1} \frac{1}{1+i(\beta +\omega )} \ge \frac{r\beta }{1+\omega +r\beta }\\&\iff \sum _{i=0}^{r-1}\left( {\begin{array}{c}r-1\\ i\end{array}}\right) \left( \frac{\omega }{\beta +\omega }\right) ^{r-1-i}\left( \frac{\beta }{\beta +\omega }\right) ^i \frac{1}{1+(i+1)(\beta +\omega )} \ge \frac{1}{1+\omega +r\beta }\\&\iff H(r) \ge 0, \end{aligned}$$

where

$$\begin{aligned}&H(r)\\&\quad =\sum _{i=0}^{r-1} \left( {\begin{array}{c}r-1\\ i\end{array}}\right) \left( \frac{\omega }{\beta +\omega }\right) ^{r-1-i}\left( \frac{\beta }{\beta +\omega }\right) ^i \left[ \frac{1}{1+(i+1)(\beta +\omega )}-\frac{1}{1+\omega +r\beta }\right] . \end{aligned}$$

Now $H(1)=0$, so $\mu _{k,[1]}^{(1,\omega )} = \mu _{k,[1]}^{(1+\omega ,0)}$$(k=0,1,\ldots )$, as noted (in a different notation) after (8.2). For $r \ge 2$,

$$\begin{aligned} H(r)= & {} \sum _{i=0}^{r-1}\left( {\begin{array}{c}r-1\\ i\end{array}}\right) \left( \frac{\omega }{\beta +\omega }\right) ^{r-1-i}\left( \frac{\beta }{\beta +\omega }\right) ^i \left[ \frac{(r-1-i)\beta -i\omega }{[1+(i+1)(\beta +\omega )](1+\omega +r\beta )}\right] \\= & {} \frac{1}{1+\omega +r\beta }\left( \frac{1}{\beta +\omega }\right) ^{r-1}\tilde{H}(r), \end{aligned}$$

where

$$\begin{aligned} \tilde{H}(r)=&\sum _{i=0}^{r-2}\left( {\begin{array}{c}r-1\\ i\end{array}}\right) \omega ^{r-1-i}\beta ^i\frac{(r-1-i)\beta }{1+(i+1)(\beta +\omega )}\\&-\sum _{i=1}^{r-1}\left( {\begin{array}{c}r-1\\ i\end{array}}\right) \omega ^{r-1-i}\beta ^i \frac{i \omega }{1+(i+1)(\beta +\omega )}\\ =&\sum _{i=0}^{r-2}\left( {\begin{array}{c}r-1\\ i\end{array}}\right) \omega ^{r-1-i}\beta ^{i+1}\frac{(r-1-i)}{1+(i+1)(\beta +\omega )}\\&-\sum _{i=0}^{r-2}\left( {\begin{array}{c}r-1\\ i+1\end{array}}\right) \omega ^{r-1-i}\beta ^{i+1}\frac{i+1}{1+(i+2)(\beta +\omega )}\\ =&(r-1)\sum _{i=0}^{r-2} \left( {\begin{array}{c}r-2\\ i\end{array}}\right) \omega ^{r-1-i}\beta ^{i+1}\left[ \frac{1}{1+(i+1)(\beta +\omega )}-\frac{1}{1+(i+2)(\beta +\omega )}\right] \\ >&0. \end{aligned}$$

Thus, $H(r)>0$ for $r=2,3,\ldots $, proving (F.1).

Turning to Lemma 1 note that for $k=1,2,\ldots $ and $s \ne 0$, $f_k^{(\gamma ,\omega )}(s)=s^k \hat{f}_k^{(\gamma ,\omega )}(s^{-1})$, where $\hat{f}_k^{(\gamma ,\omega )}(s)=\mathrm{E}\left[ s^{X_k^{(\gamma ,\omega )}}\right] $$(s \in \mathbb {R})$ is the PGF of $X_k^{(\gamma ,\omega )}$. Similarly, in an obvious notation, $f_k^{(\gamma +\omega ,0)}(s)=s^k \hat{f}_k^{(\gamma +\omega ,0)}(s^{-1})$. Now, for $s<1$,

$$\begin{aligned} \hat{f}_k^{(\gamma ,\omega )}(s^{-1}) = \sum _{r=0}^k \mu _{k,[r]}^{(\gamma ,\omega )} (s^{-1}-1)^r \le \sum _{r=0}^k \mu _{k,[r]}^{(\gamma +\omega ,0)} (s^{-1}-1)^r = \hat{f}_k^{(\gamma +\omega ,0)}(s^{-1}), \end{aligned}$$

with strict inequality if $k \ge 2$. Thus, $f_k^{(\gamma ,\omega )}(s) \le f_k^{(\gamma +\omega ,0)}(s)$ for all $s \in (0,1)$, again with strict inequality if $k \ge 2$, proving Lemma 1 for $s \in (0,1)$. The lemma holds trivially when $s=1$ since $f_k^{(\gamma ,\omega )}(1)=f_k^{(\gamma +\omega ,0)}(1)=1$. Finally, note that $f_k^{(\gamma ,\omega )}(0)=\mathrm{P}(Y_k^{(\gamma ,\omega )}=0)=\mathrm{P}(X_k^{(\gamma ,\omega )}=k)=\mu _{k,[k]}^{(\gamma ,\omega )}/k_{[k]}$ and, similarly, $f_k^{(\gamma +\omega ,0)}(0)= \mu _{k,[k]}^{(\gamma +\omega ,0)}/k_{[k]}$, so (F.1) implies the lemma holds also when $s=0$.

G Derivation of asymptotic variances in Conjecture 9.1

In this appendix we derive the expressions for $\sigma ^2_{\mathrm{MRND}}(\beta ,\gamma )$ and $\sigma ^2_{\mathrm{NSW}}(\beta ,\gamma )$ given in Conjecture 9.1 by setting $\omega =0$ in Conjectures 6.1 and 7.1. We consider first the epidemic on an MR random network.

From (E.11) and (E.34), $I_A+I_B+I_C+I_D=I_7$, since $\omega =0$. We derive a closed-form expression for $I_7$ when $\omega =0$. Note that now $p_{\omega }=0$, so using (5.5) and (E.21), $\psi (t)=\mathrm{e}^{-\beta t}$ and $\psi _2(\tilde{\tau },u)=\mathrm{e}^{-\beta (2\tilde{\tau }-u)}$. Substituting these into (E.22) yields

$$\begin{aligned} \sum _{j=1}^{\infty } \tilde{c}_j(\tilde{\tau },u)^2j \tilde{x}_j(u)&= (1-b(\tilde{\tau }))^2 \mathrm{e}^{-\beta (2\tilde{\tau }-u)}f_{D_{\varepsilon }}'\left( \mathrm{e}^{-\beta (2\tilde{\tau }-u)}\right) \nonumber \\&\quad +\,b(\tilde{\tau })(3b(\tilde{\tau })-2)\mathrm{e}^{-2\beta (2\tilde{\tau }-u)}f_{D_{\varepsilon }}^{(2)}\left( \mathrm{e}^{-\beta (2\tilde{\tau }-u)}\right) \nonumber \\&\quad +\,b(\tilde{\tau })^2 \mathrm{e}^{-3\beta (2\tilde{\tau }-u)}f_{D_{\varepsilon }}^{(3)}\left( \mathrm{e}^{-\beta (2\tilde{\tau }-u)}\right) . \end{aligned}$$

(G.1)

For $k=0,1,\ldots $, let

$$\begin{aligned} J_k=\int _0^{\tilde{\tau }} \mathrm{e}^{-k\beta (2\tilde{\tau }-u)}f_{D_{\varepsilon }}^{(k)}\left( \mathrm{e}^{-\beta (2\tilde{\tau }-u)}\right) \,\mathrm{d}u. \end{aligned}$$

Integrating by parts, for $k=1,2,\ldots $,

$$\begin{aligned} J_k=&\left[ \mathrm{e}^{-(k-1)\beta (2\tilde{\tau }-u)} \frac{1}{\beta } f_{D_{\varepsilon }}^{(k-1)}\left( \mathrm{e}^{-\beta (2\tilde{\tau }-u)}\right) \right] _0^{\tilde{\tau }}\nonumber \\&-\int _0^{\tilde{\tau }} (k-1) \beta \mathrm{e}^{-(k-1)\beta (2\tilde{\tau }-u)}\frac{1}{\beta } f_{D_{\varepsilon }}^{(k-1)}\left( \mathrm{e}^{-\beta (2\tilde{\tau }-u)}\right) \,\mathrm{d}u\nonumber \\ =&\frac{1}{\beta }\left[ \mathrm{e}^{-(k-1)\beta \tilde{\tau }} f_{D_{\varepsilon }}^{(k-1)}\left( \mathrm{e}^{-\beta \tilde{\tau }}\right) - \mathrm{e}^{-2(k-1)\beta \tilde{\tau }}f_{D_{\varepsilon }}^{(k-1)}\left( \mathrm{e}^{-2\beta \tilde{\tau }}\right) \right] -(k-1)J_{k-1}, \end{aligned}$$

(G.2)

so, setting $k=1$,

$$\begin{aligned} J_1=\frac{1}{\beta }\left[ f_{D_{\varepsilon }}\left( \mathrm{e}^{-\beta \tilde{\tau }}\right) -f_{D_{\varepsilon }}\left( \mathrm{e}^{-2\beta \tilde{\tau }}\right) \right] . \end{aligned}$$

(G.3)

Substituting (G.1) into (E.13), and using (G.3) and (G.2) with $k=2,3$ yields

$$\begin{aligned} I_7=&f_{D_{\varepsilon }}\left( \mathrm{e}^{-\beta \tilde{\tau }}\right) -f_{D_{\varepsilon }}\left( \mathrm{e}^{-2\beta \tilde{\tau }}\right) \nonumber \\&+b(\tilde{\tau })(b(\tilde{\tau })-2)\left[ \mathrm{e}^{-\beta \tilde{\tau }} f_{D_{\varepsilon }}'\left( \mathrm{e}^{-\beta \tilde{\tau }}\right) - \mathrm{e}^{-2\beta \tilde{\tau }}f_{D_{\varepsilon }}'\left( \mathrm{e}^{-2\beta \tilde{\tau }}\right) \right] \nonumber \\&+b(\tilde{\tau })^2 \left[ \mathrm{e}^{-2\beta \tilde{\tau }} f_{D_{\varepsilon }}''\left( \mathrm{e}^{-\beta \tilde{\tau }}\right) - \mathrm{e}^{-4\beta \tilde{\tau }}f_{D_{\varepsilon }}''\left( \mathrm{e}^{-2\beta \tilde{\tau }}\right) \right] . \end{aligned}$$

(G.4)

Recall that $z=\mathrm{e}^{-\beta \tilde{\tau }}$ and $\tilde{b}(z)=b(\tilde{\tau })$. Setting $\omega =0$ in (E.28) gives

$$\begin{aligned} \tilde{b}(z)z f_{D_{\varepsilon }}''(z)=\left[ \frac{(\beta +\gamma )(1+\tilde{b}(z))z-\gamma }{\beta }\right] \mu _D. \end{aligned}$$

Substituting these into (G.4) and using (9.1) yields

$$\begin{aligned} I_7&=f_{D_{\varepsilon }}\left( z\right) -f_{D_{\varepsilon }}\left( z^2\right) -\tilde{b}(z)(\tilde{b}(z)-2)z^2f_{D_{\varepsilon }}'\left( z^2\right) -\tilde{b}(z)^2 z^4 f_{D_{\varepsilon }}''\left( z^2\right) \nonumber \\&\quad +\tilde{b}(z)^2 z \left( \frac{2(\beta +\gamma )z-\gamma }{\beta }\right) \mu _D-\tilde{b}(z)z \left( \frac{(\beta +\gamma )z-\gamma }{\beta }\right) \mu _D. \end{aligned}$$

Setting $\omega =0$ in (6.2) and recalling that now $I_A+I_B+I_C+I_D=I_7$ then yields

$$\begin{aligned} \sigma ^2_{\mathrm{MRND}}(\beta ,\gamma )=&f_{D_{\varepsilon }}\left( z\right) -f_{D_{\varepsilon }}\left( z^2\right) -\tilde{b}(z)(\tilde{b}(z)-2)z^2f_{D_{\varepsilon }}'\left( z^2\right) -\tilde{b}(z)^2 z^4 f_{D_{\varepsilon }}''\left( z^2\right) \nonumber \\&+\left( \frac{\gamma }{2\beta +\gamma }\right) \tilde{b}(z)^2 z^2 (\sigma _D^2+\mu _D^2)\nonumber \\&+2\left( \frac{\gamma -(\beta +\gamma )z}{\beta }\right) \left( \frac{\beta +\gamma }{2\beta +\gamma }\right) z\tilde{b}(z)\mu _D\nonumber \\&+2\left( \frac{\gamma -(\beta +\gamma )z}{\beta }\right) ^2 z^2\tilde{b}(z)^2\mu _D. \end{aligned}$$

(G.5)

Setting $\omega =0$ in (7.27) shows that $h(\beta ,\gamma ,z)=z\tilde{b}(z)$, where $h(\beta ,\gamma ,z)$ is defined at (9.4). Further $f_{D_{\varepsilon }}\left( z\right) =1-\rho $; see immediately after (9.1). The expression (9.3) for $\sigma ^2_{\mathrm{MRND}}(\beta ,\gamma )$ then follows immediately from (G.5).

Turning to the epidemic on an NSW random network, setting $\omega =0$ in (7.7) and noting that then $\widetilde{\psi }(z)=z$, yields

$$\begin{aligned} \sigma _0^2(\beta , 0, \gamma )=&f_D\left( z^2\right) -(1-\rho )^2+\tilde{b}(z)^2 z^4 f_D''\left( z^2\right) +\tilde{b}(z)(\tilde{b}(z)-2)z^2f_{D_{\varepsilon }}'\left( z^2\right) \nonumber \\&+\tilde{b}(z)^2z^2\left( \frac{(\beta +\gamma )z-\gamma }{\beta }\right) ^2\left( \sigma _D^2+\mu _D^2\right) \nonumber \\&-2 \left( \frac{(\beta +\gamma )z-\gamma }{\beta }\right) \left( \frac{(\beta +\gamma )z-\gamma }{\beta }+\frac{(\beta +\gamma )}{\beta }z\right) z^2 \tilde{b}(z)^2 \mu _D. \end{aligned}$$

(G.6)

Setting $\omega =0$ in (7.6) shows that $\sigma ^2_{\mathrm{NSW}}(\beta ,\gamma )$ is given by the sum of the right-hand sides of (G.5), with $D_{\varepsilon }$ replaced by D, and (G.6). The expression (9.6) for $\sigma ^2_{\mathrm{NSWND}}(\beta ,\gamma )$ now follows since $f_D(z)=1-\rho $ and $h(\beta ,\gamma ,z)=z\tilde{b}(z)$.

H ODE initial conditions for the epidemic on an NSW graph

In this appendix we derive the initial conditions $\varSigma _\mathrm{NSW}(0)$ that are given in Sect. 10.1. We assume that the number of initial infectives is $i^N_0 = [\varepsilon N]$ (or that $i^N_0$ is any function of N such that $\lim _{N\rightarrow \infty } N^{-1} i^N_0 = \varepsilon $) and that these individuals are chosen uniformly from the population. Since there is nothing special about the labelling of the individuals $1,2,\ldots ,N$ in the population we can assume that individuals $1,2,\ldots ,i^N_0$ are initially infected.

First consider the term $\sigma _{x_i,x_i}(0) = \lim _{N\rightarrow \infty } N^{-1} \mathrm{var}(X_i^N(0))$. Writing $X_i^N(0)$ as a sum of indicator variables, we use the independence of different individuals’ degrees to find that

$$ \begin{aligned} \mathrm{var}(X_i^N(0))&= \mathrm{var}\left( \sum _{k=1}^N \mathbb {1}_{\{\text {indiv } k \text { is deg } i \& \text { susc}\}}\right) \\&= \mathrm{var}\left( \sum _{k=i_0^N+1}^N \mathbb {1}_{\{\text {indiv } k\text { is deg } i\}}\right) \\&= \sum _{k=i_0^N+1}^N \mathrm{var}\left( \mathbb {1}_{\{\text {indiv } k\text { is deg } i\}}\right) \\&= (N-i_0^N) p_i(1-p_i), \end{aligned}$$

so

$$\begin{aligned} \sigma _{x_i,x_i}(0) = \lim _{N\rightarrow \infty } N^{-1} \mathrm{var}(X_i^N(0)) = (1-\varepsilon ) p_i (1-p_i) \end{aligned}$$

for all i. Considering infectives instead, essentially the same arguments establish that

$$\begin{aligned} \sigma _{y_i,y_i}(0) = \lim _{N\rightarrow \infty } N^{-1} \mathrm{var}(Y_i^N(0)) = \varepsilon p_i (1-p_i). \end{aligned}$$

For the covariances we use the same independence and $\mathrm{cov}(X,Y)=E[XY]-E[X]E[Y]$ to find that, for $i\ne j$,

$$\begin{aligned} \mathrm{cov}(X_i^N(0),X_j^N(0))&= \mathrm{cov}\left( \sum _{k=1}^N \mathbb {1}_{\{\text {indiv } k\text { is deg }i \text {and susc}\}},\sum _{l=1}^N \mathbb {1}_{\{\text {indiv } l \text { is deg } j \text {and susc}\}}\right) \\&= \mathrm{cov}\left( \sum _{k=i_0^N+1}^N \mathbb {1}_{\{\text {indiv } k\text { is deg } i\}},\sum _{l=i_0^N+1}^N \mathbb {1}_{\{\text {indiv } l\text { is deg } j\}}\right) \\&= \sum _{k=i_0^N+1}^N \sum _{l=i_0^N+1}^N \mathrm{cov}(\mathbb {1}_{\{\text {indiv } k\text { is deg }i\}},\mathbb {1}_{\{\text {indiv } l\text { is deg } j\}}) \\&= \sum _{k=i_0^N+1}^N \mathrm{cov}(\mathbb {1}_{\{\text {indiv } k \text { is deg }i\}},\mathbb {1}_{\{\text {indiv }k\text { is deg } j\}}) \\&= (N-i_0^N) (0 - p_ip_j) = -(N-i_0^N) p_ip_j, \end{aligned}$$

so that we have

$$\begin{aligned} \sigma _{x_i,x_j}(0) = \lim _{N\rightarrow \infty } N^{-1} \mathrm{cov}(X_i^N(0),X_j^N(0)) = -(1-\varepsilon ) p_ip_j. \end{aligned}$$

The same calculations for $\mathrm{cov}(Y_i^N(0),Y_j^N(0))$ yield

$$\begin{aligned} \sigma _{y_i,y_j}(0) = \lim _{N\rightarrow \infty } N^{-1} \mathrm{cov}(Y_i^N(0),Y_j^N(0)) = -\varepsilon p_ip_j \end{aligned}$$

for $i\ne j$. Next, for all i, j,

$$ \begin{aligned} \mathrm{cov}(X_i^N(0),Y_j^N(0))&= \mathrm{cov}\left( \sum _{k=1}^N \mathbb {1}_{\{\text {indiv } k \text { is deg }i\,\mathrm{and}\,\text { susc}\}},\sum _{l=1}^N \mathbb {1}_{\{\text {indiv } l\text { is deg } j\, \& \text { inf}\}}\right) \\&= \mathrm{cov}\left( \sum _{k=i_0^N+1}^N \mathbb {1}_{\{\text {indiv } k\text { is deg } i\}},\sum _{l=1}^{i_0^N} \mathbb {1}_{\{\text {indiv } l\text { is deg } j\}}\right) \\&= \sum _{k=i_0^N+1}^N \sum _{l=1}^{i_0^N} \mathrm{cov}(\mathbb {1}_{\{\text {indiv } k \text { is deg }i\}},\mathbb {1}_{\{\text {indiv } l\text { is deg }j\}}) \\&= 0, \end{aligned}$$

by independence of individuals (there are no terms with $k=l$ since the indices take values in disjoint sets). Thus

$$\begin{aligned} \sigma _{x_i,y_j}(0) = \lim _{N\rightarrow \infty } N^{-1} \mathrm{cov}(X_i^N(0),Y_j^N(0)) = 0. \end{aligned}$$

Finally, we have $Z_E^N(0) = 0$, so all (co)variances involving it are zero and remain so when divided by N, whence for all i we have

$$\begin{aligned} \sigma _{x_i,z_E}(0) = \sigma _{y_i,z_E}(0) = \sigma _{z_E,z_E}(0) = 0. \end{aligned}$$

Rights and permissions

OpenAccess This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

Reprints and permissions

About this article

Cite this article

Ball, F., Britton, T., Leung, K.Y. et al. A stochastic SIR network epidemic model with preventive dropping of edges. J. Math. Biol. 78, 1875–1951 (2019). https://doi.org/10.1007/s00285-019-01329-4

Download citation

Received: 04 June 2018
Revised: 18 January 2019
Published: 13 March 2019
Issue Date: 01 May 2019
DOI: https://doi.org/10.1007/s00285-019-01329-4

Keywords

Mathematics Subject Classification

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

A stochastic SIR network epidemic model with preventive dropping of edges

Abstract

Similar content being viewed by others

Emergence in complex networks of simple agents

Ergodic stationary distribution of age-structured HBV epidemic model with standard incidence rate

Infection-induced increases to population size during cycles in a discrete-time epidemic model

1 Introduction

2 The stochastic SIR network epidemic model with preventive dropping

3 Effective degree formulation

Remark 3.1

Remark 3.2

Theorem 3.1

Proof

Remark 3.3

Theorem 3.2

Proof

Remark 3.4

4 Final outcome of epidemic on MR random graph

Proposition 4.1

Remark 4.1

Conjecture 4.1

Remark 4.2

Remark 4.3

5 Deterministic temporal behaviour and final size

5.1 Temporal behaviour

5.1.1 Time-transformed process

Remark 5.1

5.1.2 Real-time process

5.2 Connection to other approaches

5.3 Final size

Proposition 5.1

Proof

Remark 5.2

Remark 5.3

6 Asymptotic variance of final size of epidemic on an MR random graph

Proposition 6.1

Proof

Conjecture 6.1

Remark 6.1

Remark 6.2

7 Extension to iid degrees: epidemics on an NSW random graph

Theorem 7.1

Proof

Remark 7.1

Remark 7.2

Theorem 7.2

Proof

Remark 7.3

Conjecture 7.1

Remark 7.4

Remark 7.5

8 Increased recovery rate instead of dropping edges

Lemma 1

Theorem 8.1

Proof

Remark 8.1

9 No dropping of edges

9.1 SIR epidemic on configuration network

Conjecture 9.1

Remark 9.1

Remark 9.2

9.2 Configuration model giant component

Conjecture 9.2

10 Numerical examples

10.1 Implementation

10.2 Convergence and approximation of temporal properties

10.3 Approximation of epidemic final size

10.4 The effect of dropping

10.5 The effect of random graph model on variances

10.6 Increased recovery rate instead of dropping

11 Concluding comments

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Appendices

A Derivation of drift function \(F(\varvec{x},\varvec{y},z_E)\)