Abstract
We suggest a method for constructing a positive harmonic function for a wide class of transition kernels on \({{\mathbb {Z}}^+}\). We also find natural conditions under which this harmonic function has a positive finite limit at infinity. Further, we apply our results on harmonic functions to asymptotically homogeneous Markov chains on \({{\mathbb {Z}}^+}\) with asymptotically negative drift which arise in various queueing models. More precisely, assuming that the Markov chain satisfies Cramér’s condition, we study the tail asymptotics of its stationary distribution. In particular, we clarify the impact of the rate of convergence of chain jumps towards the limiting distribution.
Similar content being viewed by others
1 Introduction
Let us consider a positive recurrent Markov chain \(X_n\) on \({{\mathbb {Z}}^+}\) with stationary probabilities \(\pi (i)>0\), that is,
where P(j, i) is the transition probability from j to i. In terms of infinite matrices, \(\pi \) is a positive left eigenvector of P corresponding to the eigenvalue 1. We are interested in the asymptotic behaviour of \(\pi (i)\) for large values of i. Markov chains on \({{\mathbb {Z}}^+}\) and their stationary probabilities naturally arise in various queueing models. A classical example is given by the queue length process at service completion epochs in the M/G/1 queue, which goes back to Kendall [10]; see Boxma and Lotov [3] for further analysis. A natural modification of this process is a system where the service rate depends on the current state of the system.
We study the case where the distribution of \(X_n\) has some positive exponential moments finite, more precisely, the so-called Cramér case. Let \(\xi (i)\) denote a random variable distributed as the jump of the chain \(X_n\) from the state i, that is,
We shall always assume that \(X_n\) is asymptotically homogeneous in space, that is,
We assume \({{\mathbb {E}}}\xi <0\) and that \({{\mathbb {Z}}}\) is the lattice with minimal span for the distribution of \(\xi \). By the Cramér case, we mean the case where
The simplest and one of the most important examples of an asymptotically homogeneous Markov chain is a random walk with delay at zero—the Lindley recursion [11]:
where \(\{\zeta _k\}\) are independent copies of \(\xi \). As is well known, the stationary measure of \(W_n\), say \(\pi _{W}\), coincides with the distribution of \(\sup _{n\ge 0}\sum _{k=1}^n\zeta _k\). Then, by the classical result due to Cramér and Lundberg, for some \(c>0\),
Since the jumps of the chains \(X_n\) and \(W_n\) are asymptotically equivalent, one could expect that their stationary distributions are asymptotically equivalent too. This is true on the logarithmic scale only: Borovkov and Korshunov have shown, see Theorem 3 in [2], that if \(\sup _{i\ge 0}{{\mathbb {E}}}e^{\beta \xi (i)}<\infty \), then
It turns out that the exact (without logarithmic scaling) asymptotic behaviour of \(\pi \) depends not only on the distribution of \(\xi \), but also on the speed of weak convergence in (1); see Theorems 2, 3 and Corollaries 2, 3. Our aim in this contribution is to show how such results may be proved via an exponential change of measure, similarly to how it is done for sums of iid random variables, where an exponential change of measure is used to change the sign of the drift. The issue with transition probabilities is that if we apply an exponential change of measure with parameter \(\beta \), then we get a nonstochastic transition kernel instead of a transition probability. The good news is that the transition kernel obtained, say Q(i, j), is asymptotically homogeneous in space, that is, \(Q(i,i+j)\rightarrow e^{\beta j}{{\mathbb {P}}}\{\xi =j\}\) as \(i\rightarrow \infty \), where the limit corresponds to the jump distribution of a random walk with positive mean \({{\mathbb {E}}}\xi e^{\beta \xi }\).
So, we need to alter this transition kernel a little in order to get a transition probability which makes it possible to analyse the corresponding stochastic object, which is a Markov chain with asymptotically positive drift. We perform such an alteration via Doob’s h-transform with a suitably chosen harmonic function. To make this approach possible, we first need to understand how to construct harmonic functions for transition kernels and how to analyse the harmonic functions constructed.
Let Q be a nonnegative finite transition kernel on \({{\mathbb {Z}}^+}\), that is, \(Q(i,j)\ge 0\) and
We additionally assume that \(Q(i,{{\mathbb {Z}}^+})>0\) for all \(i\in {{\mathbb {Z}}^+}\). This kernel is also assumed irreducible in the sense that, for all i and j, there exists n such that \(Q^n(i,j)>0\).
A function u(i) is called harmonic if \(Qu=u\), which means
Then u is a right eigenvector of Q corresponding to the eigenvalue 1. In this paper, we only consider nonnegative harmonic functions. Pruitt [14] has found sufficient and necessary conditions for the existence of such a function, but these conditions are quite hard to verify. Furthermore, his results do not give any information on the limiting behaviour of harmonic functions as \(i\rightarrow \infty \). Since this information is important for the study of asymptotic properties of Markov chains (see, for example, Foley and McDonald [8]), we are interested in a constructive approach to harmonic functions which would allow us to determine their asymptotics. This is the first problem we solve here; see Sect. 2.
As mentioned above, our primary motivation for studying harmonic functions of transition kernels comes from asymptotic tail analysis of stationary measures of Markov chains. The standard tool for studying large deviations is an exponential change of measure (Cramér transform). If we follow this approach in the context of Markov chains, then we usually get a positive transition kernel which is not stochastic, in general. In Sect. 5, we show how the results on asymptotics for harmonic functions obtained in Sect. 3 can be helpful in the study of stationary measures of asymptotically homogeneous Markov chains.
If the jumps of a positive recurrent Markov chain are bounded, then the equation for its invariant measure can be considered as a system of linear difference equations. The asymptotics of a fundamental solution to these equations can be found using refinements of the Poincaré–Perron theorem; see Elaydi [6] for details. However, one can not apply these results directly as we are only interested in a positive solution.
2 Construction of harmonic function
Any transition kernel can be seen as a combination of a stochastic transition kernel and of a total mass evolution. Indeed, let us consider the following stochastic transition kernel:
and a Markov chain \(X^Q_n\) with transition probabilities \(P^Q(i,j)\). Then we have, for all \(n\ge 1\) and i, \(j\in {{\mathbb {Z}}^+}\), that
hereinafter, \({{\mathbb {E}}}_i\) means the expectation given \(X^Q_0=i\). We call \(X^Q_n\) the underlying Markov chain.
Assume that
This condition ensures that the following function is well defined:
Under the condition (2), the function f is harmonic for the kernel Q. Indeed, it follows by conditioning on \(X^Q_1\) that
The expression (3) for a harmonic function f(i) originates from what we observe in the following two particular cases: The first simple case is provided by a stochastic kernel Q where we have the harmonic function \(f(i)\equiv 1\) which is the unique (up to a constant factor) bounded harmonic function for recurrent Markov kernels; see Meyn and Tweedie ([12], Theorem 17.1.5).
The second case concerns a kernel Q which is obtained from some stochastic kernel P of a Markov chain \(Y_n\) by killing it in some set \(B\subset {{\mathbb {Z}}^+}\), that is,
which is only defined for those i where \(P(i,{{\mathbb {Z}}^+}\setminus B)>0\). In this case, \(Q(i,{{\mathbb {Z}}^+})={{\mathbb {P}}}_i\{Y_1\not \in B\}\) and (3) reads as
where \(\tau _B:=\min \{n\ge 1:Y_n\in B\}\). So, if the original Markov chain \(Y_n\) with transition probabilities P is transient and B is finite, then the probability of not returning to B is a harmonic function for this Markov chain killed in B. It was proved by Doney [5, Theorem 1] that there is a unique harmonic function for a transient (towards \(\infty \)) random walk on \({{\mathbb {Z}}}\) killed at leaving \({{\mathbb {Z}}^+}\). This harmonic function equals the renewal function generated by descending ladder heights, which in turn is equal to \({{\mathbb {P}}}_i\{\tau _B=\infty \}\) with \(B=\{-1,-2,\ldots \}\); see Sect. 4.
An equivalent way to introduce the condition (2) is as follows:
where \(\delta (j):=\log Q(j,{{\mathbb {Z}}^+})\); hereinafter, \(\delta ^+:=\max (\delta ,0)\) and \(\delta ^-:=(-\delta )^+\) so that \(\delta =\delta ^+-\delta ^-\). Then the function f(i) may be also defined as
where \(\ell (j)\) is the local time at state j,
3 On the limiting behaviour of the harmonic function for a transient kernel
In this section, we answer, in particular, the following question: What are natural conditions sufficient for (2) in the case when Q is transient? These sufficient conditions are presented in Proposition 3, and they guarantee that \(\limsup _{i\rightarrow \infty }f(i)\le 1\). Another question is what conditions guarantee that f(i) is a positive function and, moreover, \(\liminf _{i\rightarrow \infty }f(i)\ge 1\). This is answered in Proposition 2. Combining these two statements, we find sufficient conditions for the existence of a harmonic function satisfying \(f(i)\rightarrow 1\), which is our main result in this section, given in the following theorem.
Theorem 1
Suppose that the chain \(X^Q_n\) is transient,
and that
where
Then the function f is harmonic and \(f(i)\rightarrow 1\) as \(i\rightarrow \infty \).
We split the proof of this result into two parts: the lower and upper bounds, given in Propositions 2 and 3, respectively.
Our construction of a harmonic function is alternative to the construction of Foley and McDonald; see [8, Proposition 2.1]. Their analysis is based on the assumption that the series
has the (common for all i and j) radius of convergence R bigger than 1. It seems to be quite difficult to compare this assumption with our condition (8). Clearly, the condition (8) is ready for verification in particular cases because the total masses and the embedded Markov chain are factorised there. Also, our condition (7) is weaker than the ‘closeness’ condition in [8].
3.1 Lower bound for the harmonic function f
We start with a solidarity property for the kernel Q related to positivity of the function f, which may be considered as a generalisation of the Harnack inequality well known for sub-stochastic matrices.
Proposition 1
If \(f(i)>0\) for some \(i\in {{\mathbb {Z}}^+}\), then \(f(j)>0\) for all \(j\in {{\mathbb {Z}}^+}\).
Proof
Irreducibility of Q implies that
where \(\nu _i:=\min \{n\ge 1: X^Q_n=i\}\) is a stopping time. By the strong Markov property,
which is a positive quantity by the hypothesis and (10), so, hence, the proof follows. \(\square \)
Proposition 2
Suppose that the chain \(X^Q_n\) is transient,
and that the local times satisfy
Then
In particular,
Proof
First notice that, since \(X_n^Q\) is irreducible due to irreducibility of the kernel Q, transience of \(X_n^Q\) is equivalent to the following: for any fixed N,
Further notice that
by Jensen’s inequality applied to the convex function \(e^x\). Fix an N. By the Markov property, for every \(j\le N\),
Therefore, by (13) and the condition (12),
On the other hand,
due to the conditions (11) and (12). Together with (15), this implies that
which being substituted into (14) completes the proof. \(\square \)
3.2 Upper bound for the harmonic function f
Proposition 3
Suppose that the sequence \(\delta ^+(j)\) is summable, that is, the \(\delta \) defined in (9) is finite. Then, for every \(i\in {{\mathbb {Z}}^+}\),
If, in addition, the chain \(X^Q_n\) is transient and
then
Proof
Take \(r(j):=\frac{\delta }{\delta ^+(j)}\) so that \(\sum _{j=0}^\infty \frac{1}{r(j)}=1\). We have
Applying Hölder’s inequality,
Therefore,
which yields the upper bound for f(i) in terms of exponential moments of local times, by (9).
Now let us turn to a more precise upper bound. Fix some N and rewrite (17) as follows:
By the Markov property, for any fixed \(j\le N\),
by (13) and the condition (16). Therefore, for any fixed N,
The second product possesses the following upper bound:
Since \(\sum _{j=N+1}^\infty \delta ^+(j)/\delta \rightarrow 0\) as \(N\rightarrow \infty \), by (16),
as \(N\rightarrow \infty \) uniformly for all \(i\in {{\mathbb {Z}}^+}\), which together with (18) yields the required upper bound for f(i). \(\square \)
3.3 On exponential moments for local times
Example 1
Let Q be a local perturbation at the origin of the transition kernel of a simple random walk on \({{\mathbb {Z}}^+}\) as follows:
Then we have \(Q(0,{{\mathbb {Z}}^+})=\alpha \) and \(Q(i,{{\mathbb {Z}}^+})=1\) for all \(i\ge 1\). In other words, \(\delta (0)=\log \alpha \) and \(\delta (i)=0\) for \(i\ge 1\), so that \(\delta =\log \alpha \). The underlying Markov chain \(X^Q_n\) is a simple random walk with reflection at zero. More precisely, its transition kernel is given by
According to Theorem 1, the condition \({{\mathbb {E}}}_0\alpha ^{\ell (0)}<\infty \) implies that the function \(f(i)={{\mathbb {E}}}_i \alpha ^{\ell (0)}\) is a positive harmonic function with \(f(i)\rightarrow 1\) as \(i\rightarrow \infty \). The local time \(\ell (0)\) is geometrically distributed with parameter q/p, that is,
Therefore,
Moreover, for all \(i\ge 1\),
Since any harmonic function f(i) for Q is a solution to the system of equations
we may determine it using standard methods in the theory of difference equations. Indeed, equations for \(i\ge 1\) can be rewritten as
Consequently,
Noting that \(f(1)=f(0)/\alpha \), we get
Choosing f(0) as in (19), we conclude that the expressions in (20) and (21) are equal for all \(\alpha <p/q\). Further, for every \(\alpha >p/q\) and every \(f(0)>0\), the function f(i) from (21) becomes negative for i large enough. Therefore, there is no positive harmonic function for \(\alpha >p/q\). Finally, in the critical case \(\alpha =p/q\), we have \(f(i)=f(0)(q/p)^i\rightarrow 0\). \(\Box \)
Example 2
Consider the following transition kernel:
Aggregating the states 0,..., \(N-1\) into a new state, we obtain the transition kernel from Example 1 with \(\alpha =\alpha _0\ldots \alpha _{N-1}\). Therefore, there exists a positive harmonic function f with \(f(i)\rightarrow 1\) as \(i\rightarrow \infty \) if and only if \(p/q>\alpha _0\ldots \alpha _{N-1}\). This is equivalent to
This shows that exponential moment assumption on the local times in Theorem 1 is quite close to a necessary one. \(\Box \)
Next let us give simple sufficient conditions that guarantee finiteness of some exponential moment of local times for a Markov chain \(X_n\) valued in \({{\mathbb {Z}}^+}\).
Proposition 4
Suppose that there exist a level \(N\in {{\mathbb {Z}}^+}\) and a random variable \(\eta \) such that \({{\mathbb {E}}}\eta >0\) and, for all \(i\ge N\) and \(j\in {{\mathbb {Z}}}\),
that is, \(\eta \) is a stochastic minorant for the family of jumps of \(X_n\) above the level N. Then
Proof
Let \(\nu _i:=\min \{n\ge 1:X_n=i\}\). By the Markov property, the distribution of the local time \(\ell (i)\) given \(X_0=i\) is geometric with parameter \({{\mathbb {P}}}_i\{\nu _i<\infty \}\). Thus, the statement on exponential moments of local times is equivalent to the following:
Due to irreducibility of the transition kernel, the solidarity property says that either all states are transient or recurrent. Therefore, (24) is equivalent to
and it suffices to check that
Indeed, by the condition (22), for any initial state \(i\ge N\), we may construct the chain \(X_0=i\), \(X_1\),... and independent copies \(\eta _1\), \(\eta _2\),... of \(\eta \) on some probability space in such a way that
on the event where \(\eta _1+\cdots +\eta _n\ge 1\) for all \(n\ge 1\). Since \({{\mathbb {E}}}\eta >0\), owing to the strong law of large numbers,
which implies (26) and the proof is complete. \(\square \)
As follows from the proof, (23) holds with any \(\gamma <\log \displaystyle \frac{1}{1-p_\eta }\) provided the chain \(X_n\) satisfies the condition (22) with \(N=0\).
The last result may be generalised to the case where there is no common minorant for jumps, but the drift is everywhere positive. In order to produce such a generalisation, we start with the following statement, where
Proposition 5
Assume that, for any \(i\in {{\mathbb {Z}}^+}\), there exists a positive decreasing function \(g_i(j)\) such that \(g_i(X_{n\wedge \tau _i})\) is a supermartingale and
If, in addition,
then (23) holds.
Proof
For any \(i\ge 1\), applying Doob’s inequality to the supermartingale \(g_i(X_{n\wedge \tau _i})\) with \(X_0=i\), we obtain that
Therefore,
uniformly for all \(i\in {{\mathbb {Z}}^+}\). Then
hence, (23) holds for all \(\gamma <\log \displaystyle \frac{1}{1-p}\). \(\square \)
The last proposition helps us to deduce finiteness of exponential moments of local times for a Markov chain with everywhere positive drift.
Proposition 6
Assume that there exist \(\varepsilon >0\) and \(M<\infty \) such that
Assume also that there exists a random variable \(\zeta \ge 0\) with \({{\mathbb {E}}}\zeta <\infty \) such that
Then (23) holds.
Proof
Since \({{\mathbb {E}}}\zeta <\infty \), there exists a decreasing integrable function \(h_1(x):{{\mathbb {R}}^+}\rightarrow {{\mathbb {R}}^+}\), such that \({{\mathbb {P}}}\{\zeta >x\}=o(h_1(x))\) as \(x\rightarrow \infty \). In turn, by [4], there exists a function \(h_2(x)\) which is continuous, decreasing, integrable and regularly varying at infinity with index \(-1\) such that \(h_1(x)\le h_2(x)\). Put
Then there exists a sufficiently small \(\delta >0\) and a sufficiently large T such that
and, for all \(t\ge T\),
Now take
and consider a positive decreasing function
which is linear for \(x\in [-T,T+M]\) with slope \(g'(0)=-\,h(T)<0\) and \(h(-T)\le J\). By (32) and (33), the function h satisfies
Define \(g_i(j):=g(j-i)\). By construction,
which guarantees fulfilment of (27). The condition (28) follows from (29). Then it remains to show that, for every \(i\in {{\mathbb {Z}}^+}\), \(g_i(X_{n\wedge \tau _i})\) is a supermartingale, that is,
Indeed, for all j between i and \(i+T\), g(x) is linear on \([-T,j-i+M]\). Therefore, for \(i\le j\le i+T\),
by (29) and (31). Now let us consider the case \(j>i+T\). Since g is decreasing,
We have, by the upper bound \(g(x)\le J\) and (35),
Since \(g'(x)=-\,h(x+T)\) for all \(x\ge -T\), the second term possesses the following upper bound:
Therefore, due to (34),
The third term is not greater than
Substituting (38)–(40) into (37), we get the desired inequality \({{\mathbb {E}}}_j g(X_1-i)-g(j-i) \le 0\) for all \(j>i+T\). Together with (36), this proves that \(g_i(X_{n\wedge \tau _i})\) constitutes a nonnegative bounded supermartingale and the proof is complete. \(\square \)
4 Random walk with negative drift conditioned to stay nonnegative
Consider the simplest application of our method of construction of harmonic functions. It deals with random walk conditioned to stay nonnegative. Let \(S_0=0\), \(S_n=\sum _{k=1}^n\xi _k\) be a random walk with independent and identically distributed jumps, \({{\mathbb {E}}}\xi _k<0\). One of the possible ways to define a random walk conditioned to stay nonnegative—see for instance [1]—consists of performing Doob’s h-transform over \(S_n\) killed at leaving \({{\mathbb {Z}}^+}\), that is, a Markov chain on \({{\mathbb {Z}}^+}\) with transition probabilities
where f is a positive harmonic function for the killed random walk, that is,
According to Theorem 1 of [5], such a function exists if and only if
This function is unique (up to a constant factor) and is defined in [5] as
where u(j) stands for the mass function of the renewal process of strict descending ladder heights of \(S_n\).
Now let us show how our approach provides another representation of the harmonic function f(i). Start with the following transition kernel on \({{\mathbb {Z}}^+}\):
This kernel represents the transition probabilities for the random walk \(S_n^{(\beta )}\) killed at leaving \({{\mathbb {Z}}^+}\), where \(S_n^{(\beta )}\) is the result of an exponential change of measure with parameter \(\beta \), that is,
As we have already mentioned in the introduction, see (4),
is harmonic for the kernel Q. Hence, the function
is harmonic for the random walk \(S_n\) killed at leaving \({{\mathbb {Z}}^+}\). Notice that this harmonic function possesses the following lower and upper bounds:
The upper bound immediately follows from \(f^*(i)\le 1\). The lower bound follows by the Cramér–Lundberg estimate
In addition to (43), notice that, by the Cramér–Lundberg approximation,
Let us show that the functions defined in (41) and (42) coincide up to a constant factor. Indeed, let \((\tau _k,\chi _k)\) and \((\tau ^{(\beta )}_k,\chi ^{(\beta )}_k)\) denote descending ladder processes for \(S_n\) and \(S_n^{(\beta )}\), respectively. It follows from the definition of \(S_n^{(\beta )}\) that
for all x, \(j>0\). Therefore,
This gives us the desired equivalence with the multiplier \({{\mathbb {P}}}\{\tau _1^{(\beta )}=\infty \}=1-{{\mathbb {E}}}e^{-\beta \chi _1}\).
The random walk conditioned to stay nonnegative is the simplest Markov chain where the general scheme of construction of a harmonic function helps. In the next section, we follow almost the same techniques in our study of tail behaviour for asymptotically homogeneous in space Markov chains with negative drift under Cramér-type assumptions. Although the scheme is the same in the main aspects, some additional arguments are required.
5 Positive recurrent Markov chains: asymptotic behaviour of stationary distribution
In this section, we consider an asymptotically homogeneous in space Markov chain \(X_n\) with jumps \(\xi (i)\), that is, \(\xi (i)\Rightarrow \xi \) as \(i\rightarrow \infty \). Our next result describes the case when the convergence of jumps is so fast that the stationary measure \(\pi \) of \(X_n\) is asymptotically proportional to that of the random walk \(W_n\) delayed at the origin.
Theorem 2
Suppose that the jumps of \(X_n\) possess a stochastic majorant \({\varXi }\),
such that \({{\mathbb {E}}}{\varXi } e^{\beta {\varXi }}<\infty \). If
then \(\pi (i)\sim c e^{-\beta i}\) as \(i\rightarrow \infty \), where \(c>0\).
It is worth mentioning that (45) is weaker than conditions we found in the literature. First, using so-called equilibrium identities, Borovkov and Korshunov [2] proved exponential asymptotics for \(\pi \) under the condition
which is definitely stronger than (45) and implies, in particular, that the expectations of \(\xi ^{(\beta )}(i)\) also converge with a summable speed. Further, to show that the constant c in front of \(e^{-\beta i}\) is positive they introduced the following condition:
Second, Foley and McDonald [8] used an assumption which can be rewritten in our notations as follows:
Furthermore, the condition (45) is quite close to the optimal one. If, for example, \({{\mathbb {E}}}e^{\beta \xi (i)}-1\) are of the same sign and not summable, then \(\pi (i)e^{\beta i}\) converges either to zero or to infinity; see Corollary 2 below. Thus, if (45) is violated, then \(\pi (i)\) may have exponential asymptotics only in the case when \({{\mathbb {E}}}e^{\beta \xi (i)}-1\) is changing its sign infinitely often.
Example 3
Consider a Markov chain \(X_n\) which jumps to neighbours only:
Assume that, as \(i\rightarrow \infty \),
with some \(\gamma \in (1/2,1)\). Clearly, (45) is not satisfied. Let us have a look at the values of \(X_n\) at even time moments, i.e. \(Y_k=X_{2k}\), \(k\ge 0\). Then we have
where \(q:=1-p\). From these equalities, we obtain
Noting that \(\varphi (i)+\varphi (i+1)=O(i^{-\gamma -1})\), we conclude that the sequence \(|{{\mathbb {E}}}_i(q/p)^{Y_1-i}-1|\) is summable and, consequently, we may apply Theorem 2. Since \(\pi \) is stationary for \(Y_n\) too, we obtain \(\pi (i)\sim c(p/q)^i\) as \(i\rightarrow \infty \). \(\Box \)
Proof of Theorem 2
Fix some \(N\in {{\mathbb {Z}}^+}\). As is well known (see, for example, [12, Theorem 10.4.9]), the invariant measure \(\pi \) satisfies the equality
where
Let h(i) be a harmonic function for \(X_n\) killed at entering [0, N], that is,
Then we can perform Doob’s h-transform on \(X_n\) killed at entering [0, N] and define a new Markov chain \({\widehat{X}}_n\) on \({{\mathbb {Z}}^+}\) with the following transition kernel:
if \(h(i)>0\), and \({{\mathbb {P}}}_i\{{\widehat{X}}_1=j\}\) being arbitrarily defined if \(h(i)=0\). Since h is harmonic, then we also have
Combining (47) and (46), we get
where
is the renewal measure generated by the chain \({\widehat{X}}_n\) with initial distribution
Suppose that the harmonic function h(i) is such that the jumps \({\widehat{\xi }}(i)\) of the chain \({\widehat{X}}_n\) satisfy the following conditions:
the family of random variables \(\{|{\widehat{\xi }}(i)|,\ i\in {{\mathbb {Z}}^+}\}\) admits an integrable majorant \({\widehat{{\varXi }}}\), that is,
and
Then the key renewal theorem for asymptotically homogeneous in space Markov chains from Korshunov [9] states that \({\widehat{U}}(i)\rightarrow 1/{{\mathbb {E}}}{\widehat{\xi }}\) as \(i\rightarrow \infty \). Substituting this into (48), we deduce the following asymptotics:
So, now we need to choose a level N and to construct a harmonic function h(i) for \(X_n\) killed at entering [0, N] such that h satisfies the conditions (49)–(51). The intuition behind our construction of the function h is simple. Since we consider asymptotically homogeneous Markov chain, the chain behaves similarly to the random walk with jumps like \(\xi \). We assume that the limiting jump satisfies Cramér’s condition; hence, it should be such that \(h(i)\sim e^{\beta i}\) as \(i\rightarrow \infty \).
Consider the transition kernel
which is the result of an exponential change of measure. By the theorem conditions, \(Q^{(\beta )}_N(i,{{\mathbb {Z}}^+})\) is finite for all i. Let us find a level N such that the kernel \(Q^{(\beta )}_N\) satisfies the conditions of Theorem 1.
Denote
Since
we conclude by the condition (45) that the condition (7) of Theorem 1 holds for any \(N\in {{\mathbb {Z}}^+}\).
Further, \(\delta _N^+(i)\rightarrow 0\) as \(N\rightarrow \infty \), for every \(i\in {{\mathbb {Z}}^+}\). Moreover, \(\delta _N^+(i)\le \log ^+{{\mathbb {E}}}e^{\beta \xi (i)},\) where the series
is convergent, due to the condition (45). Then the dominated convergence theorem yields that
So, if we proved that, for some \(\gamma >0\),
then we may choose a sufficiently large N such that condition (8) of Theorem 1 holds; here \(\ell ^{(\beta )}_N(i)\) is the local time at state i of the underlying chain \(X^{(\beta )}_{N,n}\), \(n\ge 0\), of the kernel \(Q^{(\beta )}_N\).
By Proposition 4, the relation (53) would follow if we constructed, for a sufficiently large N, a minorant with positive mean for the jumps \(\xi ^{(\beta )}_N(i)\) of the chain \(X^{(\beta )}_{N,n}\). The asymptotic homogeneity of the Markov chain \(X_n\) implies that
where the limiting random variable has distribution
with positive mean \({{\mathbb {E}}}\xi e^{\beta \xi }\). The left tails of \(\xi ^{(\beta )}_N(i)\) are exponentially bounded. Therefore, there exist a sufficiently large N and a random variable \(\eta \) with positive mean, \({{\mathbb {E}}}\eta >0\), such that
and a minorant is identified.
Finally, (13) also follows from minorisation and convergence: for every fixed N,
So, for a sufficiently large N, the kernel \(Q^{(\beta )}_N\) satisfies all the conditions of Theorem 1. Therefore, there exists a positive harmonic function f for this kernel such that \(f(i)\rightarrow 1\) as \(i\rightarrow \infty \).
Let us consider a function \(h(i):=e^{\beta i}f(i)\). For any \(i\in {{\mathbb {Z}}^+}\), we have the equality
and, hence, h(i) is a harmonic function for the Markov chain \(X_n\) killed at entering [0, N]. Let us check that h produces \({\widehat{X}}_n\) satisfying the conditions (49)–(51). First, the condition (49) holds because, for any \(j\ge N-i\),
Notice that \({{\mathbb {E}}}{\widehat{\xi }}={{\mathbb {E}}}\xi e^{\beta \xi }\).
Second, let us prove that the condition (50) holds. From the upper bound
owing to the condition (44). We deduce that \({\widehat{\xi }}(i) \le _{st} {\varXi }_1\), where
On the other hand,
so that \({\widehat{\xi }}(i)\ge -{\varXi }_2\), where \({\varXi }_2\) has some positive exponential moment finite. Together with (55), this implies fulfilment of the condition (50) for the function \(h(i)=e^{\beta i}f(i)\).
Third, the condition (51) follows from the equalities
and from boundedness (53) of an exponential moment of the local times of \(X^{(\beta )}_{N,n}\).
Therefore, we may apply (52) and deduce that
The proof of Theorem 2 is complete. \(\square \)
We now turn our attention to the case where \({{\mathbb {E}}}e^{\beta \xi (i)}\) converges to 1 in a nonsummable way. The next result describes the behaviour of \(\pi \) in terms of a nonuniform exponential change of measure.
Theorem 3
Suppose that, for some \(\varepsilon >0\),
Assume also that there exists a differentiable function \(\beta (x)\) such that
and \(|\beta '(x)|\le \gamma (x)\), where \(\gamma (x)\) is a decreasing function which is integrable at infinity of order o(1/x). Then, for some \(c>0\),
It should be noticed that Theorem 2 can be seen as a special case of Theorem 3 with \(\beta (x)\equiv \beta \). We have decided to split these two statements because of the milder moment condition (44) in Theorem 2 and since the proof of Theorem 3 is simpler via a reduction to the case of summable rate of convergence, which has been considered in Theorem 2.
Proof of Theorem 3
Consider the function \(g(x):=e^{\int _0^x \beta (y)\text {d}y}\) and perform the following change of measure:
First let us estimate
Observe that, with necessity, \(\beta (i)\rightarrow \beta \) so that, by the condition (56),
Further, the condition on the derivative of \(\beta (y)\) implies that
uniformly for \(|\xi (i)|\le \sqrt{i}\). We have \(\gamma \left(i-\sqrt{i}\right)\xi ^2(i)\le \gamma \left(i-\sqrt{i}\right)i\rightarrow 0\) as \(i\rightarrow \infty \). Therefore, again in view of the condition (56),
Hence,
Taking into account (57) and summability of the sequence \(\gamma \left(i-\sqrt{i}\right)\), we conclude that
This allows us to apply Theorem 1 to the kernel \(Q^{(g)}\) killed at entering some set [0, N] in the same way as in the proof of Theorem 2 and to deduce that \(\pi (i)\sim c/g(i)\) as \(i\rightarrow \infty \), which completes the proof. \(\square \)
Since the function \(\beta (x)\) is stated implicitly in Theorem 3, it calls for specification of some cases where \(\beta (x)\) can be expressed in terms of the difference \({{\mathbb {E}}}e^{\beta \xi (i)}-1\).
Corollary 1
Assume the condition (56) and that there exists a differentiable function \(\alpha (x)\) such that \(\alpha '(x)\) is regularly varying at infinity with index \(r\in (-2,-3/2)\) and
where \(\sum _{j=0}^\infty |\gamma (i)|<\infty \). Suppose also that
where \(m:={{\mathbb {E}}}\xi e^{\beta \xi }\). Then
where \(c>0\) and \(A(x):=\int _0^x\alpha (y)\text {d}y\).
Proof
Notice that, since \(r\in (-2,-3/2)\), \(\alpha (x)\) is regularly varying at infinity with index \(r+1\in (-1,-1/2)\), \(A(x)\rightarrow \infty \), \(A(x)=o(x)\) as \(x\rightarrow \infty \) and \(\sum _{i=1}^\infty \alpha ^2(i)<\infty \).
Take \(\beta (x):=\beta -\alpha (x)/m\). Since \(r<-3/2\), \(\alpha (i)=o\left(1/\sqrt{i}\right)\). Hence, by Taylor’s theorem, uniformly for \(|\xi (i)|\le \sqrt{i}\),
which yields
Thus, the function \(\beta (x)\) satisfies all the conditions of Theorem 3 and the proof is complete. \(\square \)
Notice that the key condition on the rate of convergence of \({{\mathbb {E}}}e^{\beta \xi (i)}\) to 1 that implies the asymptotics (60) in the last corollary is that the sequence \(\alpha ^2(i)\) is summable. If it is not so, that is, if the index \(r+1\) of regular variation in the function \(\alpha (x)\) is between \(-1/2\) and 0, then the asymptotic behaviour of \(\pi (i)\) is different from (60). This is specified in the following corollary.
Corollary 2
Assume the condition (56) and that there exists a differentiable function \(\alpha (x)\) such that
for some \(c<\infty \), \(M\in {{\mathbb {N}}}\), and \(\varepsilon >0\),
for some decreasing and integrable at infinity function \(\gamma _1(x)\), and
where \(\sum _{i=0}^\infty \gamma _2(i)<\infty \). Assume also that, for every \(k=1\), 2,..., M, there exist constants \(D_{k,j}\) such that
where \(m_k(i):={{\mathbb {E}}}\xi ^k(i)e^{\beta \xi (i)}\) and \(m_k:={{\mathbb {E}}}\xi ^ke^{\beta \xi }\). Then there exist real numbers \(c > 0, R_{1}, R_{2}, \ldots , R_{M} \in {{\mathbb{R}}}\) such that
Proof
Define
In view of Theorem 3, it suffices to show that there exist \(R_1\), \(R_2\),..., \(R_M\in {{\mathbb {R}}}\) such that
Indeed, \({\varDelta }(x)\) is differentiable and \(|{\varDelta }'(x)|\le C|\alpha '(x)|\). Therefore, we may apply Theorem 3 with \(\beta (x)=\beta +{\varDelta }(x)\).
By Taylor’s theorem, calculations similar to those in the previous corollary show that, as \(i\rightarrow \infty \),
From this equality, we infer that we can determine \(R_1\), \(R_2\),..., \(R_M\) by the relation
It follows from the assumption (62) and the bound \({\varDelta }(x)=O(\alpha (x))\) that (65) is equivalent to
Consequently, the coefficient of \(z^k\) should be zero for all \(k\le M\), and we can determine all \(R_k\) recursively. For example, the coefficient of z equals \(1+m_1R_1\). Thus, \(R_1=-\,1/m_1\). Further, the coefficient of \(z^2\) is \(D_{1,1}R_1+m_1R_2+m_2R_1^2/2\) and, consequently,
All further coefficients can be found in the same way. \(\square \)
If \(\alpha (x)\) from Corollary 2 decreases slower than any power of x, but (61) and (62) remain valid, then one has, by the same arguments,
which can be seen as a corrected logarithmic asymptotic for \(\pi \). To obtain precise asymptotics, one needs more information on the moments of the jumps \(m_k(i)\).
Corollary 3
Assume the condition (56) and that there exists a differentiable function \(\alpha (x)\) such that (61) holds,
and there exist constants \(D_{k,j}\) such that
for all \(k\ge 1\), where \(m_k(i):={{\mathbb {E}}}\xi ^k(i)e^{\beta \xi (i)}\) and \(m_k:={{\mathbb {E}}}\xi ^ke^{\beta \xi }\). Assume further that
Then there exist real numbers \(R_1\), \(R_2\),..., such that
Proof
For any \(i\ge 1\), let \(\beta (i)\) denote a positive solution to the equation
Since \({{\mathbb {E}}}e^{\gamma \xi (i)}\) is finite for all \(\gamma \le \beta +\varepsilon \), we can rewrite the last equation using Taylor’s series:
where \({\varDelta }(i)=\beta (i)-\beta \). Taking into account (66) and (67), we get then
Set \(D_{0,1}=1\) and define
Therefore, (68) can be rewritten as \(F(\alpha (x),{\varDelta }(x))=0\). In other words, we are looking for a function w(z) satisfying \(F(z,w(z))=0\). Since \(F(0,0)=0\) and \(\frac{\partial }{\partial w}F(0,0)=m_1>0\), we can apply Theorem B.4 from Flajolet and Sedgewick [7] which says that w(z) is analytic in a vicinity of zero, that is, there exists a \(\rho >0\) such that
Consequently,
for all i such that \(|\alpha (i)|<\rho \).
Applying Theorem 3 with \(\beta (x)=\beta +{\varDelta }(x)\), we get
Integrating \({\varDelta }(y)\) as a series, we complete the proof. \(\square \)
We conclude with the following remark. In the proof of Corollary 3, we have adapted the derivation of the Cramér series in large deviations for sums of independent random variables; see, for example, Petrov [13]. There is just one difference: we need analyticity of an implicit function instead of analyticity of an inverse function.
References
Bertoin, J., Doney, R.: On conditioning a random walk to stay nonnegative. Ann. Probab. 22, 2152–2167 (1994)
Borovkov, A.A., Korshunov, D.: Large-deviation probabilities for one-dimensional Markov chains. Part 1: stationary distributions. Theory Probab. Appl. 41, 1–24 (1997)
Boxma, O.J., Lotov, V.I.: On a class of one-dimensional random walks. Markov Process. Relat. Fields 2, 349–362 (1996)
Denisov, D.E.: On the existence of a regularly varying majorant of an integrable monotone function. Math. Notes 76, 129–133 (2006)
Doney, R.: The Martin boundary and ratio limit theorems for killed random walks. J. Lond. Math. Soc. 58, 761–768 (1998)
Elaydi, S.: An introduction to difference equations. 3rd Ed. Undergraduate Texts in Mathematics, Springer, New York (2005)
Flajolet, P., Sedgewick, R.: Analytic Combinatorics. Cambridge University Press, Cambridge (2009)
Foley, R.D., McDonald, D.R.: Constructing a harmonic function for an irreducible nonnegative matrix with convergence parameter \(R>1\). Bull. Lond. Math. Soc. 44, 533–544 (2012)
Korshunov, D.: The key renewal theorem for a transient Markov chain. J. Theoret. Probab. 21, 234–245 (2008)
Kendall, D.G.: Stochastic processes occurring in the theory of queues and their analysis by the method of the imbedded Markov chain. Ann. Math. Stat. 24, 338–354 (1953)
Lindley, D.V.: The theory of queues with a single server. Proc. Camb. Philos. Soc. 48, 277–289 (1952)
Meyn, S., Tweedie, R.: Markov Chains and Stochastic Stability. Wiley, New York (1993)
Petrov, V.V.: Sums of Independent Random Variables. Springer, Berlin (1975)
Pruitt, W.E.: Eigenvalues of non-negative matrices. Ann. Math. Stat. 35, 1797–1800 (1964)
Acknowledgements
We are thankful to the referee whose comments helped to clarify some conditions of Theorem 1.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
About this article
Cite this article
Denisov, D., Korshunov, D. & Wachtel, V. Markov chains on \({{\mathbb {Z}}^+}\): analysis of stationary measure via harmonic functions approach. Queueing Syst 91, 265–295 (2019). https://doi.org/10.1007/s11134-019-09602-5
Received:
Revised:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11134-019-09602-5
Keywords
- Transition kernel
- Harmonic function
- Markov chain
- Stationary distribution
- Renewal function
- Exponential change of measure
- Queues