Improved bounds for mixing rates of Markov chains and multicommodity flow

Sinclair, Alistair

doi:10.1007/BFb0023849

Alistair Sinclair¹

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 583))

Included in the following conference series:

Latin American Symposium on Theoretical Informatics

306 Accesses
16 Citations

Abstract

In recent years, Markov chain simulation has emerged as a powerful algorithmic paradigm. Its chief application is to the random sampling of combinatorial structures from a specified probability distribution. Such a sampling procedure lies at the heart of efficient probabilistic algorithms for a wide variety of problems, such as approximating the size of combinatorially defined sets, estimating the expectation of certain operators in statistical physics, and combinatorial optimisation by stochastic search.

The algorithmic idea is simple. Suppose we wish to sample the elements of a large but finite set X of structures from a distribution π. First, construct a Markov chain whose states are the elements of X and which converges asymptotically to the stationary or equilibrium distribution π over X; it is usually possible to do this using as transitions simple random perturbations of the structures in X. Then, starting from an arbitrary state, simulate the chain until it is close to equilibrium: the distribution of the final state will be close to the desired distribution π.

To take a typical example, let H be a connected graph and X the set of spanning trees of H, and suppose we wish to sample elements of X from a uniform distribution. Consider the Markov chain MC(H) with state space X which, given a spanning tree T ε X, makes transitions as follows: select uniformly at random an edge e of H which does not belong to T, add e to T, thereby creating a single cycle C, and finally remove an edge of C uniformly at random to create a new spanning tree T′. It is not hard to check that this Markov chain converges to the uniform distribution over X.

Analysing the efficiency of the above technique in a given application presents a considerable challenge. The key issue is to determine the mixing rate of the chain, i.e., the number of simulation steps needed to ensure that it is sufficiently close to its equilibrium distribution π. An efficient algorithm can result only if this number is reasonably small, which usually means dramatically less than the size of the state space X itself. For example, in the spanning tree problem above we would want MC(H) to reach equilibrium in time bounded by some polynomial in n, the size of the problem instance H; however, the number of states ¦X¦ will typically be exponential in n. Informally, we will call chains having this property rapidly mixing. (More correctly, this is a property of families of chains, such as MC(H), parameterised on problem instances.)

The first analyses of the complex Markov chains arising in the combinatorial applications mentioned above were made possible using a quantity called the conductance [20,

A useful piece of technology for obtaining lower bounds on Φ in complex examples was developed in [8, 20]. The idea is to construct a canonical path γ_xy in the graph G between each ordered pair of distinct states x and y. If the paths can be chosen in such a way that no edge is overloaded by paths, then the chain cannot contain a constriction, so Φ is not too small. (The existence of a constriction between S and X−S would imply that any choice of paths must overload the edges in the constriction.) More precisely, suppose ρ is the maximum loading of an edge by paths; then it is not hard to show (see Theorem 3) that Φ ≥ (2ρ)⁻¹, so ρ does indeed provide a bound on the mixing rate of the chain. The power of this observation lies in the fact that a good collection Γ = {γ _xy} of canonical paths can sometimes be constructed for which ρ can be bounded rather tightly.

Recently Diaconis and Stroock [6] observed that path arguments similar to that described above can lead directly to bounds on the mixing rate, independently of the conductance Φ. In this paper, we present a new bound which is a modification of that of Diaconis and Stroock. The new bound also involves the maximum loading of an edge by paths, but takes into account the lengths of the paths. A simplified form of the bound (Corollary 6) relates the mixing rate to the product ρℓ for a collection of paths Γ, where ℓ is the length of a longest path in Γ. This bound turns out to be sharper than the conductance-based bound above when the maximum path length ℓ is small compared to ρ.

In Section 3 of the paper, we illustrate the effectiveness of the new bound by obtaining significantly improved estimates for the mixing rate of several important complex Markov chains, which have been used in the design of algorithms for problems involving monomerdimer systems, matchings in graphs, the Ising model, and almost uniform generation of combinatorial structures. The factors saved in the mixing rate translate directly to the runtime of the algorithms that use the chains. These improvements apparently do not follow from the similar bound given by Diaconis and Stroock.

Finally, in Section 4, we address the problem of characterising the rapid mixing property for reversible Markov chains. It is already known that the conductance Φ characterises rapid mixing, in the sense that Φ⁻¹ measures the mixing rate up to a polynomial factor (in fact, a square). We show that a similar characterisation in terms of the path measure ρ also holds, provided ρ is generalised in a natural way. To do this we view the graph G describing the Markov chain as a flow network and consider a multicommodity flow problem in which a certain quantity of some commodity (x, y) is to be transported from x to y for all pairs x, y ε X. For a given flow, ρ may then be interpreted as the maximum total flow through any edge e as a fraction of its weight, or capacity. Minimising over all possible flows, we get a quantity which we call the resistance ρ ≡ ρ(G) of the Markov chain. Our last result states that, if a Markov chain is close to equilibrium after τ steps, then its resistance cannot exceed O(τ log [itπ ⁻¹_min ), where π _min = min_x∈X π(x). Therefore, under reasonable assumptions about the stationary distribution π, the resistance also characterises the rapid mixing property. In fact it is possible to show something a little stronger: the quantities Φ⁻¹ and ρ are equal up to a factor O(logπ ⁻¹_min ). This is actually an approximate max-flow min-cut theorem for the multicommodity now problem, and generalises a result obtained in a different context by Leighton and Rao [14].

In this Extended Abstract, we omit some details in proofs and examples. For a more complete treatment the reader is referred to the full paper [21].

The author wishes to acknowledge the support of the International Computer Science Institute at Berkeley and DIMACS Center, Rutgers University.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Aldous, D. Some inequalities for reversible Markov chains. Journal of the London Mathematical Society (2) 25 (1982), pp. 564–576
Google Scholar
Aldous, D. On the Markov chain simulation method for uniform combinatorial distributions and simulated annealing. Probability in the Engineering and Informational Sciences 1 (1987), pp. 33–46
Google Scholar
Alon, N. Eigenvalues and expanders. Combinatorica 6 (1986), pp. 83–96
Google Scholar
Alon, N. and Milman, V.D. λ ₁, isoperimetric inequalities for graphs and superconcentrators. Journal of Combinatorial Theory Series B 38 (1985), pp. 73–88
Google Scholar
Broder, A.Z. How hard is it to marry at random? (On the approximation of the permanent). Proceedings of the 18th ACM Symposium on Theory of Computing, 1986, pp. 50–58. Erratum in Proceedings of the 20th ACM Symposium on Theory of Computing, 1988, p. 551
Google Scholar
Diaconis, P., and Stroock, D. Geometric bounds for eigenvalues of Markov chains. Annals of Applied Probability 1 (1991), pp. 36–61
Google Scholar
Dyer, M., Frieze, A. and Kannan, R. A random polynomial time algorithm for approximating the volume of convex bodies. Proceedings of the 21st ACM Symposium on Theory of Computing (1989), pp. 375–381
Google Scholar
Jerrum, M. R. and Sinclair, A. J. Approximating the permanent. SIAM Journal on Computing 18 (1989), pp. 1149–1178
Google Scholar
Jerrum, M. R. and Sinclair, A. J. Fast Uniform Generation of Regular Graphs. Theoretical Computer Science 73 (1990), pp. 91–100
Google Scholar
Jerrum, M. R. and Sinclair, A. J. Polynomial-time approximation algorithms for the Ising model. Technical Report CSR-1-90, Dept. of Computer Science, University of Edinburgh. (Submitted to SIAM Journal on Computing; Extended Abstract in Proceedings of the 17th International Colloquium on Automata, Languages and Programming (1990), pp. 462–475)
Google Scholar
Karzanov, A. and Khachiyan, L. On the conductance of order Markov chains. Technical Report DCS 268, Rutgers University, June 1990
Google Scholar
Keilson, J. Markov chain models — rarity and exponentiality. Springer-Verlag, New York, 1979
Google Scholar
Lawler, G.F. and Sokal, A.D. Bounds on the L² spectrum for Markov chains and Markov processes: a generalization of Cheeger's inequality. Transactions of the American Mathematical Society 309 (1988), pp. 557–580
Google Scholar
Leighton, T. and Rao, S. An approximate max-flow min-cut theorem for uniform multicommodity flow problems with applications to approximation algorithms. Proceedings of the 29th IEEE Symposium on Foundations of Computer Science (1988), pp. 422–431
Google Scholar
Matula, D. W. and Shahrokhi, F. Sparsest cuts and bottlenecks in graphs. Discrete Applied Mathematics 27 (1990), pp. 113–123
Google Scholar
Mihail, M. Conductance and convergence of Markov chains: a combinatorial treatment of expanders. Proceedings of the 30th IEEE Symposium on Foundations of Computer Science (1989), pp. 526–531
Google Scholar
Mihail, M. and Winkler, P. On the number of Eulerian orientations of a graph. Bellcore Technical Memorandum TM-ARH-018829,1991. (To appear in Proceedings of SODA 1992)
Google Scholar
Mohar, B. Isoperimetric numbers of graphs. Journal of Combinatorial Theory, Series B 47 (1989), pp. 274–291
Google Scholar
Shahrokhi, F. and Matula, D. W. The maximum concurrent flow problem. Journal of the ACM 37 (1990), pp. 318–334
Google Scholar
Sinclair, A.J. Randomised algorithms for counting and generating combinatorial structures. PhD Thesis, University of Edinburgh, June 1988. (To appear as a monograph in the series Progress in Theoretical Computer Science, Birkhäuser Boston, 1991)
Google Scholar
Sinclair, A.J. Improved bounds for mixing rates of Markov chains and multicommodity flow. Technical Report ECS-LFCS-91-178, Dept. of Computer Science, University of Edinburgh.
Google Scholar
Sinclair, A.J. and Jerrum, M.R. Approximate counting, uniform generation and rapidly mixing Markov chains. Information and Computation 82 (1989), pp. 93–133
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science, University of Edinburgh, The King's Buildings, EH9 3JZ, Edinburgh, Scotland
Alistair Sinclair

Authors

Alistair Sinclair
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Imre Simon

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Sinclair, A. (1992). Improved bounds for mixing rates of Markov chains and multicommodity flow. In: Simon, I. (eds) LATIN '92. LATIN 1992. Lecture Notes in Computer Science, vol 583. Springer, Berlin, Heidelberg. https://doi.org/10.1007/BFb0023849

Download citation

DOI: https://doi.org/10.1007/BFb0023849
Published: 09 June 2005
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-55284-0
Online ISBN: 978-3-540-47012-0
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics