Maximum-Entropy Ensembles of Graphs

Squartini, Tiziano; Garlaschelli, Diego

doi:10.1007/978-3-319-69438-2_2

Part of the book series: SpringerBriefs in Complexity (BRIEFSCOMPLEXITY)

Abstract

In this chapter we describe the core method that will be used throughout the rest of the book, i.e. the construction of a constrained maximum-entropy ensemble of networks. This procedure requires the definition of the entropy of a network ensemble, the specification of structural properties to be enforced as constraints, the calculation of the resulting maximum-entropy probability of network configurations, and the maximization of the likelihood, given the empirical values of the enforced constraints. We describe this procedure explicitly, after giving some general motivations. In particular, we discuss the crucial importance of enforcing local constraints that preserve the (empirical) heterogeneity of node properties. The maximum-entropy method not only generates the exact probabilities of occurrence of any graph in the ensemble, but also the expectation values and the higher moments of any quantity of interest. Moreover, unlike most alternative approaches, it is applicable to networks that are either binary or weighted, either undirected or directed, either sparse or dense, either tree-like or clustered, either small or large. We also discuss various likelihood-based statistical criteria to rank competing models resulting from different choices of the constraints. These criteria are useful to assess the informativeness of different network properties.
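
The four steps listed above can be sketched as a short derivation. The notation is assumed here rather than taken from the chapter: \(C_a(\mathbf{G})\) denotes the enforced properties, \(C_a^*\) their empirical values measured on the observed graph \(\mathbf{G}^*\), and \(\theta_a\) the associated Lagrange multipliers.

```latex
\begin{align}
% Step 1: Shannon-Gibbs entropy of the ensemble
S(P) &= -\sum_{\mathbf{G}} P(\mathbf{G}) \ln P(\mathbf{G}) \\
% Step 2: normalization and constraints on the expected values
\sum_{\mathbf{G}} P(\mathbf{G}) &= 1, \qquad
\langle C_a \rangle = \sum_{\mathbf{G}} P(\mathbf{G})\, C_a(\mathbf{G}) = C_a^* \\
% Step 3: the constrained maximum of S(P) is an exponential distribution
P(\mathbf{G}\,|\,\vec{\theta}) &= \frac{e^{-H(\mathbf{G},\vec{\theta})}}{Z(\vec{\theta})}, \qquad
H(\mathbf{G},\vec{\theta}) = \sum_a \theta_a C_a(\mathbf{G}), \qquad
Z(\vec{\theta}) = \sum_{\mathbf{G}} e^{-H(\mathbf{G},\vec{\theta})} \\
% Step 4: log-likelihood of the observed graph and its stationarity condition
\mathcal{L}(\vec{\theta}) &= \ln P(\mathbf{G}^*\,|\,\vec{\theta})
 = -H(\mathbf{G}^*,\vec{\theta}) - \ln Z(\vec{\theta}), \qquad
\frac{\partial \mathcal{L}}{\partial \theta_a} = \langle C_a \rangle_{\vec{\theta}} - C_a(\mathbf{G}^*) = 0
\end{align}
```

The last condition shows that maximizing the likelihood amounts to choosing the parameters for which the ensemble averages of the constraints equal their empirical values.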

Whereof one cannot speak, thereof one must be silent.

—Ludwig Josef Johann Wittgenstein, Logisch-Philosophische Abhandlung

Notes

1.
A topological property f, where \(f(\mathbf {G})\) is the value of the property in graph \(\mathbf {G}\), is said to evaluate to a graphic (or graphical) value \(\tilde{f}\) if there exists at least one graph \(\tilde{\mathbf {G}}\) that realizes that value, i.e. for which \(f(\tilde{\mathbf {G}})=\tilde{f}\).
2.
An undirected graph (or network) is a graph where no direction is specified for the edges. An undirected graph is binary or simple if each pair of nodes i and j (with \(i\ne j\)) is connected by at most one edge, i.e. if there are no multiple edges between the same two nodes. We will also assume the absence of self-loops (edges starting and ending at the same node) throughout the book.
3.
A weighted graph (or network) is a graph where links may carry different intensities. When dealing with weighted networks, throughout the book we will assume non-negative integer link weights (i.e. \(w_{ij}=0,1,2,\dots ,+\infty \)) for simplicity. This corresponds to the assumption that an indivisible unit of measure of link weights has been preliminarily specified. Under this assumption, a weighted network can also be regarded as a graph that is in general not simple, i.e. where multiple links of unit weight are allowed between the same two nodes. We will still exclude the possibility of self-loops. Ideally, one may think of link weights becoming continuous as the unit of measure is chosen to be vanishingly small.
4.
A directed graph is a graph where a direction is specified for each edge (self-loops are not allowed in this case either). A directed graph is binary (or simple) if any two nodes i and j are connected in one of the following four mutually exclusive ways: via only a directed link from i to j, via only a directed link from j to i, via both such links, or via no link at all. A directed graph is weighted if links can carry different intensities, including when they point in opposite directions between the same two nodes. Again, we will assume non-negative integer weights.
5.
The empirical degree distribution is defined, for a given network, as the fraction P(k) of nodes that have degree k.
6.
The empirical strength distribution is defined, for a given network, as the fraction P(s) of nodes that have strength s.
7.
Throughout the book, by expected value (or expectation) of a topological property we mean the average of that property over the ensemble of random graphs under consideration. We denote expectation values with angular brackets \(\langle \cdot \rangle \). The rigorous definition is given later in Eq. (2.7).
8.
The average degree in a simple undirected graph with N nodes is defined as \(\bar{k}=N^{-1}\sum _{i=1}^N k_i\) and necessarily equals 2L / N, where L is the total number of links.
9.
The average strength in a weighted undirected graph with N nodes is defined as \(\bar{s}=N^{-1}\sum _{i=1}^N s_i\) and necessarily equals 2W / N, where W is the total weight of all links in the network.
10.
The maximum value of the entropy S(P) depends on the total number of configurations over which the sum in Eq. (2.8) runs. This maximum can be rescaled to one for all probability distributions by normalizing S(P) by the maximum value itself.
11.
In statistical physics, the thermodynamic limit is defined as the limit where the number of fundamental units that describe the microscopic configurations of the system diverges. In our graph ensembles, we regard the nodes as the units and their connections as the interactions.
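
Several of the definitions in these notes can be checked numerically. The following is a minimal sketch, not code from the book: the function names and the toy weight matrix are hypothetical, and the Erdős–Gallai inequalities are used as one standard way to test whether a degree sequence is graphic in the sense of note 1. It also illustrates the empirical distributions of notes 5 and 6, the identities of notes 8 and 9, and the entropy normalization of note 10.

```python
from collections import Counter

import numpy as np


def empirical_distribution(values):
    """Fraction of nodes having each observed value (degree k or strength s)."""
    counts = Counter(int(v) for v in values)
    n = len(values)
    return {v: c / n for v, c in sorted(counts.items())}


def is_graphic(degree_sequence):
    """Erdős–Gallai test: does a simple undirected graph with these degrees exist?"""
    d = sorted((int(k) for k in degree_sequence), reverse=True)
    if any(k < 0 for k in d) or sum(d) % 2 == 1:
        return False
    for r in range(1, len(d) + 1):
        lhs = sum(d[:r])
        rhs = r * (r - 1) + sum(min(k, r) for k in d[r:])
        if lhs > rhs:
            return False
    return True


def normalized_entropy(p):
    """Shannon entropy of p divided by its maximum, ln(number of configurations)."""
    p = np.asarray(p, dtype=float)
    nonzero = p[p > 0]  # terms with zero probability contribute nothing
    s = -np.sum(nonzero * np.log(nonzero))
    return s / np.log(p.size)


# Hypothetical toy network: symmetric integer weights, no self-loops.
W = np.array([
    [0, 2, 1, 0],
    [2, 0, 3, 0],
    [1, 3, 0, 1],
    [0, 0, 1, 0],
])
A = (W > 0).astype(int)   # binary projection of the weighted graph

N = W.shape[0]
k = A.sum(axis=1)         # node degrees
s = W.sum(axis=1)         # node strengths
L = A.sum() // 2          # total number of links (each pair counted once)
W_tot = W.sum() // 2      # total weight of all links

P_k = empirical_distribution(k)
P_s = empirical_distribution(s)
mean_k = k.mean()         # equals 2L / N
mean_s = s.mean()         # equals 2W / N
```

Here the binary projection `A` is used for the degree-based quantities and the weight matrix `W` for the strength-based ones, so the same toy network covers both the binary and the weighted definitions.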

Author information

Authors and Affiliations

IMT School for Advanced Studies Lucca, Lucca, Italy
Tiziano Squartini
Lorentz Institute for Theoretical Physics, University of Leiden, Leiden, The Netherlands
Diego Garlaschelli

Corresponding author

Correspondence to Diego Garlaschelli.

About this chapter

Cite this chapter

Squartini, T., Garlaschelli, D. (2017). Maximum-Entropy Ensembles of Graphs. In: Maximum-Entropy Networks. SpringerBriefs in Complexity. Springer, Cham. https://doi.org/10.1007/978-3-319-69438-2_2

Published: 22 November 2017
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-69436-8
Online ISBN: 978-3-319-69438-2
eBook Packages: Physics and Astronomy (R0)
