Abstract
In this chapter we describe the core method that will be used throughout the rest of the book, i.e. the construction of a constrained maximum-entropy ensemble of networks. This procedure requires the definition of the entropy of a network ensemble, the specification of structural properties to be enforced as constraints, the calculation of the resulting maximum-entropy probability of network configurations, and the maximization of the likelihood, given the empirical values of the enforced constraints. We describe this procedure explicitly, after giving some general motivations. In particular, we discuss the crucial importance of enforcing local constraints that preserve the (empirical) heterogeneity of node properties. The maximum-entropy method not only generates the exact probabilities of occurrence of any graph in the ensemble, but also the expectation values and the higher moments of any quantity of interest. Moreover, unlike most alternative approaches, it is applicable to networks that are either binary or weighted, either undirected or directed, either sparse or dense, either tree-like or clustered, either small or large. We also discuss various likelihood-based statistical criteria to rank competing models resulting from different choices of the constraints. These criteria are useful to assess the informativeness of different network properties.
Whereof one cannot speak, thereof one must be silent.
—Ludwig Josef Johann Wittgenstein, Logisch-Philosophische Abhandlung
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
A topological property f, where \(f(\mathbf {G})\) is the value of the property in graph \(\mathbf {G}\), is said to evaluate to a graphic (or graphical) value \(\tilde{f}\) if there exist at least one graph \(\tilde{\mathbf {G}}\) that realizes such value, i.e. for which \(f(\tilde{\mathbf {G}})=\tilde{f}\).
- 2.
An undirected graph (or network) is a graph where no direction is specified for the edges. An undirected graph is binary or simple if each pair of nodes i and j (with \(i\ne j\)) is connected by at most one edge, i.e. if there are no multiple edges between the same two nodes. We will also assume the absence of self-loops (edges starting and ending at the same node) throughout the book.
- 3.
A weighted graph (or network) is a graph where links may carry different intensities. When dealing with weighted networks, throughout the book we will assume non-negative integer link weights (i.e. \(w_{ij}=0,1,2\dots +\infty \)) for simplicity. This corresponds to the assumption that an indivisible unit of measure of link weights has been preliminary specified. Under this assumption, a weighted network can also be regarded as a graph that is in general not simple, i.e. where multiple links of unit weight are allowed between the same two nodes. We will still exclude the possibility of self-loops. Ideally, one may think of link weights becoming continuous as the unit of measure is chosen to be vanishingly small.
- 4.
A directed graph is a graph where a direction is specified for each edge (self-loops are not allowed in this case as well). A directed graph is binary (or simple) if any two nodes i and j are connected in one of the following four mutually-exclusive ways: via only a directed link from i to j, via only a directed link from j to i, via both such links, or via no link at all. A directed graph is weighted if links can carry different intensities, including when they are pointing in opposite direction between the same two nodes. Again, we will assume non-negative integer weights.
- 5.
The empirical degree distribution is defined, for a given network, as the fraction P(k) of nodes that have degree k.
- 6.
The empirical strength distribution is defined, for a given network, as the fraction P(s) of nodes that have strength s.
- 7.
Throughout the book, by expected value (or expectation) of a topological property we mean the average of that property over the ensemble of random graphs under consideration. We denote expectation values with angular brackets \(\langle \cdot \rangle \). The rigorous definition is given later in Eq. (2.7).
- 8.
The average degree in a simple undirected graph with N nodes is defined as \(\bar{k}=N^{-1}\sum _{i=1}^N k_i\) and necessarily equals 2L / N, where L is the total number of links.
- 9.
The average strength in a weighted undirected graph with N nodes is defined as \(\bar{s}=N^{-1}\sum _{i=1}^N s_i\) and necessarily equals 2W / N, where W is the total weight of all links in the network.
- 10.
The maximum value of the entropy S(P) depends on the total number of configurations over which the sum in Eq. (2.8) runs. This number can be rescaled to one for all probability distributions, upon normalizing S(P) by the maximum value itself.
- 11.
In statistical physics, the thermodynamic limit is defined as the limit where the number of fundamental units that describe the microscopic configurations of the system diverges. In our graph ensembles, we regard the nodes as the units and their connections as the interactions.
References
S. Maslov, K. Sneppen, Specificity and stability in topology of protein networks. Science 296, 910–913 (2002)
S. Maslov, K. Sneppen, A. Zaliznyak, Detection of topological patterns in complex networks: correlation profile of the Internet. Physica A 333, 529–540 (2004)
R. Milo, S. Shen-Orr, S. Itzkovitz, N. Kashtan, D. Chklovskii, U. Alon, Network motifs: simple building blocks of complex networks. Science 298, 824–827 (2002)
F. Chung, L. Lu, “Connected components in random graphs with given expected degree sequences”, Ann. Combin. 6(125) (2002)
J. Park, M.E.J. Newman, "Origin of degree correlations in the Internet and other networks". Phys. Rev. E 68, 026112 (2003)
M. Catanzaro, M. Boguná, R. Pastor-Satorras, Generation of uncorrelated random scale-free networks. Phys. Rev. E 71, 027103 (2005)
D. Garlaschelli, M.I. Loffredo, Multispecies grand-canonical models for networks with reciprocity. Phys. Rev. E 73, 015101(R) (2006)
D.B. Stouffer, J. Camacho, W. Jiang, L.A.N. Amaral, Evidence for the existence of a robust pattern of prey selection in food webs. Proc. R. Soc. B 274, 1931–1940 (2007)
R. Guimerá, M. Sales-Pardo, L.A.N. Amaral, Classes of complex networks defined by role-to-role connectivity profiles. Nat. Phys. 3, 63–69 (2007)
J. Park, M.E.J. Newman, Statistical mechanics of networks. Phys. Rev. E 70, 066117 (2004)
M.A. Serrano, M. Boguná, “Weighted configuration model” AIP Conf. Proc. 776(101) (2005)
M.A. Serrano, M. Boguná, R. Pastor-Satorras, Correlations in weighted networks. Phys. Rev. E 74, 055101(R) (2006)
M.A. Serrano, Rich-club vs rich-multipolarization phenomena in weighted networks. Phys. Rev. E 78, 026101 (2008)
A. Barrat, M. Barthelemy, R. Pastor-Satorras, A. Vespignani, The architecture of complex weighted networks. Proc. Nat. Acad. Sci. 101, 3747–3752 (2004)
T. Opsahl, V. Colizza, P. Panzarasa, J.J. Ramasco, Prominence and Control: The Weighted Rich-Club Effect. Phys. Rev. Lett. 101, 168702 (2008)
K. Bhattacharya, G. Mukherjee, J. Saramaki, K. Kaski, S.S. Manna, “The International Trade Network: weighted network analysis and modelling”, J. Stat. Mech., P02002 (2008)
G. Bianconi, The entropy of network ensembles. Phys. Rev. E 79, 036114 (2009)
D. Garlaschelli, M.I. Loffredo, Generalized Bose-Fermi statistics and structural correlations in weighted networks. Phys. Rev. Lett. 102, 038701 (2009)
D. Garlaschelli, The weighted random graph model. New J. Phys. 11, 073005 (2009)
R. Milo, N. Kashtan, S. Itzkovitz, M.E.J. Newman, U. Alon, “On the uniform generation of random graphs with prescribed degree sequences”, http://arxiv.org/abs/cond-mat/0312028
Y. Artzy-Randrup, L. Stone, Generating uniformly distributed random networks. Phys. Rev. E 72, 056708 (2005)
L. Tabourier, C. Roth, J.-P. Cointet, “Generating constrained random graphs using multiple edge switches”, http://arxiv.org/abs/1012.3023
M.E.J. Newman, S.H. Strogatz, D.J. Watts, Random graphs with arbitrary degree distributions and their applications. Phys. Rev. E 64, 026118 (2001)
G. Caldarelli, Scale-free Networks. Complex Webs in Nature and Technology (Oxford University Press, 2007)
P. Erdös, A. Rényi, On random graphs. Publicationes Mathematicae Debrecen 6, 290–297 (1959)
A.C.C. Coolen, A. De Martino, A. Annibale, Constrained Markovian dynamics of random graphs. J. Stat. Phys. 136, 1035–1067 (2009)
E.S. Roberts, A.C.C. Coolen, Unbiased degree-preserving randomization of directed binary networks. Phys. Rev. E 85, 046103 (2012)
C.I. Del Genio, H. Kim, Z. Toroczkai, K.E. Bassler, Efficient and exact sampling of simple graphs with given arbitrary degree sequence. PLoS ONE 5(4), e10012 (2010)
H. Kim, C.I. Del Genio, K.E. Bassler, Z. Toroczkai, Constructing and sampling directed graphs with given degree sequences. New J. Phys 14(2), 023012 (2012)
J. Blitzstein, P. Diaconis, A sequential importance sampling algorithm for generating random graphs with prescribed degrees. Internet Mathematics 6(4), 489–522 (2011)
P. Erdös, T. Gallai, “Graphs with prescribed degree of vertices”, Mat. Lapok 11(477) (1960)
S. Melnik, A. Hackett, M.A. Porter, P.J. Mucha, J.P. Gleeson, The unreasonable effectiveness of tree-based theory for networks with clustering. Phys. Rev. E 83, 036112 (2011)
M.E.J. Newman, Random graphs with clustering. Phys. Rev. Lett. 103, 058701 (2009)
M. Boguná, R. Pastor-Satorras, A. Vespignani, Cut-offs and finite size effects in scale-free networks. Eur. Phys. J. B 38, 205–209 (2004)
D. Garlaschelli, M.I. Loffredo, Maximum likelihood: Extracting unbiased information from complex networks. Phys. Rev. E 78, 015101(R) (2008)
M.E.J. Newman, Analysis of weighted networks. Phys. Rev. E 70, 056131 (2004)
J.W. Gibbs, Elementary principles in statistical mechanics (Charles Scribner’s Sons, New York, 1902)
C. Shannon, A mathematical theory of communication. Bell System Tech. Jour. 27(379–423), 623–656 (1948)
E.T. Jaynes, “Information theory and statistical mechanics”, Phys. Rev. 106(620) (1957)
E.T. Jaynes, “On the rationale of maximum-entropy methods”, Proc. IEEE 70(939) (1982)
T. Squartini, D. Garlaschelli, Analytical maximum-likelihood method to detect patterns in real networks. New J. Phys. 13, 083001 (2011)
P. Holland, S. Leinhardt, Sociological Methodology, ed. by D. Heise. (Jossey-Bass, San Francisco, 1975), pp. 1–45
S. Wasserman, K. Faust, Social Network Analysis (Cambridge University Press, Cambridge, 1994)
T.A.B. Snijders, “Markov chain Monte Carlo estimation of exponential random graph models”, J. Soc. Struct. 3(2) (2002)
K.P. Burnham, D.R. Anderson, Model selection and multi-model inference: a practical information-theoretic approach (Springer, New York, 2002)
J.B. Johnson, K.S. Omland, Model selection in ecology and evolution. Trends Ecol. Evol. 9, 101–108 (2004)
D.R. Cox, D.V. Hinkley, Theoretical statistics (Chapman and Hall, Boca Raton, 1974)
H. Akaike, A new look at the statistical model identification. IEEE Trans. Aut. Cont. 19, 716–723 (1974)
K.P. Burnham, D.R. Anderson, Multimodel inference: understanding AIC and BIC in Model Selection. Soc. Met. Res. 33, 261–304 (2004)
E.J. Wagenmakers, S. Farrell, AIC model selection using Akaike weights. Psych. Bull Rev. 11, 192–196 (2004)
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
Copyright information
© 2017 The Author(s)
About this chapter
Cite this chapter
Squartini, T., Garlaschelli, D. (2017). Maximum-Entropy Ensembles of Graphs. In: Maximum-Entropy Networks. SpringerBriefs in Complexity. Springer, Cham. https://doi.org/10.1007/978-3-319-69438-2_2
Download citation
DOI: https://doi.org/10.1007/978-3-319-69438-2_2
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-69436-8
Online ISBN: 978-3-319-69438-2
eBook Packages: Physics and AstronomyPhysics and Astronomy (R0)