Topology of random geometric complexes: a survey
 216 Downloads
Abstract
In this expository article, we survey the rapidly emerging area of random geometric simplicial complexes. Random geometric complexes may be viewed as higherdimensional generalizations of random geometric graphs, where vertices are generated by a random point process, and edges are placed based on proximity. Extending the notion of connected components and cycles in graphs, the main object of study has been the homology of these complexes. We review the results known to date about the probabilistic behavior of the homology (and related structures) generated by these random complexes.
Keywords
Simplicial complexes Homology Random graphs Boolean model Random topologyMathematics Subject Classification
Primary 60B99 60D05 05E45 Secondary 55U101 Introduction
Perhaps the most studied model of random graphs is the Erdős–Rényi model G(n, p), where every edge appears independently with probability p. Textbooks overviewing this subject include those by Bollobás (2001) and Janson et al. (2000). Simplicial complex analogues of G(n, p) and their topological properties have been the subject of a lot of activity in recent years. See for example Babson et al. (2011); Kahle (2014a); Linial and Meshulam (2006); Meshulam and Wallach (2009) and the references in the survey article Kahle (2014b).
For certain applications, however, and especially for modeling realworld networks such as social networks, the edgeindependent model G(n, p) is not considered to be particularly realistic. For example, we might expect in a social network that if we know that X is friends with Y and Z, then it becomes much more likely than it would be otherwise that Y is friends with Z.
Many other models of random graphs have been studied in recent years, and one family of models that has received a lot of attention is the random geometric graphs—see Penrose’s monograph (2003) for an overview. The random geometric graph G(n, r) is made by choosing n points independently and identically distributed (i.i.d.), according to a probability measure on Euclidean space \(\mathbb {R}^d\) (or any other metric space), and these points correspond to the vertices of the graph. Two vertices x and y are connected by an edge if and only if the distance between x and y satisfies \(d(x,y) \le r\). Since one is usually interested in asymptotic properties as \(n \rightarrow \infty \), we usually think of the threshold distance r as a function of n.
This is a very general setup, and many variations on this basic model have been studied. The most closely related model to the n points i.i.d. model is a geometric graph on a Poisson point process with expected number of points n. A Poisson point process replaces the independence of points with spatial independence. There is a lot of technology available for transferring theorems between these two models. See, for example, Section 1.7 of Penrose (2003). One might also consider more general point processes than Poisson. For example, Yogeshwaran et al. (2015) studied random geometric graphs and complexes over more general stationary point processes. This family includes certain attractive and repulsive point processes, as well as stationary determinantal processes. In addition, we can consider random geometric graphs in metric measure spaces, such as Riemannian manifolds equipped with probability measures. The topological and geometric properties of such graphs (and their higherdimensional analogues) were recently studied in Bobrowski and Mukherjee (2014); Bobrowski and Oliveira (2017).
There are several natural ways of extending a geometric graph to a simplicial complex, in particular the Čech complex and the Vietoris–Rips complex, whose definitions we review in Sect. 2. Our interest in the topology of random geometric complexes will be mainly confined to their homology. Briefly, if X is a topological space, its degree khomology, denoted by \(H_k(X)\) is a vector space (assuming field coefficients). The vector space \(H_0(X)\) contains information about the connected components in X, and its dimension is the number of components. For \(k>0\), \(H_k(X)\) contains information about kdimensional ‘cycles‘ or ‘holes’ (see more details in Sect. 2). The Betti numbers of X are defined as \(\beta _k(X) = \dim H_k(X)\).
One motivation for studying the topological features of random geometric complexes comes from topological data analysis (TDA). In TDA one builds a simplicial complex (or filtered simplicial complex) on data, and infers qualitative features of the data from homology (or persistent homology) of the point cloud. Studying the topology of random geometric complexes is related to developing probabilistic null hypotheses for topological statistics. We discuss this further in Sect. 9. The seminal work by NiyogiSmaleWeinberger (2008, 2011) introduced a probabilistic analysis to homology recovery algorithms. This was further extended in Balakrishnan et al. (2012); Bobrowski and Mukherjee (2014); Bobrowski et al. (2017); Fasy et al. (2014). For surveys of persistent homology in topological data analysis, see Carlsson (2009) and Ghrist (2008).
Studying the limiting behavior of random geometric complexes, the first observation we make is that there exist three main regimes in which the limiting properties of the complexes are significantly different. The term that controls the limiting behavior is \(\Lambda = nr^d\), which can be thought of as the average number of points in a ball of radius r (up to a constant).
The subcritical (sometimes called ‘sparse’ or ‘dust’) regime, is when \(\Lambda \rightarrow 0\). In this regime the geometric complex is highly disconnected, and this is where homology first appears.
The critical regime (sometimes called ‘the thermodynamic regime’) is when \(\Lambda = \lambda \in (0,\infty )\). Here, the dimension of homology reaches its peak linear growth, and this is also where percolation occurs (the formation of a ‘giant’ component) — see the discussion in Sect. 3.2.
Finally, in the supercritical regime we have \(\Lambda \rightarrow \infty \). In this regime it is known that the number of components slowly decays, until we reach the connectivity threshold. An analogous process occurs for higher homology — cycles get filled, until eventually every kcycle is a boundary and homology \(H_k\) vanishes. But in contrast, for higher homology \(k \ge 1\) there is another phase transition where homology \(H_k\) first appears.
We note that the connectivity (or \(H_0\)) properties of random geometric graphs were extensively studied in the past, see Penrose (2003) for a comprehensive review. Thus, in this survey we will mainly focus on more recent results related to higher degrees of homology (\(H_k,\ k\ge 1\)).
The rest of this survey is structured as follows. In Sect. 2 we present the concepts and notation that will be used later. Section 3 quickly reviews classical results about the connectivity of random geometric graphs for completeness. Section 4 presents a summary of the main results known to date about the limiting behavior of the homology of random geometric complexes. In Sect. 5 we review an alternative approach to study the homology of random Čech complexes using Morse theory for the distance function. Sections 6 and 7 review two extensions to the results in Sect. 4—one for compact manifolds and the other for stationary point processes. Section 8 discusses the case where the distribution underlying the point process has an unbounded support, from an extreme value analysis perspective. In Sect. 9 we discuss work in progress that studies the persistent homology generated by random geometric complexes. Finally, in Sect. 10 we present a list of open problems and future work in this area.
2 Preliminaries
In this section we wish to briefly introduce the concepts and notation that will be used throughout this survey.
2.1 Homology
We wish to introduce the concept of homology here in an intuitive rather than a rigorous way. For a comprehensive introduction to homology, see Hatcher (2002) and Munkres (1984). Let X be a topological space. The homology of X is a set of abelian groups \(\left\{ H_k(X)\right\} _{k=0}^\infty \), which are topological invariants of X.
The dimension of the kth homology group is called the kth Betti number, denoted by \(\beta _k(X) := \dim (H_k(X))\).
2.2 Geometric complexes
The geometric complexes we will be studying are the Čech and the VietorisRips complexes, defined as follows. We will use \(B_r(x)\) to denote the ball of radius r around a point x.
Definition 2.1
 1.
The 0simplices (vertices) are the points in \(\mathcal {X}\).
 2.
A ksimplex \([x_{i_0},\ldots ,x_{i_k}]\) is in \(\mathcal {C}_r(\mathcal {X})\) if \(\bigcap _{j=0}^{k} {B_{r/2}(x_{i_j})} \ne \emptyset \).
Definition 2.2
 1.
The 0simplices (vertices) are the points in \(\mathcal {X}\).
 2.
A ksimplex \([x_{i_0},\ldots ,x_{i_k}]\) is in \(\mathcal {R}_r(\mathcal {X})\) if \(\left\ x_{i_j}  x_{i_l}\right\ \le r\) for all \(0\le j,l \le k\).
Lemma 2.3
This lemma is highly useful in the study of the random Čech complex, since it allows us to translate questions about the random complex into questions about coverage properties, and enables the use of Morse theory (see Sect. 5). One immediate consequence of the Nerve Lemma is that if \(\mathcal {X}\subset \mathbb {R}^d\) then \(H_k(\mathcal {C}_r(\mathcal {X})) = 0\) for all \(k\ge d\).
2.3 Point processes

The binomial process \(\mathcal {X}_n = \{X_1,X_2,\ldots , X_n\}\) is a set of \(\mathrm {i.i.d.}\) (independent and identically distributed) random variables in \(\mathbb {R}^d\) generated by the density function f.
 The Poisson process \(\mathcal {P}_n\) is a spatial Poisson process in \(\mathbb {R}^d\) with intensity function \(\mu = nf\). The distribution of \(\mathcal {P}_n\) satisfies the following properties:This process is also a special case of a ‘Boolean model’.
 1.
For every compact set \(A\subset \mathbb {R}^d\) we have \(\left {\mathcal {P}_n \cap A}\right \sim \mathrm {Poisson}\;({\mu (A))}\), where \(\mu (A) = \int _A \mu (x) dx.\)
 2.
For every two disjoint sets \(A,B \subset \mathbb {R}^d\), we have that \(\left {\mathcal {P}_n\cap A}\right \) and \(\left {\mathcal {P}_n \cap B}\right \) are independent.
 1.
2.4 Convergence of sequences of random variables
Probability theory uses a number of different notions of convergence. Below we define the ones used in this survey.
Definition 2.4
This type of convergence is also sometimes referred to as ‘weak convergence’.
Definition 2.5
We will mostly use the case \(p=2\).
Definition 2.6
Finally, we have the following probabilistic definition related to limiting events rather than random variables.
Definition 2.7
2.5 Some notation

\(a_n = O(b_n)\) if there exists a constant C and \(n_0 >0\) such that \(a_n \le C b_n\) for every \(n> n_0\);

\(a_n = \Omega (b_n)\) if there exists a constant \(C>0\) and \(n_0 >0\) such that \(a_n \ge C b_n\) for every \(n> n_0\);

\(a_n = \Theta (b_n)\) if both \(a_n = O(b_n)\) and \(a_n = \Omega (b_n)\). We will also denote that by \(a_n \sim b_n\);

\(a_n = o(b_n)\) if \(\lim _{n\rightarrow \infty } \left {a_n / b_n}\right = 0\). We will also denote that by \(a_n \ll b_n\);

\(a_n = \omega (b_n)\) if \(\lim _{n\rightarrow \infty } \left {a_n / b_n}\right = \infty \). We will also denote that by \(a_n \gg b_n\).
Finally, for any set \(A\subset \mathbb {R}^d\) we use \(\left {A}\right \) to denote the ddimensional volume of the set.
3 Connectivity
3.1 The subcritical regime
The subcritical regime (also known as the ‘sparse’ or ‘dust’ regime) is when \(\Lambda \rightarrow 0\). In this regime, the graph G(n, r) is very sparse, and mostly disconnected. Therefore, the study of connectivity did not draw much attention in the past. See Bobrowski and Mukherjee (2014) for a proof of the following.
Theorem 3.1
This statement can be sharpened to a central limit theorem, and a law of large numbers can be proved for deviation from the mean. In fact, as we see in the next section, a central limit theorem and law of large numbers continue, even into the critical regime.
3.2 The critical regime
The critical regime (also known as the ‘thermodynamic limit’) is when \(\Lambda = \lambda \in (0,\infty )\). In this regime \(\beta _0(n) \approx c n \) for some constant \(c<1\) (depending on \(\lambda \)), so the number of components is still \(\Theta (n)\), but is significantly lower than in the subcritical regime. The following law of large numbers is proved in section 13.7 of Penrose (2003).
Theorem 3.2
The infinite sum in (3) comes from the fact that we need to count the number of components consisting of any possible number of vertices. The limiting expression provided by the theorem is highly intricate, and at this point impossible to evaluate analytically. Nonetheless, as we will discuss later, this theorem provides the only formula available to date for the limit of the Betti numbers in the critical regime.
In addition to a law of large numbers, there is also a central limit theorem available.
Theorem 3.3
A more geometric view of connectivity is studied in percolation theory. Penrose considered the case where f is a uniform probability density on a ddimensional unit cube, and \(\Lambda = \lambda \). A remarkable fact is that there exists a constant \(\lambda _c >0\) such that if \(\lambda < \lambda _c\) then a.a.s. every connected component is of order \(O( \log n)\), and if \(\lambda > \lambda _c\) then a.a.s. there is a unique “giant” component on \(\Theta (n)\) vertices. This sudden change in behavior over a very small shift of parameter is sometimes called a sharp phase transition.
In Chapters 9 and 10 of Penrose (2003), Penrose relates percolation on random geometric graphs to more classical continuum percolation theory. In continuum percolation, also called the Gilbert disk model, see Gilbert (1961), one considers a random geometric graph on a unitintensity uniform Poisson process on \(\mathbb {R}^d\), and then there is a threshold radius \(r_c > 0\) such that for \(r > r_c\) the random geometric graph has an infinite connected component, and for \(r< r_c\) every component is finite size. For a deeper study of continuum percolation, see Meester and Roy’s book (1996). For an introduction and overview of the subject, see Chapter 8 of Bollobas–Riordan (2006) or Section 12.10 of Grimmett (1999).
3.3 The supercritical regime
The supercritical regime is when \(\Lambda \rightarrow \infty \). As we will see soon, if the radius is large enough (yet still satisfying \(r\rightarrow 0\)) then it can be shown that the graph G(n, r) becomes connected (caveat, this statement depends on the underlying distribution). This phase is sometimes referred to as the ‘connected regime’. As the radius increases, starting at the critical regime where \(\beta _0(n) = \Theta (n)\) and ending at the connected regime where \(\beta _0(n) = \Theta (1)\), the number of components in G(n, r) should exhibit some kind of a decay within the supercritical regime. To this date only partial information is available about this decay process, and we will present it later. We start by describing the connected regime.
In the case of a uniform distribution on the ddimensional unit box \([0,1]^d\), Penrose gives a sharp result for the connectivity threshold. See Penrose (2003), Chapter 13.
Theorem 3.4
In other words, the threshold radius for connectivity is \(r = \left( \frac{2^{d1}}{d\omega _d}\cdot \frac{\log n}{n}\right) ^{1/d}\) (or \(\Lambda = (2^{d1}/d\omega _d) \log n\)). It is interesting to contrast Theorem 3.4 with the analogous statement for a standard multivariate normal distribution \(\mathcal {N}(0,\mathbf{{I}}_{d\times d})\) in \(\mathbb {R}^d\), a case which Penrose also studies. Here r must be significantly larger, roughly \(1 / \sqrt{ \log n}\), in order to ensure connectivity.
Theorem 3.5
In both cases, letting \(c \rightarrow \pm \infty \) gives the correct width of the critical window. The critical window is the range of functions r such that the probability of connectedness approaches a constant strictly between 0 and 1.
Why does the threshold distance \(r=r(n)\) have to be so much larger in the Gaussian case? The support of the Gaussian distribution is unbounded, and there are outlier points at distance roughly \(\sqrt{2 \log n}\). The radius must be large enough just to connect these points to the rest of the graph.
The contrast of Theorems 3.4 and 3.5 suggests that whatever we hope to prove about the topology of random geometric complexes will necessarily depend on the underlying distribution. On the other hand, certain theorems in geometric probability are fairly general and do not depend on the underlying distribution so drastically.
For example, if we ask what is the threshold for G(n, r) to contain a given subgraph, or what is the expected number of occurrences of a given subgraph in the sparse regime, then in some sense the answer does not depend too much on the underlying density function. The following is proved in Chapter 3 of Penrose (2003).
Theorem 3.6
As a rule of thumb, one might expect that global properties such as connectivity depend very delicately on the underlying probability measure. Local properties, such as subgraph counts or behavior in the subcritical regime, do not depend so much on the underlying measure.
To conclude this section, we mention a recent result about the supercritical regime preceding connectivity. As mentioned above, there is a huge gap remaining between the critical regime where \(\beta _0(n) = \Theta (n)\) and the connectivity point where \(\beta _0(n) = \Theta (1)\). Recent work by Ganesan studies the decay in the number of components within the supercritical regime, in the case \(d=2\). The assumption is that the underlying probability measure on \([0,1]^2\) is supported on a measurable density function f, and that f is bounded above and below. The following is Theorem 1 in Ganesan (2013).
Theorem 3.7
We will see an analogue of this theorem for higher Betti numbers of the random Čech and Vietoris–Rips complexes in the following section.
4 Homology and Betti Numbers
Betti numbers of random geometric complexes were first studied by Robins in (2006). Robins studies “alpha shapes” on random point sets (see Herbert et al. (1983)), which are topologically equivalent to Čech complexes but more convenient from the point of view of computation. The underlying distributions are uniform on a ddimensional cube, but to avoid boundary effects periodic, boundary conditions are imposed. Robins computes the expected Betti numbers over a large number of experiments. Furthermore, she explains the shapes of these curves in the “small radius–low intensity” regime, writing formulas in the \(d=2\) and \(d=3\) cases.
The study of the limiting Betti numbers was revisited and significantly extended later in a series of papers by various authors, see for example  Bobrowski and Adler (2014); Bobrowski and Mukherjee (2014); Kahle (2011); Kahle and Meckes (2013) and Yogeshwaran et al. (2015, 2017). In contrast to connectivity which corresponds to the zeroth homology \({H}_0\), the higher homology of random geometric complexes \(H_k(\mathcal {C}_r(n))\), \(k \ge 1\) is not monotone with respect to r. Each homology group passes through two main phase transitions, one where it appears and one where it disappears.
For the random Čech complex, the phase transition where \(H_k\) occurs when \(\Lambda \sim n^{\frac{1}{k+1}}\) (or \(r \sim n^{\frac{k+2}{d(k +1)} }\)). This radius is within the subcritical regime (\(\Lambda \rightarrow 0\)). In this regime the complex is sparse and highly disconnected which allows very precise Betti number computations — in particular we will see that \(\beta _k(n) \sim n \Lambda ^{k+1}\), and therefore \(\beta _k(n) = o(n).\)
The phase transition where the kth homology vanishes depends on the underlying probability distribution, but if f has a compact support then we will see that it occurs at \(\Lambda = \Theta (\log n)\) (or \(r = \Theta ((\log n / n)^{1/d})\)), which is within the supercritical regime. This radius is similar to the connectivity threshold we saw in Section 3.3, though the constants are different. The exact vanishing radius for each of the homology groups \(H_k\) has not been discovered yet, but it is known that it is controlled by a second order (\(\log \log n\)) term that depends on k. We will discuss this in Sect. 6.
In the critical regime the analysis of the Betti numbers \(\beta _k(n)\), \(k\ge 1\), is significantly more complicated than the analysis of \(\beta _0(n)\). In this case we will see that \(\beta _k(n) = \Theta (n)\), however the limiting constants are unknown to date.
We now review the results known to date about the topology of random geometric complexes for each of the regimes.
4.1 The subcritical regime
The work in Kahle (2011) and Kahle and Meckes (2013) provides a detailed study for the Betti numbers in the subcritical regime. Since a random geometric complex in this regime is so sparse, the vast majority of kcycles are generated by “small” spherelike shapes, with the minimum number of vertices possible. For the Čech complex, the minimum number of vertices to form an kcycle is \(k+2\) (for example, to create a 1cycle, or a loop, we need at least 3 vertices). These spherelike formations are local features, so by the rule of thumb above, we might expect a theorem that holds across a wide class of measures.
Theorem 4.1
Theorem 4.1 states that \(\mathbb {E}\left\{ {\beta _k(n)}\right\} \sim n\Lambda ^{k+1}\). Note that within the subcritical regime the limit of the term \(n\Lambda ^{k+1}\) can be either zero, a finite number, or infinity (for different choices of r). Combining with the second moment method (see for example Chapter 4 of Alon and Spencer (2008)), this is the threshold radius for the phase transition where homology first appears.
Theorem 4.2
 1.Ifthen a.a.s. \(H_k( \mathcal {C}_r(n)) = 0\), and$$\begin{aligned} \Lambda \ll {n^ {\frac{1}{k +1} }}, \end{aligned}$$
 2.ifthen a.a.s. \(H_k( \mathcal {C}_r(n)) \ne 0\).$$\begin{aligned} \Lambda \gg {n^{ \frac{1}{k +1}}} \end{aligned}$$
Thus, the threshold where the kth homology first appears is \(\Lambda = \Theta (n^{\frac{1}{k+1}})\), or \(r = \Theta (n^{ \frac{k+2}{d(k +1)}})\).
The parallel result for Vietoris–Rips complexes is also given in Kahle (2011).
Theorem 4.3
 1.Ifthen a.a.s. \(H_k( \mathcal {R}_r(n)) = 0\), and$$\begin{aligned} \Lambda \ll {n^ {\frac{1}{2k +1} }}, \end{aligned}$$
 2.ifthen a.a.s. \(H_k( \mathcal {R}_r(n)) \ne 0\).$$\begin{aligned} \Lambda \gg {n^{ \frac{1}{2k +1}}} \end{aligned}$$
The difference in exponents stems from the fact that in the Vietoris–Rips complex case, the smallest possible vertex support for a nontrivial cycle in \(H_k\) is on \(2k+2\) vertices (rather than \(k+2\) in the Čech complex), a triangulated sphere combinatorially isomorphic to the boundary of the \((k+1)\)dimensional cross polytope. Another difference is that while in the Čech complex the homology degree is bounded by \(d1\) (a consequence of the Nerve Lemma), for the Vietoris–Rips complex it is unbounded, and we can have cycles of every possible dimension.
Kahle and Meckes studied limiting distributions of Betti numbers in the subcritical regime in (2013). When \(\Lambda = \Theta ( n^{\frac{1}{k+1}})\) (or \(r = \Theta (n^{ \frac{k+2}{d(k+1)}})\)), the following is a refinement of Theorem 4.2, and shows that at the threshold where the homology \(H_k\) first appears, there is a regime in which the Betti number \(\beta _k(n)\) converges in law to a Poisson distribution.
Theorem 4.4
When r is above the threshold, the number of cycles goes to infinity, and with the proper normalization it obeys a central limit theorem. Let \(\mathcal {N}(0,1)\) denote a normal distribution with mean 0 and variance 1.
Theorem 4.5
Again, because we are in the subcritical regime, these results hold for a wide variety of measures—whenever the underlying probability measure has a measurable density function which is bounded above. They hold even without compact support, for example for a multivariate normal distribution. In Kahle and Meckes (2013) Theorems 4.4 and 4.5 are accompanied by formulas for expectation and variance of the Betti numbers. Parallel limit theorems are also proved for Vietoris–Rips complexes.
4.2 The critical regime
The study of the Betti numbers becomes significantly more complicated in the critical regime. In the subcritical regime, since the random geometric complex is very sparse and disconnected, the vast majority of kcycles are vertexminimal—spanning \(k+2\) vertices for the Čech complex, \(2k+2\) for the Rips. In the critical regime a giant connected component emerges—see the discussion in Sect. 3.2 on percolation theory—and this significantly complicates the analysis.
To date, there has been some partial progress in studying these cases. For example, we have the following result for expectation.
Theorem 4.6
A parallel theorem in Kahle (2011) gives the same result for the Vietoris–Rips complex \(\mathcal {R}_r(n)\), but in this case one does not require the assumption that \(k \le d1\); in the critical regime, \(\beta _k\) is growing linearly for every \(k \ge 0\).
The last theorem provides us with the expected order of magnitude of the Betti numbers, but the actual constants have not yet been discovered. Nevertheless, recent work by Yogeshwaran et al. (2017) gives laws of large numbers and central limit theorems for Betti numbers of random Čech complexes in the thermodynamic limit. We state here a few of these results relevant for the Čech complex \(\mathcal {C}_r(n)\). The following law of large numbers is Theorem 4.6 in Yogeshwaran et al. (2017).
Theorem 4.7
The version of the central limit theorem proved in Yogeshwaran et al. (2017) is for an underlying uniform distribution, and for simplicity assumes that it is supported on the unit cube in \(\mathbb {R}^d\). In this case, they define \(I_d(\mathcal {P})\) as an interval in \(\mathbb {R}\) whose endpoints are the percolation radii for \(\mathcal {C}_r(n)\) and \(\mathbb {R}^d\backslash \mathcal {C}_r(n)\).
Theorem 4.8
It is mentioned in Yogeshwaran et al. (2017) that it is not clear whether the restriction to \(\lambda \not \in I_d(\mathcal {P})\) is required or just a technical artifact of the proof. For the Poisson process \(\mathcal {P}_n\) similar theorems are proved for all \(\lambda > 0\).
4.3 The supercritical regime
In the supercritical regime the correct order of magnitude of the Betti numbers is still not known, but there are bounds. In particular, we have the following for the random Vietoris–Rips complex, which is Theorem 5.1 in Kahle (2011).
Theorem 4.9
In particular, if \(\Lambda \rightarrow \infty \) (the supercritical regime) then \(\mathbb {E}\left\{ {\beta _k(n)}\right\} = o(n)\). Theorem 4.9 can be compared to Theorem 3.7 which bounds the number of connected components. As an immediate corollary of Theorem 4.9 we have the following.
Corollary 4.10
If \(\Lambda \ge c \log n\) then a.a.s. \(H_k( \mathcal {R}_r(n)) = 0\). Here c is any constant such that \(c > 1 / c_d\), where \(c_d\) is defined in Theorem 4.9.
The proof of Theorem 4.9 uses discrete Morse theory to collapse the Vietoris–Rips complex onto a homotopy equivalent CW complex with far fewer faces. Combining Theorem 4.3 with Corollary 4.10 gives the following global picture for vanishing and nonvanishing homology of the random Vietoris–Rips complex.
Theorem 4.11
 1.Ifthen a.a.s. \(H_k( \mathcal {R}_r(n)) = 0\),$$\begin{aligned} \Lambda \ll n^ {\frac{1}{2k +1}}, \end{aligned}$$
 2.ifthen a.a.s. \(H_k( \mathcal {R}_r(n)) \ne 0\),$$\begin{aligned} n^{\frac{1}{2k +1}} \ll \Lambda \le a \log n, \end{aligned}$$
 3.and ifthen a.a.s. \(H_k( \mathcal {R}_r(n)) =0\).$$\begin{aligned} \Lambda \ge b \log n \end{aligned}$$
For the Čech complex similar bounds are studied in Bobrowski and Oliveira (2017); Bobrowski and Weinberger (2017), using Morse theory for the distance function (discussed in Sect. 5). The idea there is to look for critical points of the distance function, that are responsible for changes in the kth homology. We note that the following bounds were proven for closed manifolds (compact and without a boundary), while a similar proof can be repeated for the compact and convex case. We shall discuss these bounds in detail in Sect. 6.
Theorem 4.12
Combining Theorems 4.2 and 4.12, we have the following statement for the Čech complex.
Theorem 4.13
 1.Ifthen a.a.s. \(H_k( \mathcal {C}_r(n)) = 0\),$$\begin{aligned} \Lambda \ll n^ {\frac{1}{k +1}}, \end{aligned}$$
 2.ifthen a.a.s. \(H_k( \mathcal {C}_r(n)) \ne 0\),$$\begin{aligned} n^ {\frac{1}{k +1}} \ll \Lambda \le A \log n, \end{aligned}$$
 3.and ifthen a.a.s. \(H_k( \mathcal {C}_r(n)) =0\).$$\begin{aligned} \Lambda \ge B \log n \end{aligned}$$
Theorems 4.11 and 4.13 show that the vanishing threshold radius for higher homology has the same order of magnitude as the connectivity threshold that we saw in Theorem 3.4, i.e. it occurs when the average degree is \(\Lambda \sim \log n\). Note that this is also when the union of balls \(B_{r/2}(\mathcal {P}_n)\) is known to completely cover the support of the distribution, in which case it can be shown that \(H_k(B_{r/2}(\mathcal {P}_n)) = 0\). The proof in Kahle (2011) uses this fact together with the Nerve Lemma 2.3 to prove part 3 of the Theorem.
In Sect. 6 we discuss a more refined picture of this transition. We will also see in Sect. 6 that these results can be generalized — for example, to any compact manifold, and for any probability distribution with a density function that is bounded away from zero.
5 Morse theory for the distance function
5.1 Critical points of the distance function
The classical definition of critical points in calculus is as follows. Let \(f:\mathbb {R}^d\rightarrow \mathbb {R}\) be a \(C^2\) function. A point \(c\in \mathbb {R}\) is called a critical point of f if \(\nabla f (c) =0\), and the real number f(c) is called a critical value of f. A critical point c is called nondegenerate if the Hessian matrix \(H_f(c)\) is nonsingular. In that case, the Morse index of f at c, denoted by \(\mu (c)\) is the number of negative eigenvalues of \(H_f(c)\). A \(C^2\) function f is a Morse function if all its critical points are nondegenerate, and its critical values are distinct.
Note that the distance function \(d_{\mathcal {P}}\) defined in (4) is not everywhere differentiable, therefore the definition above does not apply. However, following Gershkovich and Rubinstein (1997), one can still define a notion of nondegenerate critical points for the distance function, as well as their Morse index. Extending Morse theory to functions that are nonsmooth has been developed for a variety of applications Baryshnikov et al. (2014); Bryzgalova (1978); Gershkovich and Rubinstein (1997); Matov (1982). The class of functions studied in these papers have been the minima (or maxima) of a functional and called ‘mintype’ functions.
5.2 Morse theory
The study of homology is strongly connected to the study of critical points of real valued functions. The link between them is called Morse theory, and we shall describe it here briefly. For a deeper introduction, we refer the reader to Milnor (1963).
While classical Morse theory deals with smooth (or \(C^2\)) Morse functions on compact manifolds Milnor (1963), it has been extended to many more general situations, and the extension to “mintype” functions presented in Gershkovich and Rubinstein (1997) enables one to apply similar concepts to the distance function \(d_{\mathcal {P}}\) as well.
Let \(\mathcal {X}_n\) be the binomial process we had before. For \(0\le k\le d\), we define \(C_k(r)\) to be the number of critical points of index k of the distance function \(d_{\mathcal {X}_n}\), for which the critical value is less then or equal to r. According to Morse theory (and the Nerve Lemma 2.3), the critical points accounted for by \(C_{k}(r)\) are the ones generating the homology of \(\mathcal {C}_r(n)\).
Similarly to the study in Sect. 4, we can study the limiting behavior of the random values \(C_k(r)\) as \(n\rightarrow \infty \) and \(r\rightarrow 0\). This was studied in Bobrowski and Adler (2014). This limiting behavior is in some ways very similar to what we observed for the Betti numbers \(\beta _k(n)\). However, as opposed to homology which involves global behavior, the nature of critical points is much more local. This enables us to compute precise limits for \(C_k(r)\) even in the critical and supercritical regimes, where the analysis of the Betti numbers at this point has yet to be completed. We present here the limiting results for the expected values of \(C_k(r)\).
Theorem 5.1
 1.If \(\Lambda \rightarrow 0\) then$$\begin{aligned} \mathbb {E}\left\{ {C_k(r)}\right\} \approx \tilde{c}_k n \Lambda ^k; \end{aligned}$$
 2.If \(\Lambda = \lambda \in (0,\infty ]\) then$$\begin{aligned} \mathbb {E}\left\{ {C_k(r)}\right\} \approx \gamma _k(\lambda )\cdot n; \end{aligned}$$
The values \(\tilde{c}_k\) and \(\gamma _k(\lambda )\) are presented in Bobrowski and Adler (2014), and they depend on the density function f, d and \(\lambda \) via integration, similarly to the constants \(c_k\) in Theorem 4.1.
In the subcritical regime, one can observe that the expected value of \(C_k(r)\) is similar to the limit of \(\beta _k(n)\) and differs mostly by the index k. This is due to the fact that a critical point of index k is generated by a subset of \(k+1\) vertices (see discussion above) whereas an kcycle in the subcritical regime is generated by a subset of \(k+2\) vertices. Not surprisingly, the distribution of \(C_k(r)\) has limit theorems very similar to the ones presented in Sect. 4 for the Betti numbers (see Bobrowski and Adler (2014)).
In the critical regime we have \(C_k(r) = \Theta (n)\) for all \(0\le k \le d\), which, with Morse theory in mind, perfectly agrees with Theorem 4.7 stating that \(\beta _{k}(n) = \Theta (n)\) as well. As opposed to the Betti numbers, studying the critical points yields precise limits for the expectation as well as a central limit theorem (cf. Bobrowski and Adler (2014)). This will enable us later to get a very interesting conclusion regarding the Euler characteristic of \(\mathcal {C}_r(n)\).
In the supercritical regime, we still have the exact limits for the number of critical points. However, in this case, it will not reveal much information about \(\mathcal {C}_r(n)\), since most of the critical points accounted for by \(C_k(r)\) were formed in the critical regime (note that \(C_k(r)\) is a monotone function of r), and the number of critical points actually being formed in the supercritical regime is actually o(n). Nevertheless, in some cases (see Sect. 6), it is possible to study the behavior of critical points within the supercritical regime in a finer resolution and use that to draw conclusions about the vanishing of the different degrees of homology.
5.3 The Euler characteristic
Corollary 5.2
6 Extending to manifolds
In Sects. 3–5 the distributions studied are supported on ddimensional subsets of \(\mathbb {R}^d\). The work in Bobrowski and Mukherjee (2014) studied the same type of problems for the case where the distributions are supported on a closed mdimensional manifold embedded in \(\mathbb {R}^d\) (\(m<d\)). In Bobrowski and Weinberger (2017) the flat torus was studied as a special case of a Riemannian manifold, and this was extended later to compact (smooth) Riemannian manifolds in Bobrowski and Oliveira (2017). In this section we will limit the discussion to the Čech complex, although some of the results (in particular the behavior in the subcritical and critical regimes) could be similarly generalized.
6.1 Closed manifolds embedded in \(\mathbb {R}^d\)
The exact setup studied was as follows. Let \(M\subset \mathbb {R}^d\) be a mdimensional smooth closed manifold (compact and without a boundary). Let \(f:M\rightarrow \mathbb {R}\) be a probability density function on M. Let \(\mathcal {X}_n = \{X_1,\ldots , X_n\}\) be a set of \(\mathrm {i.i.d.}\) points generated by f, and let \(\mathcal {C}_r(n)\) be the Čech complex generated by these points (using ddimensional balls). The results in this case turn out to be very similar to the ones we described earlier, even though the proofs require different analysis tools. In the following we briefly review the results in Bobrowski and Mukherjee (2014) and highlight the main difference from the results in \(\mathbb {R}^d\).
The main difference in studying manifolds shows up when we study the vanishing of the homology. When studying compact and convex bodies, Theorem 4.13 states that homology completely vanishes when \(\Lambda \sim \log n\) (or \(r \sim \left( \frac{\log n}{n}\right) ^{1/d}\)). Sampling from a manifold, by the Nerve Lemma, we expect that upon coverage the homology of the complex \(\mathcal {C}_r(n)\) will not vanish but rather become equal to the homology of M. This result is stated in the following theorem.
Theorem 6.1
We note that while the second part of this theorem did not appear explicitly in Bobrowski and Mukherjee (2014), it is a direct consequence of the calculations done there in addition to the Morse theoretical arguments made in Bobrowski and Weinberger (2017) (discussed later). Also note that the vanishing radius for \(H_k\) (\(k\ge 1\)) is twice the radius of connectivity in the same setup (an analogous result of Theorem 3.4 was proved for the flat torus in Penrose (2003), and can be extended to any compact embedded or Riemannian manifold using the techniques in Bobrowski and Mukherjee (2014); Bobrowski and Oliveira (2017)). This phenomenon has a non formal, yet convincing, explanation. In Penrose (2003) (Theorem 13.17) it is shown that at the edge of connectivity the graph G(n, r) consists roughly of a giant component and some isolated vertices. For a vertex to be isolated, a ball of radius r around it has to be vacant (i.e. with no other points in \(\mathcal {X}_n\) inside it). To get all the higher homology groups correctly, we need to guarantee that the balls of radius r / 2 (the ones used to construct the Čech complex) cover the support. Now, the support is covered if and only if there is no vacant ball of radius r / 2. Thus, it seems harder to reach coverage than connectivity, and the vacancy radii involved have the same ratio as the thresholds we presented.
The statement in Theorem 6.1 has an important consequence to problems in manifold learning, since it shows that by studying Čech complexes we can recover the homology of an unknown manifold M from a finite (yet probably large) number of random samples. The analysis of this type of “topological manifold learning” was established by the seminal work in Niyogi et al. (2008) and (2011), and Theorem 6.1 can be viewed as an asymptotic and extended version of the main results there. Considering asymptotic behavior has the advantage of covering a more general class of distributions and using fewer assumptions.
6.2 Riemannian manifolds and homological connectivity
The work in Bobrowski and Oliveira (2017) and Bobrowski and Weinberger (2017) studied a similar case to the previous one, only that now the random point process is generated on a ddimensional Riemmanian manifold (M, g). The main difference in this setup, is that now the balls used to create the geometric complexes, are ddimensional intrinsic balls on the manifold (i.e. using the Riemannian rather than the Euclidean metric). As before, most of the statements we had for random geometric complexes in Euclidean spaces, can be extended to the Riemannian setting. In this section we focus on one particular aspect that has been further studied in the case of compact Riemannian manifolds. In the following we will limit ourself to uniform distributions on manifolds with a unit volume (in which case \(f \equiv 1\)).
By ‘homological connectivity’ we refer to the phenomenon described above where the kth homology of the Čech complex becomes isomorphic to that of the underlying manifold (i.e. \(H_k(\mathcal {C}_r(n))\cong H_k(M)\)). We note that this term was coined by Linial and Meshulam in (2006). The result in Theorem 6.1 (which could be extended to compact Reiamannian manifolds) states that for all \(k\ge 1\) homological connectivity for \(H_k\) occurs around \(\Lambda = (2^d/\omega _d) \log n\). Note, however, that this result does not differentiate between the different homology groups. Since our previous study shows that cycles in different dimensions are formed by different type of structures, and occur at different radii, we also expect to observe differences in the homological connectivity thresholds for different dimensions k.
The work in Bobrowski and Weinberger (2017) revisited the study of critical points for the distance function for the case when M is the flat torus (i.e. \(\mathbb {T}^d = \mathbb {R}^d / \mathbb {Z}^d\) which can be viewed as the unit box \([0,1]^d\) with the metric \(\rho (x,y) = \min _{\Delta \in \mathbb {Z}^d} \left\ xy+\Delta \right\ \)). By providing more details estimates to the number of critical points, the following statement was proved.
Proposition 6.2
To get the upper bound, we denote by \(\hat{C}_k(r)\) the number of critical points whose critical value is bigger than r. Then \(\beta _k(n) \le \beta _k(M) +\hat{C}_{k+1}(r)\) since by Morse theory all the cycles in \(H_k(\mathcal {C}_r(n))\) that do not belong to \(H_k(M)\) are to be terminated by some critical point of index \(k+1\). For the lower bound, we look for critical points of index k with a special local behavior that guarantees to generate a new kcycle (See Bobrowski and Weinberger (2017) for details). The last inequality then leads to the following result.
Theorem 6.3
Note that: (a) This statement is about isomorphism of the homology groups, which is stronger than just the equality of the Betti numbers; (b) There is a gap in this description of the phase transition, as the two thresholds differ a \(\log \log n\) factor. In Bobrowski and Oliveira (2017) these results were extended from the flat torus to any compact smooth ddimensional Riemannian manifold. However, it is not clear how this result generalizes to spaces that have boundaries (as the ones in Sect. 4.3).
Finally, we note that we believe the following conjecture to be the most accurate description of the phase transition for homological connectivity.
Conjecture 6.4
The reason why this conjecture should be true is that the same phase transition can be shown to describe the vanishing of isolated kfaces (ksimplexes that do not have any \((k+1)\)coface). In all other random simplicial complexes studied in the past it was shown that these isolated faces generate the last cycles that prevent homology from converging. Proving this conjectures, however, remains as future work.
7 Stationary point processes
The results we presented so far in this survey describe the behavior of geometric complexes constructed from either the binomial process \(\mathcal {X}_n\) or the Poisson process \(\mathcal {P}_n\). Both models exhibit a strong level of independence which plays a significant role in the proofs. For the binomial process \(\mathcal {X}_n\) the number of points is fixed, while the locations of the points are independent. For the Poisson process \(\mathcal {P}_n\) the amount of points in different regions are independent, and given the number of points in a region their locations are independent.
Recent work by Yogeshwaran and Adler (2015) extends some of the results presented in this survey to a more general class of spatial point processes allowing certain attractive and repulsive point processes, as well as stationary determinantal processes. In this section we wish to briefly review their results.
The critical (thermodynamic) regime is when \(r = \lambda \in (0,\infty )\). In this case, Yogeshwaran et al. (2015) show that \(\mathbb {E}\left\{ {\beta _k^\Phi (n)}\right\} = \Theta (n)\) and provide a limit for the Euler characteristic similarly to Corollary 5.2.
Finally, in the supercritical regime (\(r\rightarrow \infty \)) Yogeshwaran et al. (2015) discuss the connectivity regime, which is when \(r^d = \Theta (\log n)\). Similarly to Theorem 4.13 they show that there exists a constant c such that if \(r \ge c\left( \frac{1}{\log n}\right) ^{1/d}\) then \(\mathcal {C}_r(\Phi _n)\) is a.a.s contractible.
In addition to the Betti numbers of the Čech complex, they also provide equivalent results for the VietorisRips complexes \(\mathcal {R}_r(\Phi _n)\) and for the critical point counts \(c_k\) for the distance function \(d_{\Phi _n}\). In Yogeshwaran et al. (2017) these theorems are extended in some cases, to laws of large numbers and central limit theorems.
8 Extreme value analysis of random geometric complexes
The results in the supercritical regime (\(\Lambda \rightarrow \infty \)) that we presented so far, assumed that the point process is generated by a distribution with a bounded support (see e.g. Theorems 3.4, 4.9, 4.12). As the result in Theorem 3.5 suggests, the limiting behavior can be significantly different once we generate the point process by a distribution with an unbounded support (e.g. the Gaussian distribution). The work in Adler et al. (2014) and Owada and Adler (2015) studied the distribution of the Betti numbers in these cases.
The work in Owada and Adler (2015) studies this phenomena in detail, discussing the differences between light and heavytailed distributions, and proving that there is a limiting Poisson law that describes the spatial distribution of cycles appearing in each annulus.
9 Persistent homology
Persistent homology is one of the most heavily used tools in applied topology, or TDA (cf. Carlsson 2009; Ghrist 2008). However, very little is known about its probabilistic properties. Briefly, the persistent homology of a Čech or a Rips complex tracks the evolution of the homology of the complex as the radius r changes from zero to infinity. In this section we will review some recent work related to the persistent homology of random geometric complexes (Bobrowski et al. 2015; Duy et al. 2016).
9.1 Limit theorems for persistence diagrams
Denote by \(\xi _k\) the persistence diagram for \({{\mathrm{PH}}}_k\). Clearly, \(\xi _k \subset \Delta \), where \(\Delta := \{ (x,y) : 0 \le x< y < \infty \}\), since death always occurs after birth (see Fig. 6).
In Duy et al. (2016), the Čech and the Rips complex were considered, taken over stationary point processes \(\Phi \) (as discussed in Sect. 7). In this case, taking \(\xi _{k,n}\) to be the kth persistence diagram of \(\Phi _n\), then \(\xi _{k,n}\) is a random point process, or random Radon measure, in \(\mathbb {R}^2\). One of the main theorems in Duy et al. (2016) states that as \(n\rightarrow \infty \) this measure has a nonrandom limit \(\nu _k\). In particular,
Theorem 9.1
9.2 Maximal cycles in persistent homology

This persistence measure is scale invariant, i.e. the persistence of cycles for n points chosen uniformly in a cube \([0,1]^d\) will have the same distribution as for n points chosen uniformly in a cube \([0, \lambda ]^d\) for any \(\lambda > 0\).

In a random geometric setting, one issue with measuring persistence by \(\gamma _{death}\gamma _{birth}\) is that both terms are tending to zero as the number of vertices goes to infinity, and \(\gamma _{birth} \ll \gamma _{death}\). For the prominent cycles, \(\gamma _{birth}\rightarrow 0\) much faster than \(\gamma _{death}\), and therefore if we measure persistence as \(\gamma _{death}  \gamma _{birth}\), then \(\gamma _{birth}\) will just be a small error term and it will be hard to differentiate between the birth and death times. The multiplicative way of measuring persistence is more informative.
 Both Čech complexes \(\mathcal {C}_r(n)\) and Vietoris–Rips complexes \(\mathcal {R}_r(n)\) are central to the theory of persistent homology, and it is important to be able to compare them. The standard way of relating them is via the inclusion maps(In general \(\mathcal {C}_r(n) \hookrightarrow \mathcal {R}_r(n) \hookrightarrow \mathcal {C}_{\alpha r}(n)\) for Čech and Vietoris–Rips complexes in Euclidean space \(\mathbb {R}^d\), as long as \(\alpha \ge \sqrt{ 2d / (d+1)}\), as shown in Theorem 2.5 of de Silva and Ghrist (2007). So one may relate persistent homology between the two types of complexes. Because this relationship is naturally multiplicative in r, our results are stated in a way that holds for both types of complexes.$$\begin{aligned} \cdots \hookrightarrow \mathcal {C}_r(n) \hookrightarrow \mathcal {R}_r(n) \hookrightarrow \mathcal {C}_{\sqrt{2}r}(n) \hookrightarrow \mathcal {R}_{\sqrt{2}r}(n) \hookrightarrow \cdots \end{aligned}$$
Theorem 9.2
Persistent homology is becoming a very popular and powerful data analysis tool. Studying this type of extremal behavior for persistent homology can be later used to provide a statistical analysis to persistent homology. For example, suppose that the data are sampled from a distribution supported on a manifold M with non trivial homology that we wish to recover. Knowing the distribution of \(\Pi _k\) for convex bodies (where homology is trivial), would enable us to develop statistical tests to differentiate between the signal (real cycles of M) and noise (artifacts of the sampling mechanism) in this type of data analysis problem. Persistent homology in random contexts was studied earlier by Bubenik and Kim in (2007).
10 Open problems/future directions
 Sharper results in the thermodynamic limit. Proving strong results for expectation of Betti numbers in the critical regime remains a challenging problem. The best result so far is thatwhere \(C > 0\) is some constant which depends on the underlying distribution on \(\mathbb {R}^d\) and the degree k, see Yogeshwaran et al. (2017). It would be a breakthrough to write an explicit formula for C and we expect that the results would find applications in TDA.$$\begin{aligned} \frac{\mathbb {E}[\beta _k(n)] }{n} \rightarrow C, \end{aligned}$$

Connections between the various models. Is there a model for random geometric complex which approximates the sublevel sets of the Gaussian random field? See Adler and Taylor (2007) and Adler and Taylor (2011) for introduction and overview of Gaussian random fields and their topological properties.

Torsion. All of the results in this survey for homology of random geometric complexes do not depend on the choice of coefficients. In dimensions \(d \ge 4\) and higher, these complexes will likely have torsion in integer homology. What can be said about the limiting distribution of this torsion group?

Higherdimensional percolation theory. All of the random geometric complexes discussed here are analogues of random geometric graphs where the number of vertices n is finite and \(n \rightarrow \infty \). Percolation theory is of a somewhat different flavor—one considers an infinite random graph, by taking a random subgraph of a lattice, and then analyzes largescale structure such as whether or not an infinite connected component appears. Analogous lattice models with higherdimensional cells have been studied, for example “plaquette percolation” (Aizenman et al. 1983; Grimmett and Holroyd 2010). So rather than study homologyvanishing thresholds for finite random geometric complexes with size tending to infinity, one might study the appearance of “infinite” cycles in lattice models. So far, this seems to be relatively unexplored.
Notes
Compliance with ethical standards
Conflict of interest
On behalf of all authors, the corresponding author states that there is no conflict of interest.
References
 Adler, R.J., Bobrowski, O., Weinberger, S.: Crackle: the homology of noise. Discret. Comput. Geom. 52(4), 680–704 (2014)MathSciNetCrossRefMATHGoogle Scholar
 Adler, R.J., Taylor, J.E.: Random fields and geometry, Springer Monographs in Mathematics. Springer, New York (2007)Google Scholar
 Adler, R.J., Taylor, J.E.: Topological complexity of smooth random functions. In: Lecture Notes in Mathematics, vol. 2019. Springer, Heidelberg (2011)Google Scholar
 Aizenman, M., Chayes, J.T., Chayes, L., Fröhlich, J., Russo L.: On a sharp transition from area law to perimeter law in a system of random surfaces. Comm. Math. Phys. 92(1), 19–69 (1983)Google Scholar
 Alon, N., Spencer J.H.: The probabilistic method. WileyInterscience Series in Discrete Mathematics and Optimization, 3rd ed. Wiley, Hoboken (2008) (with an appendix on the life and work of Paul Erdős).Google Scholar
 Babson, E., Hoffman, C., Kahle, M.: The fundamental group of random 2complexes. J. Amer. Math. Soc. 24(1), 1–28 (2011)Google Scholar
 Balakrishnan, S., Rinaldo, A., Sheehy, D., Singh, A., Wasserman, L.A.: Minimax rates for homology inference. AISTATS 9, 206–207 (2012)Google Scholar
 Baryshnikov, Yuliy, Bubenik, Peter, Kahle, Matthew: Mintype Morse theory for configuration spaces of hard spheres. Int. Math. Res. Not. IMRN 9, 2577–2592 (2014). (MR 3207377)MathSciNetCrossRefMATHGoogle Scholar
 Bobrowski, Omer, Adler, R.J.: Distance functions, critical points, and the topology of random čech complexes. Homol. Homotopy Appl. 16(2), 311–344 (2014). (en)CrossRefMATHGoogle Scholar
 Bobrowski, O., Kahle, M., Skraba, P.: Maximally persistent cycles in random geometric complexes. Ann. Appl. Prob. 27(4), 2032–2060 (2017)MathSciNetCrossRefMATHGoogle Scholar
 Bobrowski, O., Mukherjee, S.: The topology of probability distributions on manifolds. Probab. Theory Relat Fields 161(3–4), 651–686 (2014)MathSciNetMATHGoogle Scholar
 Bobrowski, O., Mukherjee, S., Taylor, J.E.: Topological consistency via kernel estimation. Bernoulli 23(1), 288–328 (2017)MathSciNetCrossRefMATHGoogle Scholar
 Bobrowski, O., Oliveira, G.: Random čech complexes on riemannian manifolds (2017) arXiv:1704.07204 (arXiv preprint)
 Bobrowski, O., Weinberger, S.: On the vanishing of homology in random Čech complexes. Random Struct. Algorithm 51(1), 14–51 (2017)CrossRefMATHGoogle Scholar
 Bollobás, B.: Random graphs, 2nd edn. Cambridge studies in advanced mathematics, vol. 73. Cambridge University Press, Cambridge (2001)Google Scholar
 Bollobás, B., Riordan, O.: Percolation. Cambridge University Press, New York (2006)Google Scholar
 Borsuk, K.: On the imbedding of systems of compacta in simplicial complexes. Fund. Math. 35, 217–234 (1948)Google Scholar
 Bryzgalova, L.N.: The maximum functions of a family of functions that depend on parameters. Funktsional. Anal. i Prilozhen. 12(1), 66–67 (1978)Google Scholar
 Bubenik, P., Kim, P.T.: A statistical approach to persistent homology. Homol. Homotopy Appl. 9(2), 337–362 (2007)Google Scholar
 Carlsson, G.: Topology and data. Bull. Amer. Math. Soc. (N.S.) 46(2), 255–308 (2009)Google Scholar
 De Silva, V., Ghrist, R.: Coverage in sensor networks via persistent homology. Algebr. Geom. Topol. 7, 339–358 (2007)Google Scholar
 Decreusefond, L., Ferraz, E., Randriambololona, H., Vergneet al.: Simplicial homology of random configurations. Adv. Appl. Probab. 46(2), 325–347 (2014)Google Scholar
 Duy, T.K., Hiraoka, Y., Shirai, T.: Limit theorems for persistence diagrams (2016) arXiv:1612.08371 [math]
 Fasy, B.T., Lecci, F., Rinaldo, A., Wasserman, L., Balakrishnan, S., Singh, A. et al.: Confidence sets for persistence diagrams. Ann. Stat. 42(6), 2301–2339 (2014)Google Scholar
 Ganesan, G.: Size of the giant component in a random geometric graph. Ann. Inst. Henri Poincaré Probab. Stat. 49(4), 1130–1140 (2013)Google Scholar
 Gershkovich, V., Rubinstein, H.: Morse theory for Mintype functions. Asian J. Math. 1(4), 696–715 (1997)Google Scholar
 Ghrist, R.: Barcodes: the persistent topology of data. Bull. Amer. Math. Soc. (N.S.) 45(1), 61–75 (2008)Google Scholar
 Gilbert, E.N.: Random plane networks. J. Soc. Indust. Appl. Math. 9, 533–543 (1961)Google Scholar
 Grimmett, G.: Percolation, 2nd ed., Grundlehren der Mathematischen Wissenschaften [Fundamental Principles of Mathematical Sciences], vol. 321. Springer, Berlin (1999)Google Scholar
 Grimmett, G.R., Holroyd, A.E.: Plaquettes, spheres, and entanglement. Electron. J. Probab. 15, 1415–1428 (2010)Google Scholar
 Hatcher, A.: Algebraic topology. Cambridge University Press, Cambridge (2002)Google Scholar
 Herbert, E., Kirkpatrick, D.G., Seidel, R.: On the shape of a set of points in the plane. IEEE Trans. Inform. Theory 29(4), 551–559 (1983)Google Scholar
 Janson, S., Łuczak, T., Rucinski, A.: Random graphs. WileyInterscience Series in Discrete Mathematics and Optimization. Wiley, New York (2000)Google Scholar
 Kahle, M.: Random geometric complexes. Discrete Comput. Geom. 45(3), 553–573 (2011)Google Scholar
 Kahle, M.: Sharp vanishing thresholds for cohomology of random flag complexes. Ann. Math. (2) 179(3), 1085–1107 (2014)Google Scholar
 Kahle, M.: Topology of random simplicial complexes: a survey. AMS Contemp. Math 620, 201–222 (2014)MathSciNetCrossRefMATHGoogle Scholar
 Kahle, M., Meckes, E.: Limit theorems for Betti numbers of random simplicial complexes. Homol. Homotopy Appl. 15(1), 343–374 (2013)Google Scholar
 Kahle, M., Meckes, E.: Erratum: Limit theorems for betti numbers of random simplicial complexes (2015). arXiv:1501.03759 (arXiv preprint)
 Linial, N., Meshulam, R.: Homological connectivity of random 2complexes. Combinatorica 26(4), 475–487 (2006)Google Scholar
 Matov, V.I.: Topological classification of the germs of functions of the maximum and minimax of families of functions in general position. Uspekhi Mat. Nauk 37(4)(226), 167–168 (1982)Google Scholar
 Meester, R., Roy R.: Continuum percolation, Cambridge Tracts in Mathematics, vol. 119. Cambridge University Press, Cambridge (1996)Google Scholar
 Meshulam, R., Wallach, N.: Homological connectivity of random \(k\)dimensional complexes. Random Struct. Algorithms 34(3), 408–417 (2009)Google Scholar
 Milnor, J.: Morse theory. In: Based on lecture notes by M. Spivak and R. Wells. Annals of Mathematics Studies, no. 51. Princeton University Press, Princeton (1963)Google Scholar
 Niyogi, P., Smale, S., Weinberger, S.: Finding the homology of submanifolds with high confidence from random samples. Discrete Comput. Geom. 39(1–3), 419–441 (2008)Google Scholar
 Munkres, J.R.: Elements of algebraic topology. AddisonWesley Publishing Company, Menlo Park (1984)Google Scholar
 Niyogi, P., Smale, S., Weinberger, S.: A topological view of unsupervised learning from noisy data. SIAM J. Comput. 40(3), 646–663 (2011)MathSciNetCrossRefMATHGoogle Scholar
 Owada, T., Adler, R.J.: Limit theorems for point processes under geometric constraints (and topological crackle). Ann. Prob. 45(3), 2004–2055 (2017)MathSciNetCrossRefMATHGoogle Scholar
 Penrose, M.: Random geometric graphs, Oxford Studies in Probability, vol. 5. Oxford University Press, Oxford (2003)Google Scholar
 Robins, V.: Betti number signatures of homogeneous poisson point processes. Phys. Rev. E 74(6), 061107 (2006)Google Scholar
 The GUDHI Project: GUDHI user and reference manual. GUDHI Editorial Board (2015)Google Scholar
 Yogeshwaran, D., Adler, R.J., et al.: On the topology of random complexes built over stationary point processes. Ann. Appl. Prob. 25(6), 3338–3380 (2015)MathSciNetCrossRefMATHGoogle Scholar
 Yogeshwaran, D., Subag, E., Adler, R.J.: Random geometric complexes in the thermodynamic regime. Prob. Theory Relat. Fields 167(1–2), 107–142 (2017)Google Scholar