Abstract
Recent research on distributed storage systems (DSSs) has revealed interesting connections between matroid theory and locally repairable codes (LRCs). The goal of this chapter is to introduce the reader to matroids and polymatroids, and illustrate their relation to distributed storage systems. While many of the results are rather technical in nature, effort is made to increase accessibility via simple examples. The chapter embeds all the essential features of LRCs, namely locality, availability, and hierarchy alongside with related generalised Singleton bounds.
The authors gratefully acknowledge the financial support from the Academy of Finland (grants #276031 and #303819), as well as the support from the COST Action IC1104.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
To this end, we need to make specific assumptions on the locality and availability of the LRC [9, Theorem 21], which also implies restrictions on the query structure of the batch code.
- 2.
We remark that if one takes into account queueing theoretic aspects, then data allocation may become less trivial (some nodes may be empty). Such aspects are discussed, especially in a wireless setting, in chapters “Opportunistic Network Coding” and “Coded Random Access”. However, these considerations are out of the scope of this chapter.
- 3.
See appendix for the definition of graphical matroids.
References
S. Ball, On sets of vectors of a finite vector space in which every subset of basis size is a basis. J. Eur. Math. Soc. 14, 733–748 (2012)
A. Barg, I. Tamo, S. Vlăduţ, Locally recoverable codes on algebraic curves (2016), arXiv:1603.08876
A. Barg, K. Haymaker, E. Howe, G. Matthews, A. Várilly-Alvarado, Locally recoverable codes from algebraic curves and surfaces (2017), arXiv:1701.05212
J.E. Bonin, A. de Mier, The lattice of cyclic flats of a matroid. Ann. Comb. 12, 155–170 (2008)
T. Britz, C.G. Rutherford, Covering radii are not matroid invariants. Discret. Math. 296, 117–120 (2005)
V. Cadambe, A. Mazumdar, An upper bound on the size of locally recoverable codes, in International Symposium on Network Coding (2013), pp. 1–5
A. Dimakis, P.B. Godfrey, Y. Wu, M.J. Wainwright, K. Ramchandran, Network coding for distributed storage systems. IEEE Trans. Inf. Theory 56(9), 4539–4551 (2010)
T. Ernvall, T. Westerbäck, R. Freij-Hollanti, C. Hollanti, Constructions and properties of linear locally repairable codes. IEEE Trans. Inf. Theory 62, 5296–5315 (2016)
A. Fazeli, A. Vardy, E. Yaakobi, PIR with low storage overhead: coding instead of replication (2015), arXiv:1505.06241
R. Freij-Hollanti, T. Westerbäck, C. Hollanti, Locally repairable codes with availability and hierarchy: matroid theory via examples, in International Zürich Seminar on Communications (IEEE/ETH, 2016), pp. 45–49
S. Fujishige, Polymatroidal dependence structure of a set of random variables. Inf. Control 39(1), 55–72 (1978)
J. Geelen, B. Gerards, G. Whittle, Solving Rota’s conjecture. Not. Am. Math. Soc. 61, 736–743 (2014)
P. Gopalan, C. Huang, H. Simitci, S. Yekhanin, On the locality of codeword symbols. IEEE Trans. Inf. Theory 58(11), 6925–6934 (2012)
C. Huang, M. Chen, J. Lin, Pyramid codes: flexible schemes to trade space for access efficiency in reliable data storage systems, in International Symposium on Network Computation and Applications (IEEE, 2007), pp. 79–86
A. Ingleton, R. Main, Non-algebraic matroids exist. Bull. Lond. Math. Soc. 7, 144–146 (1975)
Y. Ishai, E. Kushilevitz, R. Ostrovsky, A. Sahai, Batch codes and their applications, in The 36th ACM Symposium on Theory of Computing (STOC) (2004)
R. Jurrius, R. Pellikaan, Truncation formulas for invariant polynomials of matroids and geometric lattices. Math. Comput. Sci. 6, 121–133 (2012)
G.M. Kamath, N. Prakash, V. Lalitha, P.V. Kumar, Codes with local regeneration and erasure correction. IEEE Trans. Inf. Theory 60(8), 4637–4660 (2014)
D. Knuth, The asymmetric number of geometries. J. Comb. Theory Ser. A 16, 398–400 (1974)
B. Lindström, On the vector representations of induced matroids. Bull. Lond. Math. Soc. 5, 85–90 (1973)
B. Lindström, On \(p\)-polynomial representations of projective geometries in algebraic combinatorial geometries. Math. Scand. 63, 36–42 (1988)
B. Lindström, On algebraic matroids. Discret. Math. 111, 357–359 (1993)
H. Lipmaa, V. Skachek, Linear batch codes, in The 4th International Castle Meeting on Coding Theory and Applications (4ICMCTA) (2015)
D. Mayhew, M. Newman, D. Welsh, G. Whittle, On the asymptotic proportion of connected matroids. Eur. J. Comb. 32(6), 882–890 (2011)
D. Mayhew, M. Newman, G. Whittle, Yes, the missing axiom of matroid theory is lost forever (2015), arXiv:1412.8399
P. Nelson, Almost all matroids are non-representable, arXiv:1605.04288
D. Papailiopoulos, A. Dimakis, Locally repairable codes, in International Symposium on Information Theory (IEEE, 2012), pp. 2771–2775
R. Pendavingh, J. van der Pol, On the number of matroids compared to the number of sparse paving matroids. Electron. J. Comb. 22 (2015), 17pp
A. Pöllänen, T. Westerbäck, R. Freij-Hollanti, C. Hollanti, Improved singleton-type bounds for locally repairable codes, in International Symposium on Information Theory (IEEE, 2016), pp. 1586–1590
N. Prakash, G.M. Kamath, V. Lalitha, P.V. Kumar, Optimal linear codes with a local-error-correction property, in International Symposium on Information Theory (IEEE, 2012), pp. 2776–2780
A.S. Rawat, D. Papailiopoulos, A. Dimakis, S. Vishwanath, Locality and availability in distributed storage (2014), arXiv:1402.2011v1
B. Sasidharan, G.K. Agarwal, P.V. Kumar, Codes with hierarchical locality (2015), arXiv:1501.06683v1
N. Silberstein, A. Zeh, Optimal binary locally repairable codes via anticodes (2015), arXiv:1501.07114v1
N. Silberstein, A.S. Rawat, O. Koyluoglu, S. Vishwanath, Optimal locally repairable codes via rank-metric codes, in International Symposium on Information Theory (IEEE, 2013), pp. 1819–1823
J.A. Sims, Some problems in matroid theory. Ph.D. thesis, Oxford University (1980)
R.C. Singleton, Maximum distance q-nary codes. IEEE Trans. Inf. Theory 10(2), 116–118 (1964)
A. Skorobogatov, Linear codes, strata of Grassmannians, and the problems of Segre, in International Workshop on Coding Theory and Algebraic Geometry (1992), pp. 210–223
W. Song, C. Yuen, S.H. Dau, T.J. Li, Optimal locally repairable linear codes. IEEE J. Sel. Areas Commun. 32(5), 1019–1036 (2014)
I. Tamo, A. Barg, A family of optimal locally recoverable codes. IEEE Trans. Inf. Theory 60(8), 4661–4676 (2014)
I. Tamo, D. Papailiopoulos, A. Dimakis, Optimal locally repairable codes and connections to matroid theory, in International Symposium on Information Theory (IEEE, 2013), pp. 1814–1818
I. Tamo, A. Barg, A. Frolov, Bounds on the parameters of locally recoverable codes. IEEE Trans. Inf. Theory 62(6), 3070–3083 (2016)
W. Tutte, A homotopy theorem for matroids, I, II. Trans. Am. Math. Soc. 88, 148–178 (1958)
P. Vámos, The missing axiom of matroid theory is lost forever. J. Lond. Math. Soc. 18, 403–408 (1978)
A. Wang, Z. Zhang, Repair locality with multiple erasure tolerance. IEEE Trans. Inf. Theory 60(11), 6979–6987 (2014)
A. Wang, Z. Zhang, An integer programming-based bound for locally repairable codes. IEEE Trans. Inf. Theory 61(10), 5280–5294 (2015)
T. Westerbäck, R. Freij, C. Hollanti, Applications of polymatroid theory to distributed storage systems, in Allerton Conference on Communication, Control, and Computing (2015), pp. 231–237
T. Westerbäck, R. Freij-Hollanti, T. Ernvall, C. Hollanti, On the combinatorics of locally repairable codes via matroid theory. IEEE Trans. Inf. Theory 62, 5296–5315 (2016)
H. Whitney, On the abstract properties of linear dependence. Am. J. Math. 57, 509–533 (1935)
H. Zhang, V. Skachek, Bounds for batch codes with restricted query size, in IEEE International Symposium on Information Theory (ISIT) (2016)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Appendix: More About Matroid Theory
Appendix: More About Matroid Theory
A matroid realisation of an \(\mathbb {F}\)-linear matroid M has two geometric interpretations. Firstly, we may think of a matrix representing M as a collection of n column vectors in \(\mathbb {F}^k\). As the matroid structure is invariant under row operations, or in other words under change of basis in \(\mathbb {F}^k\), we tend to think of M as a configuration of n points in abstract projective k-space.
The second interpretation comes from studying the row space of the matrix, as an embedding of \(\mathbb {F}^k\) into \(\mathbb {F}^n\). Row operations correspond to a change of basis in \(\mathbb {F}^k\), and hence every matroid representation can be thought of as a k-dimensional subspace of \(\mathbb {F}^n\). In other words, a matroid representation is a point in the Grassmannian \(\mathrm {Gr}(n,k; \mathbb {F})\), and \(\mathrm {Gr}(n,k; \mathbb {F})\) has a stratification as a union of realisation spaces R(M), where M ranges over all \(\mathbb {F}\)-representable matroids of size n and rank k. This perspective allows a matroidal perspective also on the subspace codes discussed in chapters “Codes Endowed with the Rank Metric”–“Generalizing Subspace Codes to Flag Codes Using Group Actions”, where the codewords themselves are matroid representations. However, so far this perspective has not brought any new insights to the topic.
Another instance where matroids appear naturally in mathematics is graph theory. Let \(\varGamma \) be a finite graph with edge set E. We obtain a matroid \(M_\varGamma =(\mathscr {I}, E)\), where \(I\subseteq E\) is independent if the subgraph \(\varGamma _I\subseteq \varGamma \) induced on \(I\subseteq E\) is a forest, i.e., has no cycles. A matroid that is isomorphic to \(M_\varGamma \) for some graph \(\varGamma \) is said to be a graphical matroid.
Example 15
The matrix G and the graph \(\varGamma \) given below generate the same matroid, regardless of the field over which G is defined.
Some examples of independent sets in G and \(\varGamma \) are \(\{3,4,6\}, \{1,2,3,5\}, \{2,3,4,6\}\). The set \(X = \{5,6,7\}\) is dependent in \(M_\varGamma \) as these edges form a cycle, and it is dependent in \(M_G\) as the submatrix
has linearly dependent columns.
Indeed, graphical matroids are representable over any field \(\mathbb {F}\). To see this, for a graph \(\varGamma \) with edge set E, we will construct a matrix \(G(\varGamma )\) over \(\mathbb {F}\) with column set E as follows. Choose an arbitrary spanning forest \(T\subseteq E\) in \(\varGamma \), and index the rows of \(G(\varGamma )\) by T. Thus \(G(\varGamma )\) is a \(T\times E\)-matrix. Choose an arbitrary orientation for each edge in the graph. For \(e\in T\subseteq E\) and \(uv\in E\), the entry in position \((e,\{uv\})\) is 1 (respectively \(-1\)) if e is traversed forward (respectively backward) in the unique path from u to v in the spanning forest T. In particular, the submatrix \(G(\varGamma )(T)\) is an identity matrix. It is straightforward to check that the independent sets in \(G(\varGamma )\) are exactly the noncyclic sets in \(\varGamma \).
Example 16
The matrix G in Example 15 is \(G(\varGamma )\) where \(\varGamma \) is the graph in the same example, and the spanning forest T is chosen to be \(\{1,2,3,4\}\).
The restriction to \(X\subseteq E\) of a graphical matroid \(M_\varGamma \) is obtained by the subgraph of \(\varGamma \) containing precisely the edges in X.
A third example of matroids occurring naturally in mathematics are algebraic matroids [22]. These are associated to field extensions \(\mathbb {F}:K\) together with a finite point sets \(E\subseteq K\), where the independent sets are those \(I\subseteq E\) that are algebraically independent over \(\mathbb {F}\). In particular, elements that are algebraic over \(\mathbb {F}\) have rank zero, and in general \(\rho (I)\) is the transcendence degree of the field extension \(\mathbb {F}(I):\mathbb {F}\).
It is rather easy to see that every \(\mathbb {F}\)-linear matroid is also algebraic over \(\mathbb {F}\). Indeed, let \(X_1,\ldots , X_k\) be indeterminates, and let
be given by \(\mathrm {e}_i\mapsto X_i\) for \(i=1,\ldots ,k\). Then \(J\subseteq E\) is linearly independent over \(\mathbb {F}\) if and only if \(\{g(j):j\in J\}\) is algebraically independent over \(\mathbb {F}\). Over fields of characteristic zero the converse also holds, so that all algebraic matroids have a linear representation. However, in positive characteristic there exist algebraic matroids that are not linearly representable. For example, the non-Pappus matroid of Example 4 is algebraically representable over \(\mathbb {F}_4\), although it is not linearly representable over any field [21]. The smallest example of a matroid that is not algebraic over any field is the Vamos matroid, in Fig. 2 [15].
Definition 21
The dual of \(M = (\rho ,E)\) is \(M^*=(\rho ^*,E)\), where
The definition of the dual matroid lies in the heart of matroid theory, and has profound interpretations. In geometric terms, let M be represented by a k-dimensional subspace V of \(\mathbb {F}^n\). Then, the matroid dual \(M^*\) is represented by the orthogonal complement \(V^\perp \subseteq \mathbb {F}^n\). Surprisingly and seemingly unrelatedly, if \(\varGamma \) is a planar graph and \(M=M_\varGamma \) is a graphical matroid, then \(M^*=M_{\bar{\varGamma }}\), where \(\bar{\varGamma }\) is the planar dual of \(\varGamma \). Moreover, the dual \(M_\varGamma ^*\) of a graphical matroid is graphical if and only if \(\varGamma \) is planar.
Definition 22
The contraction of \(X\subseteq E\) in the matroid \(M = (\rho ,E)\) is \(M/X=(\rho ',M\setminus X)\), where \(\rho '(Y)=\rho (Y\cup X)-\rho (X)\).
Contraction is the dual operation of deletion, in the sense that \(M/X=(M^*_{|E\setminus X})^*\). The terminology comes from graphical matroids, where contraction of the edge \(e\in E\) corresponds to deleting e and identifying its endpoints in the graph. Notice that it follows directly from submodularity of the rank function that \(\rho _{M/X}(Y)\le \rho _{M|_{E\setminus X}}(Y)\) for every \(Y\subseteq E\setminus X\). In terms of subspace representations, contraction of \(e\in E\) corresponds to intersecting the subspace that represents M with the hyperplane \(\{x_e=0\}\).
As matroids are used as an abstraction for linear codes, it would be desirable to have a way to go back from matroids to codes, namely to determine whether a given matroid is representable, and when it is, to find such a representation. Unfortunately, there is no simple criterion to determine representability [25, 43]. However, there are a plethora of sufficient criteria to prove nonrepresentability, both over a given field and over fields in general. In recent years, these methods have been used to prove two long-standing conjectures, that we will discuss in sections Rota’s Conjecture and Most Matroids are Nonrepresentable respectively.
1.1 Rota’s Conjecture
While there is no simple criterion to determine linear representability, the situation is much more promising if we consider representations over a fixed field. It has been known since 1958, that there is a simple criterion for when a matroid is binary representable.
Theorem 11
([42]) Let \(M = (\rho ,E)\) be a matroid. The following two conditions are equivalent.
-
1.
M is linearly representable over \(\mathbb {F}_2\).
-
2.
There are no sets \(X\subseteq Y\subseteq E\) such that M|Y / X is isomorphic to the uniform matroid \(U_4^2\).
In essence, this means that the only obstruction that needs to be overcome in order to be representable over the binary alphabet, is that no more than three nonzero points can fit in the same plane. For further reference, we say that a minor of the matroid \(M = (\rho ,E)\) is a matroid of the form M|Y / X, for \(X\subseteq Y\subseteq E\). Clearly, if M is representable over \(\mathbb {F}\), then so is all its minors. Let \(L(\mathbb {F})\) be the class of matroids that are not representable over \(\mathbb {F}\), but such that all of their minors are \(\mathbb {F}\)-representable. Then the class of \(\mathbb {F}\)-representable matroids can be written as the class of matroids that does not contain any matroid from \(L(\mathbb {F})\) as a minor. Gian-Carlo Rota conjectured in 1970 that \(L(\mathbb {F})\) is a finite set for all finite fields \(\mathbb {F}\). A proof of this conjecture was announced by Geelen, Gerards and Whittle in 2014, but the details of the proof still remain to written up [12].
Theorem 12
For any finite field \(\mathbb {F}\), there is a finite set \(L(\mathbb {F})\) of matroids such that any matroid M is representable if and only if it contains no element from \(L(\mathbb {F})\) as a minor.
Since the 1970s, it has been known that a matroid is representable over \(\mathbb {F}_3\) if and only if it avoids the uniform matroids \(U_5^2\), \(U_5^3\), the Fano plane \(P^2(\mathbb {F}_2)\), and its dual \(P^2(\mathbb {F}_2)^*\) as minors. The list \(L(\mathbb {F}_4)\) has seven elements, and was given explicitly in 2000. For larger fields, the explicit list is not known, and there is little hope to even find useful bounds on its size.
1.2 Most Matroids are Nonrepresentable
For a fixed finite field \(\mathbb {F}\), it follows rather immediately from the minor-avoiding description in the last section that the fraction of n-symbol matroids that is \(\mathbb {F}\)-representable goes to zero as \(n\rightarrow \infty \). It has long been a folklore conjecture that this is true even when representations over arbitrary fields are allowed. However, it was only in 2016 that a verifiable proof of this claim was announced [26].
Theorem 13
The proof is via estimates of the denominator and enumerator of the expression in Theorem 13 separately. Indeed, it is shown in [19] that the number of matroids on n nodes is at least \(\varOmega (2^{(2-\varepsilon )^n})\) for every \(\varepsilon >0\). The proof of Theorem 13 thus boiled down to proving that the number of representable matroids is \(O(2^{n^3})\). This is in turn achieved by bounding the number of so called zero-patterns of polynomials.
1.3 Gammoid Construction of Singleton-Optimal LRCs
For completeness, we end this appendix with a theorem that explicitly presents the matroids constructed in Theorem 7 as gammoids. As discussed in Sect. 4.2, this proves the existence of Singleton-optimal linear LRCs whenever a set system satisfying (8) exists.
Theorem 14
([47], \(M(F_1,\ldots ,F_m,E;k;\rho )\)-matroids are gammoids) Let \(M(F_1,\ldots ,F_m;\rho )\) be a matroid given by Theorem 7 and define \(s: E \rightarrow 2^{[m]}\) where \(s(x) = \{i \in [m] : x \in F_i\}\). Then \(M(F_1,\ldots ,F_m,E;k;\rho )\) is equal to the gammoid \(M(\varGamma ,E,T)\), where \(\varGamma = (V,D)\) is the directed graph with
Rights and permissions
Copyright information
© 2018 Springer International Publishing AG
About this chapter
Cite this chapter
Freij-Hollanti, R., Hollanti, C., Westerbäck, T. (2018). Matroid Theory and Storage Codes: Bounds and Constructions. In: Greferath, M., Pavčević, M., Silberstein, N., Vázquez-Castro, M. (eds) Network Coding and Subspace Designs. Signals and Communication Technology. Springer, Cham. https://doi.org/10.1007/978-3-319-70293-3_15
Download citation
DOI: https://doi.org/10.1007/978-3-319-70293-3_15
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-70292-6
Online ISBN: 978-3-319-70293-3
eBook Packages: EngineeringEngineering (R0)