Abstract
Finding sparse cuts is an important tool for analyzing large graphs that arise in practice, such as the web graph, online social communities, and VLSI circuits. When dealing with such graphs having billions of nodes, it is often hard to visualize global partitions. While studies on sparse cuts have traditionally looked at cuts with respect to all the nodes in the graph, some recent works analyze graph properties projected onto a small subset of vertices that may be of interest in a given context, e.g., relevant documents to a query in a search engine. In this paper, we study how sparse cuts in a graph partition a certain subset of nodes. We call this partition a cut projection. We study the problem of finding cut projections in the streaming model that is appropriate in this context as the input graph is too large to store in main memory. Specifically, for a d-regular graph G on n nodes with a cut of conductance Φ and constant balance, we show how to partition a randomly chosen set of k nodes in \(\tilde{O}(\frac{1}{\sqrt{\alpha\Phi}})\) passes over the graph stream and space \(\tilde{O}(n\alpha + \frac{n^{3/4}k^{1/4}}{\sqrt{\alpha}\Phi^{19/4}})\), for any choice of α ≤ 1. The resulting partition is the projection of a cut of conductance of at most \(\tilde{O}(\sqrt{\Phi})\). We note that for k < nα 6ΦO(1), this can be done in \(\tilde{O}(1/\sqrt{\alpha\Phi})\) passes and space \(\tilde{O}(n\alpha)\) that is sublinear in the number of nodes.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Ahn, K.J., Guha, S.: Graph sparsification in the semi-streaming model. In: Albers, S., Marchetti-Spaccamela, A., Matias, Y., Nikoletseas, S., Thomas, W. (eds.) ICALP 2009. Part II. LNCS, vol. 5556, pp. 328–338. Springer, Heidelberg (2009)
Alon, N.: Eigenvalues and expanders. Combinatorica 6(2), 83–96 (1986)
Andersen, R., Chung, F.R.K., Lang, K.J.: Local Graph Partitioning using PageRank Vectors. In: FOCS, pp. 475–486 (2006)
Arora, S., Rao, S., Vazirani, U.V.: Expander flows, geometric embeddings and graph partitioning. In: STOC, pp. 222–231 (2004)
Benczúr, A.A., Karger, D.R.: Approximating s-t Minimum Cuts in Õ(n \(^{\mbox{2}}\)) Time. In: STOC, pp. 47–55 (1996)
Bhatt, S.N., Leighton, F.T.: A Framework for Solving VLSI Graph Layout Problems. J. Comput. Syst. Sci. 28(2), 300–343 (1984)
Boppana, R.: Eigenvalues and graph bisection: an average case analysis. In: 28th IEEE Symposium on Foundations of Computer Science, FOCS (1987)
Borgs, C., Chayes, J.T., Mahdian, M., Saberi, A.: Exploring the community structure of newsgroups. In: KDD, pp. 783–787 (2004)
Sarma, A.D., Gollapudi, S., Panigrahy, R.: Estimating PageRank on graph streams. In: PODS, pp. 69–78 (2008)
Demetrescu, C., Finocchi, I., Ribichini, A.: Trading of space for passes in graph streaming problems. In: ACM-SIAM Symposium on Discrete Algorithms, SODA, pp. 714–723 (2006)
Goldreich, O., Ron, D.: Property Testing in Bounded Degree Graphs. In: STOC, pp. 406–415 (1997)
Henzinger, M., Raghavan, P., Rajagopalan, S.: Computing on data streams. In: External Memory Algorithms. DIMACS Series in Discrete Mathematics and Theoretical Computer Science, vol. 50, pp. 107–118 (1999)
Karger, D.R.: Minimum cuts in near-linear time. J. ACM 47(1), 46–76 (2000)
Kleinberg, J.M.: Authoritative sources in a hyperlinked environment. In: SODA, pp. 668–677 (1998)
Lempel, R., Moran, S.: Salsa: the stochastic approach for link-structure analysis. ACM Trans. Inf. Syst. 19(2), 131–160 (2001)
Leskovec, J., Dumais, S., Horvitz, E.: Web projections: learning from contextual subgraphs of the web. In: WWW 2007: Proceedings of the 16th international conference on World Wide Web, pp. 471–480. ACM, New York (2007)
Leskovec, J., Faloutsos, C.: Sampling from large graphs. In: KDD, pp. 631–636 (2006)
Lovász, L., Simonovits, M.: The Mixing Rate of Markov Chains, an Isoperimetric Inequality, and Computing the Volume. In: FOCS, pp. 346–354 (1990)
Lovász, L., Simonovits, M.: Random Walks in a Convex Body and an Improved Volume Algorithm. Random Struct. Algorithms 4(4), 359–412 (1993)
Manokaran, R., Naor, J., Raghavendra, P., Schwartz, R.: SDP gaps and UGC hardness for multiway cut, 0-extension, and metric labeling. In: STOC, pp. 11–20 (2008)
Sinclair, A., Jerrum, M.: Conductance and the mixing property of markov chains; the approximation of the permanent resolved. In: Proc. of the 20th annual ACM Symposium on Theory of computing, pp. 235–244 (1988)
Spielman, D.A., Teng, S.-H.: Nearly-linear time algorithms for graph partitioning, graph sparsification, and solving linear systems. In: STOC, pp. 81–90 (2004)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Das Sarma, A., Gollapudi, S., Panigrahy, R. (2009). Sparse Cut Projections in Graph Streams. In: Fiat, A., Sanders, P. (eds) Algorithms - ESA 2009. ESA 2009. Lecture Notes in Computer Science, vol 5757. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-04128-0_43
Download citation
DOI: https://doi.org/10.1007/978-3-642-04128-0_43
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-04127-3
Online ISBN: 978-3-642-04128-0
eBook Packages: Computer ScienceComputer Science (R0)