Abstract
In this paper, we obtain improved algorithms for two graph-theoretic problems in the popular MapReduce framework. The first problem we consider is the densest subgraph problem. We present a primal-dual algorithm that provides a \((1+\epsilon )\) approximation and takes \(O({\log n\over \epsilon ^2})\) MapReduce iterations, each iteration having a shuffle size of \(O(m)\) and a reducer size of \(O(d_{max})\). Here \(m\) is the number of edges, \(n\) is the number of vertices, and \(d_{max}\) is the maximum degree of a node. This dominates the previous best MapReduce algorithm, which provided a \((2+\delta )\)-approximation in \(O({\log n\over \delta })\) iterations, with each iteration having a total shuffle size of \(O(m)\) and a reducer size of \(O(d_{max})\).
The standard primal-dual technique for solving the above problem results in \(O(n)\) iterations. Our key idea is to carefully control the width of the underlying polytope so that the number of iterations becomes small, but an approximate primal solution can still be recovered from the approximate dual solution. We then show an application of the same technique to the fractional maximum matching problem in bipartite graphs. Our results also map naturally to the PRAM model.
Ashish Goel: Supported in part by the DARPA xdata program, by grant #FA9550-12-1-0411 from the U.S. Air Force Office of Scientific Research (AFOSR) and the Defense Advanced Research Projects Agency (DARPA), and by NSF Award 0904325.
Kamesh Munagala: Supported by NSF grants CCF- 0745761, CCF-1348696, IIS-0964560, and IIS-1447554; and by grant W911NF-14-1- 0366 from the Army Research Office (ARO). Part of this work was done while the author was visiting Twitter, Inc.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Afrati, F.N., Das Sarma, S., Salihoglu, S., Ullman, J.D.: Upper and lower bounds on the cost of a map-reduce computation. PVLDB 6, 277–288 (2013)
Ahn, K.J., Guha, S.: Linear programming in the semi-streaming model with application to the maximum matching problem. In: Aceto, L., Henzinger, M., Sgall, J. (eds.) ICALP 2011, Part II. LNCS, vol. 6756, pp. 526–538. Springer, Heidelberg (2011)
Arora, S., Hazan, E., Kale, S.: The multiplicative weights update method: a meta algorithm and applications. Theory of Computing 8, 121–164 (2012)
Awerbuch, B., Khandekar, R., Rao, S.: Distributed algorithms for multicommodity flow problems via approximate steepest descent framework. In: SODA, pp. 949–957 (2007)
Bahmani, B., Kumar, R., Vassilvitskii, S.: Densest subgraph in streaming and mapreduce. PVLDB 5(5), 454–465 (2012)
Charikar, M.: Greedy approximation algorithms for finding dense components in a graph. In: Jansen, K., Khuller, S. (eds.) APPROX 2000. LNCS, vol. 1913, pp. 84–95. Springer, Heidelberg (2000)
Dean, J., Ghemawat, S.: Mapreduce: Simplified data processing on large clusters. In: OSDI, pp. 137–150 (2004)
Garg, N., Könemann, J.: Faster and simpler algorithms for multicommodity flow and other fractional packing problems. SIAM J. Comput. 37(2), 630–652 (2007)
Grigoriadis, M.D., Khachiyan, L.G.: Approximate minimum-cost multicommodity flows in \(\tilde{O}(\epsilon ^{-2}knm)\) time. Math. Program. 75, 477–482 (1996)
Apache hadoop, http://hadoop.apache.org
Kannan, R., Vinay, V.: Analyzing the structure of large graphs. Manuscript (1999)
Karloff, H., Suri, S., Vassilvitskii, S.: A model of computation for mapreduce. In: SODA (2010)
Motwani, R., Panigrahy, R., Xu, Y.: Fractional matching via balls-and-bins. In: DÃaz, J., Jansen, K., Rolim, J.D.P., Zwick, U. (eds.) APPROX 2006 and RANDOM 2006. LNCS, vol. 4110, pp. 487–498. Springer, Heidelberg (2006)
Plotkin, S.A., Shmoys, D.B., Tardos, É.:. Fast approximation algorithms for fractional packing and covering problems. In: FOCS, pp. 495–504 (1991)
Saha, B., Hoch, A., Khuller, S., Raschid, L., Zhang, X.-N.: Dense subgraphs with restrictions and applications to gene annotation graphs. In: Berger, B. (ed.) RECOMB 2010. LNCS, vol. 6044, pp. 456–472. Springer, Heidelberg (2010)
Suri, S., Vassilvitskii, S.: Counting triangles and the curse of the last reducer. In: WWW, pp. 607–614 (2011)
Young, N.E.: Randomized rounding without solving the linear program. In: SODA (1995)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Bahmani, B., Goel, A., Munagala, K. (2014). Efficient Primal-Dual Graph Algorithms for MapReduce. In: Bonato, A., Graham, F., Prałat, P. (eds) Algorithms and Models for the Web Graph. WAW 2014. Lecture Notes in Computer Science(), vol 8882. Springer, Cham. https://doi.org/10.1007/978-3-319-13123-8_6
Download citation
DOI: https://doi.org/10.1007/978-3-319-13123-8_6
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-13122-1
Online ISBN: 978-3-319-13123-8
eBook Packages: Computer ScienceComputer Science (R0)