Skip to main content

Efficient Primal-Dual Graph Algorithms for MapReduce

  • Conference paper
  • First Online:
Algorithms and Models for the Web Graph (WAW 2014)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 8882))

Included in the following conference series:

Abstract

In this paper, we obtain improved algorithms for two graph-theoretic problems in the popular MapReduce framework. The first problem we consider is the densest subgraph problem. We present a primal-dual algorithm that provides a \((1+\epsilon )\) approximation and takes \(O({\log n\over \epsilon ^2})\) MapReduce iterations, each iteration having a shuffle size of \(O(m)\) and a reducer size of \(O(d_{max})\). Here \(m\) is the number of edges, \(n\) is the number of vertices, and \(d_{max}\) is the maximum degree of a node. This dominates the previous best MapReduce algorithm, which provided a \((2+\delta )\)-approximation in \(O({\log n\over \delta })\) iterations, with each iteration having a total shuffle size of \(O(m)\) and a reducer size of \(O(d_{max})\).

The standard primal-dual technique for solving the above problem results in \(O(n)\) iterations. Our key idea is to carefully control the width of the underlying polytope so that the number of iterations becomes small, but an approximate primal solution can still be recovered from the approximate dual solution. We then show an application of the same technique to the fractional maximum matching problem in bipartite graphs. Our results also map naturally to the PRAM model.

Ashish Goel: Supported in part by the DARPA xdata program, by grant #FA9550-12-1-0411 from the U.S. Air Force Office of Scientific Research (AFOSR) and the Defense Advanced Research Projects Agency (DARPA), and by NSF Award 0904325.

Kamesh Munagala: Supported by NSF grants CCF- 0745761, CCF-1348696, IIS-0964560, and IIS-1447554; and by grant W911NF-14-1- 0366 from the Army Research Office (ARO). Part of this work was done while the author was visiting Twitter, Inc.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 34.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 44.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Afrati, F.N., Das Sarma, S., Salihoglu, S., Ullman, J.D.: Upper and lower bounds on the cost of a map-reduce computation. PVLDB 6, 277–288 (2013)

    Google Scholar 

  2. Ahn, K.J., Guha, S.: Linear programming in the semi-streaming model with application to the maximum matching problem. In: Aceto, L., Henzinger, M., Sgall, J. (eds.) ICALP 2011, Part II. LNCS, vol. 6756, pp. 526–538. Springer, Heidelberg (2011)

    Chapter  Google Scholar 

  3. Arora, S., Hazan, E., Kale, S.: The multiplicative weights update method: a meta algorithm and applications. Theory of Computing 8, 121–164 (2012)

    Article  MathSciNet  Google Scholar 

  4. Awerbuch, B., Khandekar, R., Rao, S.: Distributed algorithms for multicommodity flow problems via approximate steepest descent framework. In: SODA, pp. 949–957 (2007)

    Google Scholar 

  5. Bahmani, B., Kumar, R., Vassilvitskii, S.: Densest subgraph in streaming and mapreduce. PVLDB 5(5), 454–465 (2012)

    Google Scholar 

  6. Charikar, M.: Greedy approximation algorithms for finding dense components in a graph. In: Jansen, K., Khuller, S. (eds.) APPROX 2000. LNCS, vol. 1913, pp. 84–95. Springer, Heidelberg (2000)

    Chapter  Google Scholar 

  7. Dean, J., Ghemawat, S.: Mapreduce: Simplified data processing on large clusters. In: OSDI, pp. 137–150 (2004)

    Google Scholar 

  8. Garg, N., Könemann, J.: Faster and simpler algorithms for multicommodity flow and other fractional packing problems. SIAM J. Comput. 37(2), 630–652 (2007)

    Article  MATH  MathSciNet  Google Scholar 

  9. Grigoriadis, M.D., Khachiyan, L.G.: Approximate minimum-cost multicommodity flows in \(\tilde{O}(\epsilon ^{-2}knm)\) time. Math. Program. 75, 477–482 (1996)

    MATH  MathSciNet  Google Scholar 

  10. Apache hadoop, http://hadoop.apache.org

  11. Kannan, R., Vinay, V.: Analyzing the structure of large graphs. Manuscript (1999)

    Google Scholar 

  12. Karloff, H., Suri, S., Vassilvitskii, S.: A model of computation for mapreduce. In: SODA (2010)

    Google Scholar 

  13. Motwani, R., Panigrahy, R., Xu, Y.: Fractional matching via balls-and-bins. In: Díaz, J., Jansen, K., Rolim, J.D.P., Zwick, U. (eds.) APPROX 2006 and RANDOM 2006. LNCS, vol. 4110, pp. 487–498. Springer, Heidelberg (2006)

    Chapter  Google Scholar 

  14. Plotkin, S.A., Shmoys, D.B., Tardos, É.:. Fast approximation algorithms for fractional packing and covering problems. In: FOCS, pp. 495–504 (1991)

    Google Scholar 

  15. Saha, B., Hoch, A., Khuller, S., Raschid, L., Zhang, X.-N.: Dense subgraphs with restrictions and applications to gene annotation graphs. In: Berger, B. (ed.) RECOMB 2010. LNCS, vol. 6044, pp. 456–472. Springer, Heidelberg (2010)

    Chapter  Google Scholar 

  16. Suri, S., Vassilvitskii, S.: Counting triangles and the curse of the last reducer. In: WWW, pp. 607–614 (2011)

    Google Scholar 

  17. Young, N.E.: Randomized rounding without solving the linear program. In: SODA (1995)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Bahman Bahmani .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

Bahmani, B., Goel, A., Munagala, K. (2014). Efficient Primal-Dual Graph Algorithms for MapReduce. In: Bonato, A., Graham, F., Prałat, P. (eds) Algorithms and Models for the Web Graph. WAW 2014. Lecture Notes in Computer Science(), vol 8882. Springer, Cham. https://doi.org/10.1007/978-3-319-13123-8_6

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-13123-8_6

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-13122-1

  • Online ISBN: 978-3-319-13123-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics