Skip to main content

Matrix-Based Graph Systems

  • Chapter
  • First Online:
Systems for Big Graph Analytics

Part of the book series: SpringerBriefs in Computer Science ((BRIEFSCOMPUTER))

Abstract

Graphs and matrices are related inherently, so it is not surprising that a number of big graph systems expose a matrix-based interface for users. In this chapter, we introduce the matrix-based systems for big graph processing. In particular, we review three such systems, PEGASUS, GBASE, and SystemML. All three systems allow users to express graph algorithms using operations on matrices, and rely on a general purpose data processing system, such as MapReduce or Spark, for distributed execution. Among the three systems, SystemML is the only one that is an active and well-maintained open-source project. Thus, for interested readers, we highly recommend SystemML for trying out the matrix-based graph processing.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    http://apache.github.io/incubator-systemml/standalone-guide.html.

  2. 2.

    http://apache.github.io/incubator-systemml/hadoop-batch-mode.html.

  3. 3.

    http://apache.github.io/incubator-systemml/spark-batch-mode.html.

  4. 4.

    http://apache.github.io/incubator-systemml/spark-mlcontext-programming-guide.html.

  5. 5.

    http://apache.github.io/incubator-systemml/jmlc.html.

References

  1. N. Biggs. Algebraic Graph Theory. Cambridge University Press, 2nd edition, 1993.

    Google Scholar 

  2. M. Boehm, D. Burdick, A. Evfimievski, B. Reinwald, F. R. Reiss, P. Sen, S. Tatikonda, and Y. Tian. SystemML’s optimizer: Plan generation for large-scale machine learning programs. Bulletin of the IEEE Computer Society Technical Committee on Data Engineering, 37(3), 2014.

    Google Scholar 

  3. M. Boehm, M. W. Dusenberry, D. Eriksson, A. V. Evfimievski, F. M. Manshadi, N. Pansare, B. Reinwald, F. R. Reiss, P. Sen, A. C. Surve, and S. Tatikonda. SystemML: Declarative machine learning on spark. PVLDB, 9(13):1425–1436, 2016.

    Google Scholar 

  4. M. Boehm, S. Tatikonda, B. Reinwald, P. Sen, Y. Tian, D. R. Burdick, and S. Vaithyanathan. Hybrid parallelization strategies for large-scale machine learning in SystemML. PVLDB, 7(7):553–564, 2014.

    Google Scholar 

  5. A. Elgohary, M. Boehm, P. J. Haas, F. R. Reiss, and B. Reinwald. Compressed linear algebra for large-scale machine learning. PVLDB, 9(12):960–971, 2016.

    Google Scholar 

  6. A. Ghoting, R. Krishnamurthy, E. Pednault, B. Reinwald, V. Sindhwani, S. Tatikonda, Y. Tian, and S. Vaithyanathan. SystemML: Declarative machine learning on mapreduce. In ICDE, pages 231–242, 2011.

    Google Scholar 

  7. C. Godsil and G. Royle. Algebraic Graph Theory, volume 207 of Graduate Texts in Mathematics. Springer, 2001.

    Google Scholar 

  8. B. Huang, M. Boehm, Y. Tian, B. Reinwald, S. Tatikonda, and F. R. Reiss. Resource elasticity for large-scale machine learning. In SIGMOD, pages 137–152, 2015.

    Google Scholar 

  9. U. Kang, H. Tong, J. Sun, C. Lin, and C. Faloutsos. GBASE: a scalable and general graph management system. In SIGKDD, pages 1091–1099, 2011.

    Google Scholar 

  10. U. Kang, H. Tong, J. Sun, C.-Y. Lin, and C. Faloutsos. gbase: an efficient analysis platform for large graphs. The VLDB Journal, 21(5):637–650, 2012.

    Google Scholar 

  11. U. Kang, C. E. Tsourakakis, and C. Faloutsos. PEGASUS: A peta-scale graph mining system. In ICDM, pages 229–238, 2009.

    Google Scholar 

  12. U. Kang, C. E. Tsourakakis, and C. Faloutsos. Pegasus: Mining peta-scale graphs. Knowl. Inf. Syst., 27(2):303–325, May 2011.

    Article  Google Scholar 

  13. Y. Tian, S. Tatikonda, and B. Reinwald. Scalable and numerically stable descriptive statistics in SystemML. In ICDE, pages 1351–1359, 2012.

    Google Scholar 

  14. V. K. Vavilapalli, A. C. Murthy, C. Douglas, S. Agarwal, M. Konar, R. Evans, T. Graves, J. Lowe, H. Shah, S. Seth, B. Saha, C. Curino, O. O’Malley, S. Radia, B. Reed, and E. Baldeschwieler. Apache hadoop yarn: Yet another resource negotiator. In SOCC, pages 5:1–5:16, 2013.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

Copyright information

© 2017 The Author(s)

About this chapter

Cite this chapter

Yan, D., Tian, Y., Cheng, J. (2017). Matrix-Based Graph Systems. In: Systems for Big Graph Analytics. SpringerBriefs in Computer Science. Springer, Cham. https://doi.org/10.1007/978-3-319-58217-7_7

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-58217-7_7

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-58216-0

  • Online ISBN: 978-3-319-58217-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics