Abstract
Graphs and matrices are related inherently, so it is not surprising that a number of big graph systems expose a matrix-based interface for users. In this chapter, we introduce the matrix-based systems for big graph processing. In particular, we review three such systems, PEGASUS, GBASE, and SystemML. All three systems allow users to express graph algorithms using operations on matrices, and rely on a general purpose data processing system, such as MapReduce or Spark, for distributed execution. Among the three systems, SystemML is the only one that is an active and well-maintained open-source project. Thus, for interested readers, we highly recommend SystemML for trying out the matrix-based graph processing.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
- 2.
- 3.
- 4.
- 5.
References
N. Biggs. Algebraic Graph Theory. Cambridge University Press, 2nd edition, 1993.
M. Boehm, D. Burdick, A. Evfimievski, B. Reinwald, F. R. Reiss, P. Sen, S. Tatikonda, and Y. Tian. SystemML’s optimizer: Plan generation for large-scale machine learning programs. Bulletin of the IEEE Computer Society Technical Committee on Data Engineering, 37(3), 2014.
M. Boehm, M. W. Dusenberry, D. Eriksson, A. V. Evfimievski, F. M. Manshadi, N. Pansare, B. Reinwald, F. R. Reiss, P. Sen, A. C. Surve, and S. Tatikonda. SystemML: Declarative machine learning on spark. PVLDB, 9(13):1425–1436, 2016.
M. Boehm, S. Tatikonda, B. Reinwald, P. Sen, Y. Tian, D. R. Burdick, and S. Vaithyanathan. Hybrid parallelization strategies for large-scale machine learning in SystemML. PVLDB, 7(7):553–564, 2014.
A. Elgohary, M. Boehm, P. J. Haas, F. R. Reiss, and B. Reinwald. Compressed linear algebra for large-scale machine learning. PVLDB, 9(12):960–971, 2016.
A. Ghoting, R. Krishnamurthy, E. Pednault, B. Reinwald, V. Sindhwani, S. Tatikonda, Y. Tian, and S. Vaithyanathan. SystemML: Declarative machine learning on mapreduce. In ICDE, pages 231–242, 2011.
C. Godsil and G. Royle. Algebraic Graph Theory, volume 207 of Graduate Texts in Mathematics. Springer, 2001.
B. Huang, M. Boehm, Y. Tian, B. Reinwald, S. Tatikonda, and F. R. Reiss. Resource elasticity for large-scale machine learning. In SIGMOD, pages 137–152, 2015.
U. Kang, H. Tong, J. Sun, C. Lin, and C. Faloutsos. GBASE: a scalable and general graph management system. In SIGKDD, pages 1091–1099, 2011.
U. Kang, H. Tong, J. Sun, C.-Y. Lin, and C. Faloutsos. gbase: an efficient analysis platform for large graphs. The VLDB Journal, 21(5):637–650, 2012.
U. Kang, C. E. Tsourakakis, and C. Faloutsos. PEGASUS: A peta-scale graph mining system. In ICDM, pages 229–238, 2009.
U. Kang, C. E. Tsourakakis, and C. Faloutsos. Pegasus: Mining peta-scale graphs. Knowl. Inf. Syst., 27(2):303–325, May 2011.
Y. Tian, S. Tatikonda, and B. Reinwald. Scalable and numerically stable descriptive statistics in SystemML. In ICDE, pages 1351–1359, 2012.
V. K. Vavilapalli, A. C. Murthy, C. Douglas, S. Agarwal, M. Konar, R. Evans, T. Graves, J. Lowe, H. Shah, S. Seth, B. Saha, C. Curino, O. O’Malley, S. Radia, B. Reed, and E. Baldeschwieler. Apache hadoop yarn: Yet another resource negotiator. In SOCC, pages 5:1–5:16, 2013.
Author information
Authors and Affiliations
Rights and permissions
Copyright information
© 2017 The Author(s)
About this chapter
Cite this chapter
Yan, D., Tian, Y., Cheng, J. (2017). Matrix-Based Graph Systems. In: Systems for Big Graph Analytics. SpringerBriefs in Computer Science. Springer, Cham. https://doi.org/10.1007/978-3-319-58217-7_7
Download citation
DOI: https://doi.org/10.1007/978-3-319-58217-7_7
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-58216-0
Online ISBN: 978-3-319-58217-7
eBook Packages: Computer ScienceComputer Science (R0)