Abstract
Graph is a way of representing data as a web of relationships. Graphs consist of nodes and edges where nodes are concepts and edges are the relationships. Graph analytics is the science of solving problems expressed on graphs. Graph analytics is useful for finding hidden patterns, relationships, similarities and anomalies in graphs. These tasks are useful in many application areas like protein analysis, fraud detection, health care, computer security, financial data analysis, etc. Minimum description length (MDL) comes from information theory, which can be used for universal coding or universal modeling. SUBstructure Discovery Using Examples (SUBDUE) algorithm uses MDL to discover substructures. In this paper, the use of MDL for graph analytics is shown by applying MDL encoding to various graph datasets and in particular graph matching is solved using MDL. Further, comparative analysis is done to show how MDL value changes w.r.t. varying graph properties. subgen tool is used to generate graph datasets. Statistical tests are applied and we came to know in which cases MDL value changes significantly.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
D.J. Cook, L.B. Holder, D.J. Cook, and S. Djoko “Substructure discovery in the SUBDUE system.,” in Proc. of the AI Workshop on Knowledge Discovery in Databases, pages 169–180, (1994)
Minimum description length, https://en.wikipedia.org/wiki/Minimum_description_length
Minimum Description Length (MDL), http://www.modelselection.org/mdl/
Quinlan, J.R., Rivest, R.L., Inferring decision trees using the minimum description length principle, Information and Computation, 227–248, (1989)
Hitoshi. I.B.A., H.DE and S.TAISUKE, Genetic Programming using a Minimum Description Length Principle. Advances in Genetic Programming, (1994)
Leclerc. Y.G. Constructing simple stable descriptions for image partitioning. International Journal of Computer Vision, 73–102, (1989)
Matthijs van Leeuwen and Jilles Vreeken, Mining and Using Sets of Patterns through Compression, In: Charu C. Aggarwal. Jiawei Han, Frequent Pattern Mining-Springer US (2014)
Kaspar Riesen, Xiaoyi Jiang, Horst Bunke, Exact and Inexact Graph Matching: Methodology and Applications, In: Charu C. Aggarwal, Haixun Wang (eds.)-Managing and Mining Graph Data- Springer US (2010)
The Stony Brook Algorithm Repository, http://www3.cs.stonybrook.edu/~algorith/implement/nauty/implement.shtml
C. Noble and D. Cook, Graph-Based Anomaly Detection. Proceedings of the 9th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining., 631–636, (2003)
The T-Test, http://www.socialresearchmethods.net/kb/stat_t.php
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer Science+Business Media Singapore
About this paper
Cite this paper
Velampalli, S., Murthy Jonnalagedda, V.R. (2017). Minimum Description Length (MDL) Based Graph Analytics. In: Satapathy, S., Prasad, V., Rani, B., Udgata, S., Raju, K. (eds) Proceedings of the First International Conference on Computational Intelligence and Informatics . Advances in Intelligent Systems and Computing, vol 507. Springer, Singapore. https://doi.org/10.1007/978-981-10-2471-9_10
Download citation
DOI: https://doi.org/10.1007/978-981-10-2471-9_10
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-2470-2
Online ISBN: 978-981-10-2471-9
eBook Packages: EngineeringEngineering (R0)