Skip to main content

A Distributed Algorithm for γ-Quasi-Clique Extractions in Massive Graphs

  • Conference paper
Innovative Computing Technology (INCT 2011)

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 241))

Included in the following conference series:

Abstract

In this paper, we investigate the challenge of increasing the size of graphs for finding γ-quasi-cliques. We propose an algorithm based on MapReduce programming model. In the proposed solution, we use some known techniques to prune unnecessary and inefficient parts of search space and divides the massive input graph into smaller parts. Then the data for processing each part is sent to a single computer. The evaluation shows that we can substantially reduce the time for large graphs and besides there is no limit for graph size in our algorithm.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Gantz, J., Chute, C., Manfrediz, A., Minton, S., Reinsel, D., Schlichting, W., Toncheva, A.: The diverse and exploding digital universe. IDC White Paper 2 (2008)

    Google Scholar 

  2. http://www.facebook.com/press/info.php?statistics

  3. http://www.psc.edu/general/software/packages/genbank

  4. Grimmett, G.: Percolation. Springer, Heidelberg (1999)

    Book  MATH  Google Scholar 

  5. Feige, U., Peleg, D., Kortsarz, G.: The dense k-subgraph problem. Algorithmica 29, 410–421 (2001)

    Article  MathSciNet  MATH  Google Scholar 

  6. Dean, J., Ghemawat, S.: MapReduce: simplified data processing on large clusters. In: The 6th Conference on Opearting Systems Design & Implementation, vol. 6, p. 10. USENIX Association, San Francisco (2004)

    Google Scholar 

  7. Cohen, J.: Graph Twiddling in a MapReduce World. Computing in Science and Engineering 11, 29–41 (2009)

    Article  Google Scholar 

  8. Zeng, Z., Wang, J., Zhou, L., Karypis, G.: Coherent closed quasi-clique discovery from large dense graph databases. In: Proceedings of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 797–802. ACM, Philadelphia (2006)

    Chapter  Google Scholar 

  9. Zeng, Z., Wang, J., Zhou, L., Karypis, G.: Out-of-Core Coherent Closed Quasi-Clique Mining from Large Dense Graph Databases. ACM Transactions on Database Systems 32 (2007)

    Google Scholar 

  10. Liu, G., Wong, L.: Effective Pruning Techniques for Mining Quasi-Cliques. In: Daelemans, W., Goethals, B., Morik, K. (eds.) ECML PKDD 2008, Part II. LNCS (LNAI), vol. 5212, pp. 33–49. Springer, Heidelberg (2008)

    Chapter  Google Scholar 

  11. Pei, J., Jiang, D., Zhang, A.: On Mining Cross-Graph Quasi-Cliques. In: KDD 2005. ACM, Chicago (2005)

    Google Scholar 

  12. Uno, T.: An Efficient Algorithm for Solving Pseudo Clique Enumeration Problem. Algorithmica 56, 3–16 (2010)

    Article  MathSciNet  MATH  Google Scholar 

  13. Gibson, D., Kumar, R., Tomkins, A.: Discovering large dense subgraphs in massive graphs. In: Proceedings of the 31st International Conference on Very Large Data Bases, pp. 721–732. VLDB Endowment, Trondheim (2005)

    Google Scholar 

  14. Abello, J., Resende, M., Sudarsky, S.: Massive Quasi-Clique Detection. In: Rajsbaum, S. (ed.) LATIN 2002. LNCS, vol. 2286, pp. 598–612. Springer, Heidelberg (2002)

    Chapter  Google Scholar 

  15. Agarwal, G., Kempe, D.: Modularity-maximizing graph communities via mathematical programming. The European Physical Journal B - Condensed Matter and Complex Systems 66, 409–418 (2008)

    Article  MathSciNet  MATH  Google Scholar 

  16. Bader, G., Hogue, C.: An automated method for finding molecular complexes in large protein interaction networks. BMC Bioinformatics 4, 2 (2003)

    Article  Google Scholar 

  17. Gargi, U., Lu, W., Mirrokni, V., Yoon, S.: Large-Scale Community Detection on YouTube for Topic Discovery and Exploration (2011)

    Google Scholar 

  18. Ucar, D., Asur, S., Catalyurek, U., Parthasarathy, S.: Improving Functional Modularity in Protein-Protein Interactions Graphs Using Hub-Induced Subgraphs. In: Fürnkranz, J., Scheffer, T., Spiliopoulou, M. (eds.) PKDD 2006. LNCS (LNAI), vol. 4213, pp. 371–382. Springer, Heidelberg (2006)

    Chapter  Google Scholar 

  19. Hu, H., Yan, X., Huang, Y., Han, J., Zhou, X.J.: Mining coherent dense subgraphs across massive biological networks for functional discovery. Bioinformatics 21, i1213 (2005)

    Google Scholar 

  20. Tsourakakis, C.E., Kang, U., Miller, G.L., Faloutsos, C.: DOULION: counting triangles in massive graphs with a coin. In: Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 837–846. ACM, Paris (2009)

    Chapter  Google Scholar 

  21. Zhang, Y., Wang, J., Wang, Y., Zhou, L.: Parallel community detection on large networks with propinquity dynamics. In: Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 997–1006. ACM, Paris (2009)

    Chapter  Google Scholar 

  22. Zhang, Y., Wang, J., Zeng, Z., Zhou, L.: Parallel mining of closed quasi-cliques, pp. 1–10. IEEE (2008)

    Google Scholar 

  23. Afrati, F.N., Ullman, J.D.: A New Computation Model for Cluster Computing (2009)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2011 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Khosraviani, A., Sharifi, M. (2011). A Distributed Algorithm for γ-Quasi-Clique Extractions in Massive Graphs. In: Pichappan, P., Ahmadi, H., Ariwa, E. (eds) Innovative Computing Technology. INCT 2011. Communications in Computer and Information Science, vol 241. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-27337-7_40

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-27337-7_40

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-27336-0

  • Online ISBN: 978-3-642-27337-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics