Skip to main content

On Attributed Community Search

  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 10731))

Abstract

Communities, which are prevalent in attributed graphs (e.g., social networks and knowledge bases) can be used in emerging applications such as product advertisement and setting up of social events. Given a graph G and a vertex \(q \in G\), the community search (CS) query returns a subgraph of G that contains vertices related to q. In this article, we study CS over two common attributed graphs, where (1) vertices are associated with keywords; and (2) vertices are augmented with locations. For keyword-based attributed graphs, we investigate the keyword-based attributed community (or KAC) query, which returns a KAC for a query vertex. A KAC satisfies both structure cohesiveness (i.e., its vertices are tightly connected) and keyword cohesiveness (i.e., its vertices share common keywords). For spatial-based attributed graphs, we aim to find the spatial-aware community (or SAC), whose vertices are close structurally and spatially, for a query vertex in an online manner. To enable efficient KAC search and SAC search, we propose efficient query algorithms. We also perform experimental evaluation on large real datasets, and the results show that our methods achieve higher effectiveness than the state-of-the-art community retrieval algorithms. Moreover, our solutions are faster than baseline approaches. In addition, we develop the C-Explorer system to assist users in extracting, visualizing, and analyzing KACs.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   44.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   60.00
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

  1. 1.

    https://www.meetup.com/.

  2. 2.

    Without ambiguity, all the attributed graphs mentioned in this section refer to keyword-based attributed graphs.

  3. 3.

    All the proofs of lemmas in this article can be found in [13].

  4. 4.

    All the pseudocodes of algorithms in this article can be found in [13].

  5. 5.

    We use “node” to mean “CL-tree node” in Sect. 3.

  6. 6.

    For simplicity, in this section we call spatial-based attributed graphs spatial graphs.

  7. 7.

    To avoid ambiguity, we use word “node” for tree nodes in Sect. 4.

References

  1. Barbieri, N., Bonchi, F., Galimberti, E., Gullo, F.: Efficient and effective community search. DMKD 29(5), 1406–1433 (2015)

    MathSciNet  Google Scholar 

  2. Barthélemy, M.: Spatial networks. Phys. Rep. 499(1), 1–101 (2011)

    Article  MathSciNet  Google Scholar 

  3. Batagelj, V., Zaversnik, M.: An o(m) algorithm for cores decomposition of networks. arXiv (2003)

    Google Scholar 

  4. Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent Dirichlet allocation. J. Mach. Learn. Res. 3, 993–1022 (2003)

    MATH  Google Scholar 

  5. Bollobás, B.: The evolution of random graphs. Trans. Am. Math. Soc. 286(1), 257–274 (1984)

    Article  MathSciNet  MATH  Google Scholar 

  6. Chen, Y., Jun, X., Minzheng, X.: Finding community structure in spatially constrained complex networks. IJGIS 29(6), 889–911 (2015)

    Google Scholar 

  7. Cohen, J.: Trusses: cohesive subgraphs for social network analysis. National Security Agency Technical Report, p. 16 (2008)

    Google Scholar 

  8. Cui, W., Xiao, Y., Wang, H., Lu, Y., Wang, W.: Online search of overlapping communities. In: SIGMOD, pp. 277–288 (2013)

    Google Scholar 

  9. Cui, W., Xiao, Y., Wang, H., Wang, W.: Local search of communities in large graphs. In: SIGMOD, pp. 991–1002 (2014)

    Google Scholar 

  10. Elzinga, D.J., Hearn, D.W.: The minimum covering sphere problem. Manage. Sci. 19(1), 96–104 (1972)

    Article  MathSciNet  MATH  Google Scholar 

  11. Elzinga, D.J., Hearn, D.W.: Geometrical solutions for some minimax location problems. Transp. Sci. 6(4), 379–394 (1972)

    Article  MathSciNet  Google Scholar 

  12. Expert, P., et al.: Uncovering space-independent communities in spatial networks. PNAS 108(19), 7663–7668 (2011)

    Article  MATH  Google Scholar 

  13. Fang, Y.: Effective and efficient community search over large attributed graphs. HKU Ph.D. thesis, September 2017

    Google Scholar 

  14. Fang, Y., Cheng, R., Chen, Y., Luo, S., Hu, J.: Effective and efficient attributed community search. VLDB J. 26(6), 803–828 (2017)

    Article  Google Scholar 

  15. Fang, Y., Cheng, R., Li, X., Luo, S., Hu, J.: Effective community search over large spatial graphs. PVLDB 10(6), 709–720 (2017)

    Google Scholar 

  16. Fang, Y., Cheng, R., Luo, S., Hu, J.: Effective community search for large attributed graphs. PVLDB 9(12), 1233–1244 (2016)

    Google Scholar 

  17. Fang, Y., Cheng, R., Luo, S., Hu, J., Huang, K.: C-explorer: browsing communities in large graphs. PVLDB 10(12), 1885–1888 (2017)

    Google Scholar 

  18. Fang, Y., Cheng, R., Tang, W., Maniu, S., Yang, X.: Scalable algorithms for nearest-neighbor joins on big trajectory data. TKDE 28(3), 785–800 (2016)

    Google Scholar 

  19. Fang, Y., Cheng, R., Tang, W., Maniu, S., Yang, X.S.: Scalable algorithms for nearest-neighbor joins on big trajectory data. In: ICDE, pp. 1528–1529 (2016)

    Google Scholar 

  20. Fang, Y., Zhang, H., Ye, Y., Li, X.: Detecting hot topics from Twitter: a multiview approach. J. Inf. Sci. 40(5), 578–593 (2014)

    Article  Google Scholar 

  21. Finkel, R.A., Bentley, J.L.: Quad trees: a data structure for retrieval on composite keys. Acta Informatica 4(1), 1–9 (1974)

    Article  MATH  Google Scholar 

  22. Fortunato, S.: Community detection in graphs. Phys. Rep. 486(3), 75–174 (2010)

    Article  MathSciNet  Google Scholar 

  23. Gaertler, M., Patrignani, M.: Dynamic analysis of the autonomous system graph. In: IPS, pp. 13–24 (2004)

    Google Scholar 

  24. Gibbons, A.: Algorithmic Graph Theory. Cambridge University Press, Cambridge (1985)

    MATH  Google Scholar 

  25. Girvan, M., Newman, M.E.J.: Community structure in social and biological networks. PNAS 99(12), 7821–7826 (2002)

    Article  MathSciNet  MATH  Google Scholar 

  26. Guo, D.: Regionalization with dynamically constrained agglomerative clustering and partitioning (redcap). IJGIS 22(7), 801–823 (2008)

    Google Scholar 

  27. Guo, T., Cao, X., Cong, G.: Efficient algorithms for answering the m-closest keywords query. In: SIGMOD, pp. 405–418. ACM (2015)

    Google Scholar 

  28. Han, J., Pei, J., Yin, Y.: Mining frequent patterns without candidate generation. In: SIGMOD (2000)

    Google Scholar 

  29. Hu, J., Cheng, R., Huang, Z., Fang, Y., Luo, S.: On embedding uncertain graphs. In: CIKM. ACM (2017)

    Google Scholar 

  30. Hu, J., Wu, X., Cheng, R., Luo, S., Fang, Y.: Querying minimal Steiner maximum-connected subgraphs in large graphs. In: CIKM, pp. 1241–1250 (2016)

    Google Scholar 

  31. Hu, J., Xiaowei, W., Cheng, R., Luo, S., Fang, Y.: On minimal steiner maximum-connected subgraph queries. TKDE 29(11), 2455–2469 (2017)

    Google Scholar 

  32. Huang, X., Cheng, H., Qin, L., Tian, W., Yu, J.X.: Querying k-truss community in large and dynamic graphs. In: SIGMOD (2014)

    Google Scholar 

  33. Huang, X., Lakshmanan, L.V.S., Yu, J.X., Cheng, H.: Approximate closest community search in networks. PVLDB 9(4), 276–287 (2015)

    Google Scholar 

  34. Kim, Y., Son, S.-W., Jeong, H.: Finding communities in directed networks. Phys. Rev. E 81(1), 016103 (2010)

    Article  Google Scholar 

  35. Leicht, E.A., Newman, M.E.J.: Community structure in directed networks. Phys. Rev. Lett. 100(11), 118703 (2008)

    Article  Google Scholar 

  36. Li, R.-H., Qin, L., Yu, J.X., Mao, R.: Influential community search in large networks. In: PVLDB (2015)

    Google Scholar 

  37. Li, Z., Fang, Y., Liu, Q., Cheng, J., Cheng, R., Lui, J.: Walking in the cloud: parallel simrank at scale. PVLDB 9(1), 24–35 (2015)

    Google Scholar 

  38. Liu, Y., Niculescu-Mizil, A., Gryc, W.: Topic-link LDA: joint models of topic and author community. In: ICML (2009)

    Google Scholar 

  39. Malliaros, F.D., Vazirgiannis, M.: Clustering and community detection in directed networks: a survey. Phys. Rep. 533(4), 95–142 (2013)

    Article  MathSciNet  MATH  Google Scholar 

  40. Megiddo, N.: Linear-time algorithms for linear programming in r3 and related problems. In: FOCS, pp. 329–338. IEEE (1982)

    Google Scholar 

  41. Nallapati, R.M., Ahmed, A., Xing, E.P., Cohen, W.W.: Joint latent topic models for text and citations. In: KDD (2008)

    Google Scholar 

  42. Newman, M.E.J., Girvan, M.: Finding and evaluating community structure in networks. Phys. Rev. E 69(2), 026113 (2004)

    Article  Google Scholar 

  43. Plantié, M., Crampes, M.: Survey on social community detection. In: Ramzan, N., van Zwol, R., Lee, J.S., Clüver, K., Hua, X.S. (eds.) Social Media Retrieval. Computer Communications and Networks, pp. 65–85. Springer, London (2013). https://doi.org/10.1007/978-1-4471-4555-4_4

    Chapter  Google Scholar 

  44. Ruan, Y., Fuhry, D., Parthasarathy, S.: Efficient community detection in large networks using content and links. In: WWW (2013)

    Google Scholar 

  45. Sachan, M., et al.: Using content and interactions for discovering communities in social networks. In: WWW, pp. 331–340 (2012)

    Google Scholar 

  46. Seidman, S.B.: Network structure and minimum degree. Soc. Netw. 5(3), 269–287 (1983)

    Article  MathSciNet  Google Scholar 

  47. Shakarian, P., et al.: Mining for geographically disperse communities in social networks by leveraging distance modularity. In: KDD, pp. 1402–1409 (2013)

    Google Scholar 

  48. Sozio, M., Gionis, A.: The community-search problem and how to plan a successful cocktail party. In: KDD (2010)

    Google Scholar 

  49. Xu, Z., Ke, Y., Wang, Y., Cheng, H., Cheng, J.: A model-based approach to attributed graph clustering. In: SIGMOD, pp. 505–516. ACM (2012)

    Google Scholar 

  50. Yang, J., McAuley, J., Leskovec, J.: Community detection in networks with node attributes. In: ICDM, pp. 1151–1156 (2013)

    Google Scholar 

  51. Yang, T., Jin, R., Chi, Y., Zhu, S.: Combining link and content for community detection: a discriminative approach. In: KDD (2009)

    Google Scholar 

  52. Yang, T., et al.: Directed network community detection: a popularity and productivity link model. In: SDM, pp. 742–753. SIAM (2010)

    Google Scholar 

  53. Zhang, W., et al.: Combining latent factor model with location features for event-based group recommendation. In: KDD, pp. 910–918. ACM (2013)

    Google Scholar 

  54. Zhou, Y., Cheng, H., Yu, J.F.: Graph clustering based on structural/attribute similarities. VLDB 2(1), 718–729 (2009)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Yixiang Fang .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer International Publishing AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Fang, Y., Cheng, R. (2018). On Attributed Community Search. In: Doulkeridis, C., Vouros, G., Qu, Q., Wang, S. (eds) Mobility Analytics for Spatio-Temporal and Social Data. MATES 2017. Lecture Notes in Computer Science(), vol 10731. Springer, Cham. https://doi.org/10.1007/978-3-319-73521-4_1

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-73521-4_1

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-73520-7

  • Online ISBN: 978-3-319-73521-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics