Abstract
Communities, which are prevalent in attributed graphs (e.g., social networks and knowledge bases), can be used in emerging applications such as product advertisement and the organization of social events. Given a graph G and a vertex \(q \in G\), a community search (CS) query returns a subgraph of G that contains vertices related to q. In this article, we study CS over two common types of attributed graphs, in which (1) vertices are associated with keywords, and (2) vertices are augmented with locations. For keyword-based attributed graphs, we investigate the keyword-based attributed community (KAC) query, which returns a KAC for a query vertex. A KAC satisfies both structure cohesiveness (i.e., its vertices are tightly connected) and keyword cohesiveness (i.e., its vertices share common keywords). For spatial-based attributed graphs, we aim to find, in an online manner, the spatial-aware community (SAC) of a query vertex, whose vertices are close both structurally and spatially. To enable efficient KAC and SAC search, we propose efficient query algorithms. We also perform an experimental evaluation on large real datasets; the results show that our methods achieve higher effectiveness than state-of-the-art community retrieval algorithms and are faster than baseline approaches. In addition, we develop the C-Explorer system to assist users in extracting, visualizing, and analyzing KACs.
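To make the query semantics concrete, the following is a minimal Python sketch of a community search over a toy attributed graph. It is not the indexed algorithm proposed in the article. It assumes that structure cohesiveness is measured by a minimum degree k (a connected k-core, one common choice) and that the attribute constraint is a fixed predicate supplied by the caller, which is a simplification of both the KAC and the SAC definitions. The names `community_search`, `adj`, `kw`, `loc`, and `keep` are hypothetical and chosen only for illustration.

```python
from collections import deque
from math import dist


def community_search(adj, q, k, keep=lambda v: True):
    """Return the connected k-core containing q among vertices passing `keep`.

    adj: dict mapping each vertex to the set of its neighbours (undirected).
    Returns a set of vertices, or an empty set if no such community exists.
    Assumption: minimum degree k stands in for structure cohesiveness.
    """
    # Restrict the graph to vertices that satisfy the attribute predicate.
    alive = {v for v in adj if keep(v)}
    if q not in alive:
        return set()
    deg = {v: sum(1 for u in adj[v] if u in alive) for v in alive}

    # Repeatedly peel vertices whose degree is below k.
    queue = deque(v for v in alive if deg[v] < k)
    while queue:
        v = queue.popleft()
        if v not in alive:
            continue  # already removed via an earlier queue entry
        alive.discard(v)
        for u in adj[v]:
            if u in alive:
                deg[u] -= 1
                if deg[u] < k:
                    queue.append(u)
    if q not in alive:
        return set()

    # Keep only the connected component that contains q.
    seen, frontier = {q}, deque([q])
    while frontier:
        v = frontier.popleft()
        for u in adj[v]:
            if u in alive and u not in seen:
                seen.add(u)
                frontier.append(u)
    return seen


# Toy undirected graph (hypothetical data).
adj = {
    "a": {"b", "c", "e"},
    "b": {"a", "c", "e"},
    "c": {"a", "b", "d"},
    "d": {"c"},
    "e": {"a", "b"},
}
kw = {"a": {"db", "graph"}, "b": {"db", "graph"}, "c": {"db", "graph"},
      "d": {"graph"}, "e": {"ml"}}
loc = {"a": (0, 0), "b": (1, 0), "c": (0, 1), "d": (5, 5), "e": (1, 1)}

# Keyword-constrained search: every member must carry the keyword "db".
kac = community_search(adj, "a", 2, keep=lambda v: "db" in kw[v])
print(sorted(kac))  # ['a', 'b', 'c']

# Spatially constrained search: every member lies within distance 2 of q.
sac = community_search(adj, "a", 2, keep=lambda v: dist(loc[v], loc["a"]) <= 2.0)
print(sorted(sac))  # ['a', 'b', 'c', 'e']
```

The attribute predicate is applied before peeling, so the degree constraint is evaluated only over vertices that already satisfy the keyword or distance condition. Vertex e joins the spatial result because it is near q, but it is excluded from the keyword result because it does not carry "db".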
Notes
- 1.
- 2. To avoid ambiguity, all attributed graphs mentioned in this section are keyword-based attributed graphs.
- 3. The proofs of all lemmas in this article can be found in [13].
- 4. The pseudocode of all algorithms in this article can be found in [13].
- 5. We use “node” to mean “CL-tree node” in Sect. 3.
- 6. For simplicity, in this section we refer to spatial-based attributed graphs as spatial graphs.
- 7. To avoid ambiguity, we use the word “node” for tree nodes in Sect. 4.
References
Barbieri, N., Bonchi, F., Galimberti, E., Gullo, F.: Efficient and effective community search. DMKD 29(5), 1406–1433 (2015)
Barthélemy, M.: Spatial networks. Phys. Rep. 499(1), 1–101 (2011)
Batagelj, V., Zaversnik, M.: An O(m) algorithm for cores decomposition of networks. arXiv (2003)
Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent Dirichlet allocation. J. Mach. Learn. Res. 3, 993–1022 (2003)
Bollobás, B.: The evolution of random graphs. Trans. Am. Math. Soc. 286(1), 257–274 (1984)
Chen, Y., Jun, X., Minzheng, X.: Finding community structure in spatially constrained complex networks. IJGIS 29(6), 889–911 (2015)
Cohen, J.: Trusses: cohesive subgraphs for social network analysis. National Security Agency Technical Report, p. 16 (2008)
Cui, W., Xiao, Y., Wang, H., Lu, Y., Wang, W.: Online search of overlapping communities. In: SIGMOD, pp. 277–288 (2013)
Cui, W., Xiao, Y., Wang, H., Wang, W.: Local search of communities in large graphs. In: SIGMOD, pp. 991–1002 (2014)
Elzinga, D.J., Hearn, D.W.: The minimum covering sphere problem. Manage. Sci. 19(1), 96–104 (1972)
Elzinga, D.J., Hearn, D.W.: Geometrical solutions for some minimax location problems. Transp. Sci. 6(4), 379–394 (1972)
Expert, P., et al.: Uncovering space-independent communities in spatial networks. PNAS 108(19), 7663–7668 (2011)
Fang, Y.: Effective and efficient community search over large attributed graphs. HKU Ph.D. thesis, September 2017
Fang, Y., Cheng, R., Chen, Y., Luo, S., Hu, J.: Effective and efficient attributed community search. VLDB J. 26(6), 803–828 (2017)
Fang, Y., Cheng, R., Li, X., Luo, S., Hu, J.: Effective community search over large spatial graphs. PVLDB 10(6), 709–720 (2017)
Fang, Y., Cheng, R., Luo, S., Hu, J.: Effective community search for large attributed graphs. PVLDB 9(12), 1233–1244 (2016)
Fang, Y., Cheng, R., Luo, S., Hu, J., Huang, K.: C-Explorer: browsing communities in large graphs. PVLDB 10(12), 1885–1888 (2017)
Fang, Y., Cheng, R., Tang, W., Maniu, S., Yang, X.: Scalable algorithms for nearest-neighbor joins on big trajectory data. TKDE 28(3), 785–800 (2016)
Fang, Y., Cheng, R., Tang, W., Maniu, S., Yang, X.S.: Scalable algorithms for nearest-neighbor joins on big trajectory data. In: ICDE, pp. 1528–1529 (2016)
Fang, Y., Zhang, H., Ye, Y., Li, X.: Detecting hot topics from Twitter: a multiview approach. J. Inf. Sci. 40(5), 578–593 (2014)
Finkel, R.A., Bentley, J.L.: Quad trees: a data structure for retrieval on composite keys. Acta Informatica 4(1), 1–9 (1974)
Fortunato, S.: Community detection in graphs. Phys. Rep. 486(3), 75–174 (2010)
Gaertler, M., Patrignani, M.: Dynamic analysis of the autonomous system graph. In: IPS, pp. 13–24 (2004)
Gibbons, A.: Algorithmic Graph Theory. Cambridge University Press, Cambridge (1985)
Girvan, M., Newman, M.E.J.: Community structure in social and biological networks. PNAS 99(12), 7821–7826 (2002)
Guo, D.: Regionalization with dynamically constrained agglomerative clustering and partitioning (redcap). IJGIS 22(7), 801–823 (2008)
Guo, T., Cao, X., Cong, G.: Efficient algorithms for answering the m-closest keywords query. In: SIGMOD, pp. 405–418. ACM (2015)
Han, J., Pei, J., Yin, Y.: Mining frequent patterns without candidate generation. In: SIGMOD (2000)
Hu, J., Cheng, R., Huang, Z., Fang, Y., Luo, S.: On embedding uncertain graphs. In: CIKM. ACM (2017)
Hu, J., Wu, X., Cheng, R., Luo, S., Fang, Y.: Querying minimal Steiner maximum-connected subgraphs in large graphs. In: CIKM, pp. 1241–1250 (2016)
Hu, J., Wu, X., Cheng, R., Luo, S., Fang, Y.: On minimal Steiner maximum-connected subgraph queries. TKDE 29(11), 2455–2469 (2017)
Huang, X., Cheng, H., Qin, L., Tian, W., Yu, J.X.: Querying k-truss community in large and dynamic graphs. In: SIGMOD (2014)
Huang, X., Lakshmanan, L.V.S., Yu, J.X., Cheng, H.: Approximate closest community search in networks. PVLDB 9(4), 276–287 (2015)
Kim, Y., Son, S.-W., Jeong, H.: Finding communities in directed networks. Phys. Rev. E 81(1), 016103 (2010)
Leicht, E.A., Newman, M.E.J.: Community structure in directed networks. Phys. Rev. Lett. 100(11), 118703 (2008)
Li, R.-H., Qin, L., Yu, J.X., Mao, R.: Influential community search in large networks. In: PVLDB (2015)
Li, Z., Fang, Y., Liu, Q., Cheng, J., Cheng, R., Lui, J.: Walking in the cloud: parallel simrank at scale. PVLDB 9(1), 24–35 (2015)
Liu, Y., Niculescu-Mizil, A., Gryc, W.: Topic-link LDA: joint models of topic and author community. In: ICML (2009)
Malliaros, F.D., Vazirgiannis, M.: Clustering and community detection in directed networks: a survey. Phys. Rep. 533(4), 95–142 (2013)
Megiddo, N.: Linear-time algorithms for linear programming in R^3 and related problems. In: FOCS, pp. 329–338. IEEE (1982)
Nallapati, R.M., Ahmed, A., Xing, E.P., Cohen, W.W.: Joint latent topic models for text and citations. In: KDD (2008)
Newman, M.E.J., Girvan, M.: Finding and evaluating community structure in networks. Phys. Rev. E 69(2), 026113 (2004)
Plantié, M., Crampes, M.: Survey on social community detection. In: Ramzan, N., van Zwol, R., Lee, J.S., Clüver, K., Hua, X.S. (eds.) Social Media Retrieval. Computer Communications and Networks, pp. 65–85. Springer, London (2013). https://doi.org/10.1007/978-1-4471-4555-4_4
Ruan, Y., Fuhry, D., Parthasarathy, S.: Efficient community detection in large networks using content and links. In: WWW (2013)
Sachan, M., et al.: Using content and interactions for discovering communities in social networks. In: WWW, pp. 331–340 (2012)
Seidman, S.B.: Network structure and minimum degree. Soc. Netw. 5(3), 269–287 (1983)
Shakarian, P., et al.: Mining for geographically disperse communities in social networks by leveraging distance modularity. In: KDD, pp. 1402–1409 (2013)
Sozio, M., Gionis, A.: The community-search problem and how to plan a successful cocktail party. In: KDD (2010)
Xu, Z., Ke, Y., Wang, Y., Cheng, H., Cheng, J.: A model-based approach to attributed graph clustering. In: SIGMOD, pp. 505–516. ACM (2012)
Yang, J., McAuley, J., Leskovec, J.: Community detection in networks with node attributes. In: ICDM, pp. 1151–1156 (2013)
Yang, T., Jin, R., Chi, Y., Zhu, S.: Combining link and content for community detection: a discriminative approach. In: KDD (2009)
Yang, T., et al.: Directed network community detection: a popularity and productivity link model. In: SDM, pp. 742–753. SIAM (2010)
Zhang, W., et al.: Combining latent factor model with location features for event-based group recommendation. In: KDD, pp. 910–918. ACM (2013)
Zhou, Y., Cheng, H., Yu, J.X.: Graph clustering based on structural/attribute similarities. PVLDB 2(1), 718–729 (2009)
Copyright information
© 2018 Springer International Publishing AG
About this paper
Cite this paper
Fang, Y., Cheng, R. (2018). On Attributed Community Search. In: Doulkeridis, C., Vouros, G., Qu, Q., Wang, S. (eds) Mobility Analytics for Spatio-Temporal and Social Data. MATES 2017. Lecture Notes in Computer Science(), vol 10731. Springer, Cham. https://doi.org/10.1007/978-3-319-73521-4_1
DOI: https://doi.org/10.1007/978-3-319-73521-4_1
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-73520-7
Online ISBN: 978-3-319-73521-4
eBook Packages: Computer Science (R0)