Abstract
Some structural characteristics of online discussions have been successfully modeled in the recent years. When parameters of these models are properly estimated, the models are able to generate synthetic discussions that are structurally similar to the real discussions. A common aspect of these models is that they consider that all users behave according to the same model. In this paper, we combine a growth model with an Expectation–Maximization algorithm that finds different parameters for different latent groups of users. We use this method to find the different roles that coexist in the community. Moreover, we analyze whether we can predict users behaviors based on their roles. Indeed, we show that predictions are improved for some of the roles when compared with a simple growth model.
Similar content being viewed by others
Notes
In practice, \(\mathbf{X }\) may be represented as a matrix of feature vectors \({\mathbf{x }}_i\) that makes the computing of the log-likelihood easy to vectorize in some programming languages.
In general, \(f({\mathbb {E}}[X]) \ge {\mathbb {E}}[f(X)]\) where f is a concave function.
References
Agarwal N, Liu H, Tang L, Yu PS (2008) Identifying the influential bloggers in a community. In: Proceedings of the international conference on web search and web data mining—WSDM ’08, New York, NY, USA. ACM Press, p 207
Angeletou S, Rowe M, Alani H (2011) Modelling and analysis of user behaviour in online communities. In: Proceedings of the 10th international semantic web conference, pp 35–50
Aragón P, Gómez V, Kaltenbrunner A (2017) To thread or not to thread: the impact of conversation threading on online discussion. In: 11th international AAAI conference on web and social media. The AAAI Press
Barabási A-L, Albert R (1999) Emergence of scaling in random networks. Science 286(October):509–512
Bensmail H, Celeux G, Raftery A, Robert C (1997) Inference in model-based cluster analysis. Stat Comput 7:1–10
Buntain C, Golbeck J (2014) Identifying social roles in reddit using network structure. In: Proceedings of the companion publication of the 23rd international conference on World wide web companion, pp 615–620
Chan J, Hayes, C, Daly E (2010) Decomposing discussion forums using common user roles. In: Proceedings of the WebSci10: extending the frontiers of society on-line
Cheng J, Danescu-Niculescu-Mizil C, Leskovec J (2015) Antisocial behavior in online discussion communities. In: AAAI international conference on weblogs and social media. AAAI Press, pp 61–70
Choobdar S, Ribeiro P, Silva F (2017) Evolutionary role mining in complex networks by ensemble clustering. In: Proceedings of the symposium on applied computing, SAC ’17, New York, NY, USA. ACM, pp 1053–1060
Forestier M, Velcin J, Stavrianou A, Zighed D (2012) Extracting celebrities from online discussions. In: Proceedings of the 2012 IEEE/ACM international conference on advances in social networks analysis and mining, ASONAM 2012, pp 322–326
Golder SA (2003) A typology of social roles in usenet. Ph.D. thesis, Harvard University
Gómez V, Kappen HJ, Kaltenbrunner A (2010) Modeling the structure and evolution of discussion cascades. In: Proceedings of the 22nd ACM conference on hypertext and hypermedia, pp 181–190
Gómez V, Kappen HJ, Litvak N, Kaltenbrunner A (2012) A likelihood-based framework for the analysis of discussion threads. World Wide Web 16(5–6):645–675
Goyal A, Bonchi F, Lakshmanan LV (2008). Discovering leaders from community actions. In: Proceeding of the 17th ACM conference on information and knowledge mining—CIKM ’08, New York, NY, USA. ACM Press, p 499
Himelboim I, Gleave E, Smith M (2009) Discussion catalysts in online political discussions: content importers and conversation starters. J Comput Med Commun 14(4):771–789
Kolaczyk ED (2009) Statistical analysis of network data: methods and models. Springer, New York
Kumar R, Mahdian M, McGlohon M (2010) Dynamics of conversations. In: Proceedings of the 16th ACM SIGKDD international conference on knowledge discovery and data mining, pp 553–562
Kumar S, Spezzano F, Subrahmanian VS (2014) Accurately detecting trolls in Slashdot Zoo via decluttering. In: Proceedings of the 2014 IEEE/ACM international conference on advances in social networks analysis and mining, pp 188–195
Lui M, Baldwin T (2010) Classifying user forum participants: separating the gurus from the hacks, and other tales of the internet. In: Proceedings of Australasian language technology association workshop, pp 49–57
Nelder J, Mead R, Nelder BJ, Mead R (1965) A simplex method for function minimization. Comput J 7(4):308–313
Nolker RD, Zhou L (2005) Social computing and weighting to identify member roles in online communities. In: The 2005 IEEE/WIC/ACM international conference on web intelligence (WI’05). IEEE, pp 87–93
Rowe M, Fernandez M, Angeletou S, Alani H (2013) Community analysis through semantic rules and role composition derivation. Web Semant Sci Serv Agents World Wide Web 18(1):31–47
Wang C, Ye M, Huberman BA (2012) From user comments to on-line conversations. In: Proceedings of the 18th ACM SIGKDD international conference on knowledge discovery and data mining, pp 244–252
White A, Chan J, Hayes C, Murphy BT (2012) Mixed Membership models for exploring user roles in online fora. In: Proceedings of the 6th annual international conference on weblogs and social media—ICWSM2012, pp 599–602
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Lumbreras, A., Jouve, B., Velcin, J. et al. Role detection in online forums based on growth models for trees. Soc. Netw. Anal. Min. 7, 49 (2017). https://doi.org/10.1007/s13278-017-0472-z
Received:
Revised:
Accepted:
Published:
DOI: https://doi.org/10.1007/s13278-017-0472-z