Mechanics and Quality of Agent-Informational Clustering in Social Networks

  • Daria A. YakovlevaEmail author
  • Olga A. TsukanovaEmail author
Conference paper
Part of the Lecture Notes on Data Engineering and Communications Technologies book series (LNDECT, volume 41)


The present paper is devoted to the study of the mechanics of agent-informational clustering in a social network on the example of user segmentation tasks taking into account an influence criterion. The main features of data generated by social networks (social big data) and metrics that characterize influential network nodes are considered. A review of community-building algorithms based on the theory of social networks, as well as clustering methods based on machine learning, is carried out. Metrics for assessing the quality of segmentation are presented. The results of the application of methods (selected on the basis of the performed analysis) to a test dataset are shown. The limitations of the applicability of considered approaches and possible problems during the implementation of algorithms in the field of social network analysis are described. Evaluation of the effectiveness is performed.


Social networks Social big bata Clustering Segmentation 


  1. 1.
    How Much Data Do We Create Every Day? The Mind-Blowing Stats Everyone Should Read. The Forbes. Accessed Feb 2019
  2. 2.
    Digital in 2018: world’s Internet users pass 4 billion mark. We are social. Accessed Feb 2019
  3. 3.
    Definition of social network. Merriam-Webster dictionary. Accessed Feb 2019
  4. 4.
  5. 5.
    The Network Science Book. Albert-Laszlo Barabashi. Accessed Feb 2019
  6. 6.
    Oussous, A., Benjelloun, F.Z.: Big data technologies: a survey. J. King Saud Univ. Comput. Inf. Sci. 30, 431–448 (2018)Google Scholar
  7. 7.
    What is Big Data? – A definition with five Vs. The * umBlog. Accessed Feb 2019
  8. 8.
    Definition of segment. Oxford dictionary. Accessed Mar 2019
  9. 9.
    Definition of community. Oxford dictionary. Accessed Mar 2019
  10. 10.
    Williams, K.: Social networks and social capital. J. Commun. Inform. (2017)Google Scholar
  11. 11.
    Community vs Social Network. Khoros Community. Accessed Mar 2019
  12. 12.
    Newman, M.E.J., Girvan, M.: Finding and evaluating community structure in networks (2004)Google Scholar
  13. 13.
    Usama, M., Qadir, J., Raza, A.: Techniques, applications and research challenges (2017)Google Scholar
  14. 14.
    Keeling, M.J., Eames, K.T.D.: Networks and epidemics models. J. R. Soc. Interface (2005)Google Scholar
  15. 15.
    The EU General Data Protection Regulation (GDPR) is the most important change in data privacy regulation in 20 years. EU GDPR.ORG. Accessed Mar 2019
  16. 16.
    What is personal data? European Commission. Accessed Mar 2019
  17. 17.
    Dataset YouTube 2. Arizona State University. Accessed Apr 2019
  18. 18.
    Benchmark Performance and Scaling of Python Clustering Algorithms. HDBSCAN documentation. Accessed Mar 2019
  19. 19.
    Comparing Python Clustering Algorithms. HDBSCAN documentation. Accessed Mar 2019

Copyright information

© Springer Nature Switzerland AG 2020

Authors and Affiliations

  1. 1.National Research University Higher School of EconomicsMoscowRussia

Personalised recommendations