Skip to main content

Building Profiles of Blog Users Based on Comment Graph Analysis: The Habrahabr.ru Case

  • Conference paper
  • First Online:
Analysis of Images, Social Networks and Texts (AIST 2015)

Abstract

Our study is aimed at developing a language-independent tool for building user profiles of online community users. To that end the definition of a comment graph, a convenient representation of users interaction, is studied. The set of comment graph characteristics for users that form the basis of the profiling techniques is suggested. Finally, the user profiling method based on cluster analysis is presented. The described method was applied to Habrahabr data set.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    http://habrahabr.ru/.

References

  1. Pennachiotti, M.A.: Machine learning approach to twitter user classification. In: Fifth International AAAI Conference on Weblogs and Social Media, p. 45 (2011)

    Google Scholar 

  2. Kazushi, I.: Twitter user profiling based on text and community mining for market analysis. Knowl.-Based Syst. 51, 35–47 (2013)

    Article  Google Scholar 

  3. Shinsuke, N.: Discovering important bloggers based on analyzing blog threads. In: 2nd Annual Workshop on the Webblogging Ecosystem (2005)

    Google Scholar 

  4. Rocha, E.: User profiling on Twitter. Semant. Web: Interoperability, Usability, Applicability 1(1), 105–110 (2011)

    Google Scholar 

  5. Santosh, R.: Author profiling: predicting age and gender from blogs. In: PAN at CLEF (2013)

    Google Scholar 

  6. Balog, K., Fang, Y., de Rijke, M., Serdyukov, P., Si, L.: Expertise retrieval. Found. Trends Inf. Retrieval 6(2–3), 127–256 (2012)

    Article  Google Scholar 

  7. MacQueen, J.B.: Some methods for classification and analysis of multivariate observations. In: Proceedings of 5th Berkeley Symposium on Mathematical Statistics and Probability. University of California Press, pp. 281–297 (1967)

    Google Scholar 

  8. Rousseeuw, P.: Silhouettes: a graphical aid to the interpretation and validation of cluster analysis. J. Comput. Appl. Math. 20(1), 53–65 (1987)

    Article  MATH  Google Scholar 

  9. Krasnov, F., Yavorskiy, R.: Measurement of maturity level of a professional community. Bus. Inf. 1(23), 64–67 (2013). (in Russian)

    Google Scholar 

  10. Yavorskiy, R., Vlasova, E., Krasnov, F.: Connectivity analysis of computer science centers based on scientific publications data for major Russian cities. In: Procedia Computer Science, vol. 31, pp. 892–899 (2014)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Rostislav Yavorskiy .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Barysheva, A., Petrov, M., Yavorskiy, R. (2015). Building Profiles of Blog Users Based on Comment Graph Analysis: The Habrahabr.ru Case. In: Khachay, M., Konstantinova, N., Panchenko, A., Ignatov, D., Labunets, V. (eds) Analysis of Images, Social Networks and Texts. AIST 2015. Communications in Computer and Information Science, vol 542. Springer, Cham. https://doi.org/10.1007/978-3-319-26123-2_25

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-26123-2_25

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-26122-5

  • Online ISBN: 978-3-319-26123-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics