Skip to main content

Social Network Sentiment Analysis and Message Clustering

  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 11938))

Abstract

Till today, classification of documents into negative, neutral, or positive remains a key task within the analysis of text tonality/sentiment. There are several methods for the automatic analysis of text sentiment. The method based on network models, the most linguistically sound, to our viewpoint, allows us take into account the syntagmatic connections of words. Also, it utilizes the assumption that not all words in a text are equivalent; some words have more weight and cast higher impact upon the tonality of the text than others. We see it natural to represent a text as a network for sentiment studies, especially in the case of short texts where grammar structures play a higher role in formation of the text pragmatics and the text cannot be seen as just “a bag of words”. We propose a method of text analysis that combines using a lexical mask and an efficient clustering mechanism. In this case, cluster analysis is one of the main methods of typology which demands obtaining formal rules for calculating the number of clusters. The choice of a set of clusters and the moment of completion of the clustering algorithm depend on each other. We show that cluster analysis of data from an n-dimensional vector space using the “single linkage” method can be considered a discrete random process. Sequences of “minimum distances” define the trajectories of this process. “Approximation-estimating test” allows establishing the Markov moment of the completion of the agglomerative clustering process.

This research in the part of data interpretation and literature review has been supported by the Russian Science Foundation grant 16-18-10125-P (2016–2018, prolongued for 2019–2020).

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Pang, B., Lee, L.: Opinion mining and sentiment analysis. Found. Trends Inf. Retrieval 2, 1–135 (2008)

    Article  Google Scholar 

  2. Liu, B.: Sentiment analysis and subjectivity. In: Indurkhyaé, N., Damerau, F.J. (eds.) Handbook of Natural Language Processing, vol. 2, pp. 627–666 (2010)

    Google Scholar 

  3. Turney, P.: Thumbs up or thumbs down? semantic orientation applied to unsupervised classification of reviews. In: Proceedings Association for Computational Linguistics, pp. 417–424 (2002). arXiv:cs.LG/0212032

  4. Pang, B., Lee, L., Vaithyanathan, S.: Thumbs up? sentiment classification using machine learning techniques. Proc. EMNLP 10, 79–86 (2002)

    Article  Google Scholar 

  5. Bodrunova, S.S., Blekanov, I.S., Smoliarova, A., Litvinenko, A.: Beyond left and right: real-world political polarization in Twitter discussions on inter-ethnic conflicts. Media Commun. 7(3), 119–132 (2019)

    Article  Google Scholar 

  6. Bollen, J., Mao, H., Zeng, X.-J.: Twitter mood predicts the stock market. J. Comput. Sci. 2(1), 1–8 (2011)

    Article  Google Scholar 

  7. Pang, B., Lee, L.: Seeing stars: exploiting class relationships for sentiment categorization with respect to rating scales. In: Proceedings 43rd Annual Meeting Association for Computational Linguistics, University of Michigan, USA, June 25-30, 115–124 (2005)

    Google Scholar 

  8. Snyder, B., Barzilay, R.: Multiple aspect ranking using the good grief algorithm. In: Proceedings Joint Human Language Technology/North American Chapter of the ACL Conference (HLT-NAACL), pp. 300-307 (2007)

    Google Scholar 

  9. Thelwall, M., Buckley, K., Paltoglou, G., Cai, D., Kappas, A.: Sentiment strength detection in short informal text. J. Am. Soc. Inf. Sci. Technol. 61(12), 2544–2558 (2010)

    Article  Google Scholar 

  10. Su, F., Markert, K.: From words to senses: a case study in subjectivity recognition. In: Proceedings 22nd International Conference on Computational Linguistics, vol. 1, pp. 825–832, ACL (2008)

    Google Scholar 

  11. Pang, B., Lee, L.: A sentimental education: sentiment analysis using subjectivity. In: Proceedings 42nd Annual Meeting, Association for Computational Linguistics, pp. 271–278 (2004)

    Google Scholar 

  12. Washington, E.: Human Sentiment Analysis. Growingsocialmedia.com. Last accessed 14 Nov 2013

    Google Scholar 

  13. Pandey, A.C., Rajpoot, D.S., Saraswat, M.: Twitter sentiment analysis using hybrid cuckoo search method. Inf. Process. Manag. 53(4), 764–779 (2017)

    Article  Google Scholar 

  14. Pazelskaya, A., Soloviev, A.: Method of determining emotions in texts in Russian. In: The International Conference on Computational Linguistics and Intellectual Technologies “Dialogue-2011”, Moscow, 510–522 (2011) [in Russian]

    Google Scholar 

  15. Chetviorkin, I., Loukachevitch, N.: Extraction of russian sentiment lexicon for product meta-domain. Proc. COLING 2012, 593–610 (2012)

    Google Scholar 

  16. Klekovkina, M.V., Kotelnikov, E.V.: The method of automatic classification of texts by tonality, based on the dictionary of emotional vocabulary. Trudy 14–i Vserossiyskoi nauchnoi konferentsii “Elektronnye biblioteki: Perspektivnye metody i tekhnologii, elektronnye kollektsii”, pp. 81–86 (2012) [in Russian]

    Google Scholar 

  17. Ustalov, D.A.: Extraction of terms from the Russian texts using the graph models. In: Proceedings Conference Graphs Theory and Applications, pp. 62–69 (2012). [in Russian]

    Google Scholar 

  18. Kobayashi, N., Iida, R., Inui, K., Matsumoto, Y.: Opinion mining on the web by extracting subject-aspect-evaluation relations. In: AAAI Spring Symposium: Computational Approaches to Analyzing Weblogs, pp. 86–91 (2006)

    Google Scholar 

  19. Ogneva, M.: How Companies Can Use Sentiment Analysis to Improve Their Business. Mashable.com. Last accessed 13 Dec 2012

    Google Scholar 

  20. Rentoumi, V., Giannakopoulos, G., Karkaletsis, V., Vouros, G.A.: Sentiment analysis of figurative language using a word sense disambiguation approach. In: Proceedings International Conference RANLP-2009, pp. 370–375 (2009)

    Google Scholar 

  21. von Stieglitz, S., Dang-Xuan, L., Bruns, A., Neuberger, C.: Social media analytics-an interdisciplinary approach and its implications for information systems. Bus. Inf. Syst. Eng. 6(2), 89–96 (2014)

    Article  Google Scholar 

  22. Girvan, M., Newman, M.E.: Community structure in social and biological networks. Proc. Nat. Acad. Sci. 99(12), 7821–7826 (2002)

    Article  MathSciNet  Google Scholar 

  23. Deitrick, W., Hu, W.: Mutually enhancing community detection and sentiment analysis on twitter networks. J. Data Anal. Inf. Process. 1(03), 19–29 (2013)

    Google Scholar 

  24. Xia, R., Zong, C., Li, S.: Ensemble of feature sets and classification algorithms for sentiment classification. Inf. Sci. 181(6), 1138–1152 (2011)

    Article  Google Scholar 

  25. Li, N., Wu, D.D.: Using text mining and sentiment analysis for online forums hotspot detection and forecast. Dec. Support Syst. 48(2), 354–368 (2010)

    Article  Google Scholar 

  26. Scheible, C.: Sentiment translation through lexicon induction. In: Proceedings ACL 2010 Student Research Workshop, pp. 25–30 (2010)

    Google Scholar 

  27. Burscher, B., Vliegenthart, R., Vreese, C.H.D.: Frames beyond words: applying cluster and sentiment analysis to news coverage of the nuclear power issue. Soc. Sci. Comput. Rev. 34(5), 530–545 (2016)

    Article  Google Scholar 

  28. Cambria, E., Poria, S., Bajpai, R., Schuller, B.: SenticNet 4: a semantic resource for sentiment analysis based on conceptual primitives. In: Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers, pp. 2666–2677 (2016)

    Google Scholar 

  29. Kang, G.J., et al.: Semantic network analysis of vaccine sentiment in online social media. Vaccine 35(29), 3621–3638 (2017)

    Article  Google Scholar 

  30. Doerfel, M.L.: What constitutes semantic network analysis? a comparison of research and methodologies. Connections 21(2), 16–26 (1998)

    Google Scholar 

  31. Kharlamov, A.A., Yermolenko, T.V.: Neuronetwork Environment (Neuromorphic Associative Memory) for Information Complexity Overcoming. Searching of Sense in Weak Structured Information Arrays. Part II. Information Processing in Hyppocampus. Informational Technology. 21(11), pp. 883–889 (2015) [in Russian]

    Google Scholar 

  32. Orekhov, A.V., Kharlamov, A.A., Bodrunova, S.S.: Network presentation of texts and clustering of messages. In: Proceedings International conference on Internet Science 2019 (INSCI2019), in this volume (2019)

    Chapter  Google Scholar 

  33. Ore, O.: Theory of Graphs. American Mathematical Society, Providence (1983)

    MATH  Google Scholar 

  34. Kharlamov, A.A., Yermolenko, T.V.: Text analysis: linguistics, semantics, pragmatics in the cognitive approach. Int. Sci. J. “Manag. Syst. Mach.” 6(250), 29–33 (2015). [in Ukrainian]

    Google Scholar 

  35. Kharlamov, A.A.: Formation of the n-grammatic thematic model of the text. Speech Technol. Number 1–2, 15–23 (2016). [in Russian]

    Google Scholar 

  36. Orekhov, A.V.: Markov moment for the agglomerative method of clustering in Euclidean space. Bulletin of Saint Petersburg University, Series Applied Mathematics. Computer Science. Control Processes, 15(1), 76–92 (2019). https://doi.org/10.21638/11702/spbu10.2019.106

  37. Orekhov, A.V.: Agglomerative method for texts clustering. In: Bodrunova, S.S., et al. (eds.) INSCI 2018. LNCS, vol. 11551, pp. 19–32. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-17705-8_2

    Chapter  Google Scholar 

  38. Aldenderfer, M.S., Blashfield, R.K.: Cluster Analysis. Sage Publications, Thousand Oaks (1984)

    Book  Google Scholar 

  39. Wald, A.: Sequential Analysis. John Wiley & Sons, Hoboken (1947)

    MATH  Google Scholar 

  40. Sirjaev, A.N.: Statistical sequential analysis: optimal stopping rules. Am. Math. Soc. 38, 174 (1973)

    Google Scholar 

  41. Orekhov, A.V.: Criterion for estimation of stress-deformed state of SD-materials. AIP Conf. Proc. 1959, 070028 (2018). https://doi.org/10.1063/1.5034703

    Article  Google Scholar 

  42. Orekhov, A.V.: Approximation-evaluation tests for a stress-strain state of deformable solids. Bull. St. Petersburg Univ. Ser. Appl. Math. Comput. Sci. Control Process. 14(3), 230–242 (2018). https://doi.org/10.21638/11702/spbu10.2018.304

    MathSciNet  Google Scholar 

  43. Granichin, O.N., Shalymov, D.S., Avros, R., Volkovich, Z.: A randomized algorithm for finding the number of clusters. Automation and remote control 86–98, (2011). [in Russian]

    Google Scholar 

  44. Shalymov, D.S.: Randomized method for determining the number of clusters on a data set. Sci. Tech. Bull. St. Petersburg State Univ. Inf. Technol., Mech. Optics 5(63), 111–116 (2009). [in Russian]

    Google Scholar 

  45. Shalymov, D.S.: Algorithms for sustainable clustering based on index functions and stability functions. Stoch. Optimizat. Comput. Sci. 4(1–1), 236–248 (2008). [in Russian]

    Google Scholar 

Download references

Acknowledgements

This research in the part of data interpretation and literature review has been supported by the Russian Science Foundation grant 16-18-10125-P (2016–2018, prolongued for 2019–2020).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Andrey V. Orekhov .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Kharlamov, A.A., Orekhov, A.V., Bodrunova, S.S., Lyudkevich, N.S. (2019). Social Network Sentiment Analysis and Message Clustering. In: El Yacoubi, S., Bagnoli, F., Pacini, G. (eds) Internet Science. INSCI 2019. Lecture Notes in Computer Science(), vol 11938. Springer, Cham. https://doi.org/10.1007/978-3-030-34770-3_2

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-34770-3_2

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-34769-7

  • Online ISBN: 978-3-030-34770-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics