Stance Prediction for Russian: Data and Analysis

  • Conference paper
Proceedings of 6th International Conference in Software Engineering for Defence Applications (SEDA 2018)

Abstract

Stance detection is a critical component of rumour and fake news identification. It involves extracting the stance that an author takes towards a given claim, where both the stance and the claim are expressed in text. This paper investigates stance classification for Russian. It introduces a new dataset, RuStance, of Russian tweets and news comments drawn from multiple sources and covering multiple stories, and it evaluates text classification approaches to stance detection as benchmarks on this data. As well as presenting this openly available dataset, the first of its kind for Russian, the paper presents a baseline for stance prediction in the language.
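To make the task framing concrete, the following is a minimal sketch of stance detection treated as text classification over (claim, reply) pairs. It is not the paper's method or its baseline: the toy English examples, the three labels shown, the `c:`/`r:` feature prefixes, and the function names `features`, `train`, and `predict` are all assumptions introduced for illustration, and the classifier is a plain add-one-smoothed multinomial Naive Bayes written with the standard library only.

```python
import math
import re
from collections import Counter, defaultdict

# Hypothetical toy data: (claim, reply, stance label). Not taken from RuStance.
CLAIM = "The ministry published real evidence"
TRAIN = [
    (CLAIM, "yes this is confirmed and true", "support"),
    (CLAIM, "confirmed by several sources, true", "support"),
    (CLAIM, "this is fake, a screenshot from a game", "deny"),
    (CLAIM, "fake, not true at all", "deny"),
    (CLAIM, "is there a source for this?", "query"),
    (CLAIM, "where is the source? any link?", "query"),
]

def tokenize(text):
    """Lowercase the text and split it into word tokens."""
    return re.findall(r"\w+", text.lower())

def features(claim, reply):
    """Prefix tokens so claim words and reply words stay distinct features."""
    return ["c:" + t for t in tokenize(claim)] + ["r:" + t for t in tokenize(reply)]

def train(examples):
    """Count labels and per-label token frequencies."""
    label_counts = Counter()
    token_counts = defaultdict(Counter)
    vocab = set()
    for claim, reply, label in examples:
        label_counts[label] += 1
        for tok in features(claim, reply):
            token_counts[label][tok] += 1
            vocab.add(tok)
    return label_counts, token_counts, vocab

def predict(model, claim, reply):
    """Return the label with the highest smoothed log-probability."""
    label_counts, token_counts, vocab = model
    total = sum(label_counts.values())
    best_label, best_score = None, -math.inf
    for label in sorted(label_counts):
        score = math.log(label_counts[label] / total)
        denom = sum(token_counts[label].values()) + len(vocab)
        for tok in features(claim, reply):
            if tok in vocab:  # tokens unseen in training are skipped
                score += math.log((token_counts[label][tok] + 1) / denom)
        if score > best_score:
            best_label, best_score = label, score
    return best_label

model = train(TRAIN)
print(predict(model, CLAIM, "this looks fake"))  # prints: deny
```

The point of the sketch is the input and output shape: each instance pairs a claim with a response, and the output is a single stance label for the response. A realistic system for Russian would need appropriate tokenisation and far more data than this toy set.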

Author information

Corresponding authors

Correspondence to Nikita Lozhnikov, Leon Derczynski or Manuel Mazzara.

Copyright information

© 2020 Springer Nature Switzerland AG

About this paper

Cite this paper

Lozhnikov, N., Derczynski, L., Mazzara, M. (2020). Stance Prediction for Russian: Data and Analysis. In: Ciancarini, P., Mazzara, M., Messina, A., Sillitti, A., Succi, G. (eds) Proceedings of 6th International Conference in Software Engineering for Defence Applications. SEDA 2018. Advances in Intelligent Systems and Computing, vol 925. Springer, Cham. https://doi.org/10.1007/978-3-030-14687-0_16
