Stance Prediction for Russian: Data and Analysis

Lozhnikov, Nikita; Derczynski, Leon; Mazzara, Manuel

doi:10.1007/978-3-030-14687-0_16

Nikita Lozhnikov¹⁹,
Leon Derczynski²⁰ &
Manuel Mazzara¹⁹

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 925))

Included in the following conference series:

International Conference in Software Engineering for Defence Applications

528 Accesses
6 Citations
3 Altmetric

Abstract

Stance detection is a critical component of rumour and fake news identification. It involves the extraction of the stance a particular author takes related to a given claim, both expressed in text. This paper investigates stance classification for Russian. It introduces a new dataset, RuStance, of Russian tweets and news comments from multiple sources, covering multiple stories, as well as text classification approaches to stance detection as benchmarks over this data in this language. As well as presenting this openly-available dataset, the first of its kind for Russian, the paper presents a baseline for stance prediction in the language.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Rapoza K (2017) These two Russian ‘fake news’ outfits get billions of hits on Facebook. https://www.forbes.com/sites/kenrapoza/2017/09/22/these-two-russian-fake-news-outfits-get-billions-of-hits-on-facebook
Zubiaga A, Aker A, Bontcheva K, Liakata M, Procter R (April 2017) Detection and resolution of rumours in social media: a survey. ArXiv e-prints
Google Scholar
Mrowca D, Wang E, Kosson A (2017) Stance detection for fake news identification
Google Scholar
Ferreira W, Vlachos A (2016) Emergent: a novel data-set for stance classification. In: Proceedings of the 2016 conference of the North American chapter of the Association for Computational Linguistics: human language technologies. ACL
Google Scholar
Anta AF, Chiroque LN, Morere P, Santos A (2013) Sentiment analysis and topic detection of spanish tweets: a comparative study of of NLP techniques. Procesamiento del lenguaje natural 50:45–52
Google Scholar
Taulé M, Martí MA, Rangel FM, Rosso P, Bosco C, Patti V et al (2017) Overview of the task on stance and gender detection in tweets on Catalan independence at IberEval 2017. In: 2nd workshop on evaluation of human language technologies for Iberian languages, IberEval 2017, vol 1881, CEUR-WS, pp 157–177
Google Scholar
Enikolopov R, Petrova M, Zhuravskaya E (2011) Media and political persuasion: evidence from Russia. Am Econ Rev 101(7):3253–85
Article Google Scholar
Zubiaga A, Kochkina E, Liakata M, Procter R, Lukasik M (2016) Stance classification in rumours as a sequential task exploiting the tree structure of social media conversations. arXiv preprint arXiv:1609.09028
Qazvinian V, Rosengren E, Radev DR, Mei Q (2011) Rumor has it: identifying misinformation in microblogs. In: Proceedings of the conference on empirical methods in natural language processing. Association for Computational Linguistics, pp 1589–1599
Google Scholar
Lukasik M, Srijith P, Vu D, Bontcheva K, Zubiaga A, Cohn T (2016) Hawkes processes for continuous time sequence classification: an application to rumour stance classification in Twitter. In: Proceedings of 54th annual meeting of the Association for Computational Linguistics. Association for Computational Linguistics, pp 393–398
Google Scholar
Meduza (2018) http://meduza.io
Channel RT TV (2018) Russia Today. https://rt.com
The Guardian (2017) Russia’s ‘irrefutable evidence’ of us help for ISIS appears to be video game still. https://www.theguardian.com/world/2017/nov/14/russia-us-isis-syria-video-game-still
Meduza (2017) Meduza.io: on fake evidence. https://meduza.io/shapito/2017/11/14/minoborony-vylozhilo-neosporimoe-dokazatelstvo-sotrudnichestva-ssha-i-ig-skrinshot-iz-mobilnoy-igry
Meduza (2017) Meduza.io: on Ministry of Defense. https://meduza.io/shapito/2017/11/14/minoborony-vylozhilo-neosporimoe-dokazatelstvo-sotrudnichestva-ssha-i-ig-skrinshot-iz-mobilnoy-igry
Channel RT TV (2017) Russia Today: on Russian president candidates 2018. https://russian.rt.com/inotv/2017-11-14/Rukovoditel-internet-kampanii-Sobchak-eyo-uchastie
Derczynski L, Bontcheva K (2014) PHEME: veracity in digital social networks. In: UMAP workshops
Google Scholar
Pomerleau D, Rao D (2017) Fake news challenge. http://www.fakenewschallenge.org
Anonymous (2018) Data statements for NLP: toward mitigating system bias and enabling better science. OpenReview.net
Mikolov T, Sutskever I, Chen K, Corrado GS, Dean J (2013) Distributed representations of words and phrases and their compositionality. In: Advances in neural information processing systems, pp 3111–3119
Google Scholar
Rustance. https://figshare.com/articles/dataset_csv/7151906/2
Chollet F et al (2015) Keras. https://github.com/keras-team/keras
Liu X, Nourbakhsh A, Li Q, Fang R, Shah S (2015) Real-time rumor debunking on Twitter. In: Proceedings of the 24th ACM international on conference on information and knowledge management. ACM, pp 1867–1870
Google Scholar
Kutuzov A, Kuzmenko E (2017) WebVectors: a toolkit for building web interfaces for vector semantic models. Springer, Cham, pp 155–161
Google Scholar
Ghulati D (2016) Introducing factmata—artificial intelligence for political fact-checking. https://medium.com/factmata/introducing-factmata-artificial-intelligence-for-political-fact-checking-db8acdbf4cf1
Baird YPS, Sibley D (2017) Talos targets disinformation with fake news challenge victory. http://blog.talosintelligence.com/2017/06/talos-fake-news-challenge.html
Řehůřek R, Sojka P (May 2010) Software framework for topic modelling with large corpora. In: Proceedings of the LREC 2010 workshop on new challenges for NLP frameworks. ELRA, Valletta, pp 45–50. http://is.muni.cz/publication/884893/en
Derczynski L, Bontcheva K, Liakata M, Procter R, Hoi GWS, Zubiaga A (2017) SemEval-2017 task 8: RumourEval: determining rumour veracity and support for rumours. arXiv preprint arXiv:1704.05972
Mohammad SM, Sobhani P, Kiritchenko S (2017) Stance and sentiment in tweets. ACM Trans. Internet Technol. (TOIT) 17(3):26
Article Google Scholar
Aker A, Derczynski L, Bontcheva K (2017) Simple open stance classification for rumour analysis. In: Proceedings of RANLP
Google Scholar
Ruder S, Glover J, Mehrabani A, Ghaffari P (2018) 360\({}^\circ \) stance detection. In: Proceedings of the 2018 conference of the North American chapter of the Association for Computational Linguistics: demonstrations. Association for Computational Linguistics, pp 31–35
Google Scholar
Thorne J, Vlachos A, Christodoulopoulos C, Mittal A (2018) Fever: a large-scale dataset for fact extraction and verification. In: Proceedings of the 2018 conference of the North American chapter of the Association for Computational Linguistics: human language technologies (long papers), vol 1. Association for Computational Linguistics, pp 809–819
Google Scholar
Kochkina E, Liakata M, Augenstein I (2017) Turing at SemEval-2017 task 8: sequential approach to rumour stance classification with branch-LSTM. In: Proceedings of SemEval (2017)
Google Scholar

Download references

Author information

Authors and Affiliations

Innopolis University, Innopolis, Russian Federation
Nikita Lozhnikov & Manuel Mazzara
ITU Copenhagen, Copenhagen, Denmark
Leon Derczynski

Authors

Nikita Lozhnikov
View author publications
You can also search for this author in PubMed Google Scholar
Leon Derczynski
View author publications
You can also search for this author in PubMed Google Scholar
Manuel Mazzara
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to Nikita Lozhnikov , Leon Derczynski or Manuel Mazzara .

Editor information

Editors and Affiliations

University of Bologna, Bologna, Italy
Paolo Ciancarini
Innopolis University, Innopolis, Russia
Manuel Mazzara
Innopolis University, Innopolis, Russia
Angelo Messina
Innopolis University, Innopolis, Russia
Alberto Sillitti
Innopolis University, Innopolis, Russia
Giancarlo Succi

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Lozhnikov, N., Derczynski, L., Mazzara, M. (2020). Stance Prediction for Russian: Data and Analysis. In: Ciancarini, P., Mazzara, M., Messina, A., Sillitti, A., Succi, G. (eds) Proceedings of 6th International Conference in Software Engineering for Defence Applications. SEDA 2018. Advances in Intelligent Systems and Computing, vol 925. Springer, Cham. https://doi.org/10.1007/978-3-030-14687-0_16

Download citation

DOI: https://doi.org/10.1007/978-3-030-14687-0_16
Published: 19 March 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-14686-3
Online ISBN: 978-3-030-14687-0
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics