Abstract
Stance detection is a critical component of rumour and fake news identification. It involves the extraction of the stance a particular author takes related to a given claim, both expressed in text. This paper investigates stance classification for Russian. It introduces a new dataset, RuStance, of Russian tweets and news comments from multiple sources, covering multiple stories, as well as text classification approaches to stance detection as benchmarks over this data in this language. As well as presenting this openly-available dataset, the first of its kind for Russian, the paper presents a baseline for stance prediction in the language.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Rapoza K (2017) These two Russian ‘fake news’ outfits get billions of hits on Facebook. https://www.forbes.com/sites/kenrapoza/2017/09/22/these-two-russian-fake-news-outfits-get-billions-of-hits-on-facebook
Zubiaga A, Aker A, Bontcheva K, Liakata M, Procter R (April 2017) Detection and resolution of rumours in social media: a survey. ArXiv e-prints
Mrowca D, Wang E, Kosson A (2017) Stance detection for fake news identification
Ferreira W, Vlachos A (2016) Emergent: a novel data-set for stance classification. In: Proceedings of the 2016 conference of the North American chapter of the Association for Computational Linguistics: human language technologies. ACL
Anta AF, Chiroque LN, Morere P, Santos A (2013) Sentiment analysis and topic detection of spanish tweets: a comparative study of of NLP techniques. Procesamiento del lenguaje natural 50:45–52
Taulé M, Martà MA, Rangel FM, Rosso P, Bosco C, Patti V et al (2017) Overview of the task on stance and gender detection in tweets on Catalan independence at IberEval 2017. In: 2nd workshop on evaluation of human language technologies for Iberian languages, IberEval 2017, vol 1881, CEUR-WS, pp 157–177
Enikolopov R, Petrova M, Zhuravskaya E (2011) Media and political persuasion: evidence from Russia. Am Econ Rev 101(7):3253–85
Zubiaga A, Kochkina E, Liakata M, Procter R, Lukasik M (2016) Stance classification in rumours as a sequential task exploiting the tree structure of social media conversations. arXiv preprint arXiv:1609.09028
Qazvinian V, Rosengren E, Radev DR, Mei Q (2011) Rumor has it: identifying misinformation in microblogs. In: Proceedings of the conference on empirical methods in natural language processing. Association for Computational Linguistics, pp 1589–1599
Lukasik M, Srijith P, Vu D, Bontcheva K, Zubiaga A, Cohn T (2016) Hawkes processes for continuous time sequence classification: an application to rumour stance classification in Twitter. In: Proceedings of 54th annual meeting of the Association for Computational Linguistics. Association for Computational Linguistics, pp 393–398
Meduza (2018) http://meduza.io
Channel RT TV (2018) Russia Today. https://rt.com
The Guardian (2017) Russia’s ‘irrefutable evidence’ of us help for ISIS appears to be video game still. https://www.theguardian.com/world/2017/nov/14/russia-us-isis-syria-video-game-still
Meduza (2017) Meduza.io: on fake evidence. https://meduza.io/shapito/2017/11/14/minoborony-vylozhilo-neosporimoe-dokazatelstvo-sotrudnichestva-ssha-i-ig-skrinshot-iz-mobilnoy-igry
Meduza (2017) Meduza.io: on Ministry of Defense. https://meduza.io/shapito/2017/11/14/minoborony-vylozhilo-neosporimoe-dokazatelstvo-sotrudnichestva-ssha-i-ig-skrinshot-iz-mobilnoy-igry
Channel RT TV (2017) Russia Today: on Russian president candidates 2018. https://russian.rt.com/inotv/2017-11-14/Rukovoditel-internet-kampanii-Sobchak-eyo-uchastie
Derczynski L, Bontcheva K (2014) PHEME: veracity in digital social networks. In: UMAP workshops
Pomerleau D, Rao D (2017) Fake news challenge. http://www.fakenewschallenge.org
Anonymous (2018) Data statements for NLP: toward mitigating system bias and enabling better science. OpenReview.net
Mikolov T, Sutskever I, Chen K, Corrado GS, Dean J (2013) Distributed representations of words and phrases and their compositionality. In: Advances in neural information processing systems, pp 3111–3119
Rustance. https://figshare.com/articles/dataset_csv/7151906/2
Chollet F et al (2015) Keras. https://github.com/keras-team/keras
Liu X, Nourbakhsh A, Li Q, Fang R, Shah S (2015) Real-time rumor debunking on Twitter. In: Proceedings of the 24th ACM international on conference on information and knowledge management. ACM, pp 1867–1870
Kutuzov A, Kuzmenko E (2017) WebVectors: a toolkit for building web interfaces for vector semantic models. Springer, Cham, pp 155–161
Ghulati D (2016) Introducing factmata—artificial intelligence for political fact-checking. https://medium.com/factmata/introducing-factmata-artificial-intelligence-for-political-fact-checking-db8acdbf4cf1
Baird YPS, Sibley D (2017) Talos targets disinformation with fake news challenge victory. http://blog.talosintelligence.com/2017/06/talos-fake-news-challenge.html
Řehůřek R, Sojka P (May 2010) Software framework for topic modelling with large corpora. In: Proceedings of the LREC 2010 workshop on new challenges for NLP frameworks. ELRA, Valletta, pp 45–50. http://is.muni.cz/publication/884893/en
Derczynski L, Bontcheva K, Liakata M, Procter R, Hoi GWS, Zubiaga A (2017) SemEval-2017 task 8: RumourEval: determining rumour veracity and support for rumours. arXiv preprint arXiv:1704.05972
Mohammad SM, Sobhani P, Kiritchenko S (2017) Stance and sentiment in tweets. ACM Trans. Internet Technol. (TOIT) 17(3):26
Aker A, Derczynski L, Bontcheva K (2017) Simple open stance classification for rumour analysis. In: Proceedings of RANLP
Ruder S, Glover J, Mehrabani A, Ghaffari P (2018) 360\({}^\circ \) stance detection. In: Proceedings of the 2018 conference of the North American chapter of the Association for Computational Linguistics: demonstrations. Association for Computational Linguistics, pp 31–35
Thorne J, Vlachos A, Christodoulopoulos C, Mittal A (2018) Fever: a large-scale dataset for fact extraction and verification. In: Proceedings of the 2018 conference of the North American chapter of the Association for Computational Linguistics: human language technologies (long papers), vol 1. Association for Computational Linguistics, pp 809–819
Kochkina E, Liakata M, Augenstein I (2017) Turing at SemEval-2017 task 8: sequential approach to rumour stance classification with branch-LSTM. In: Proceedings of SemEval (2017)
Author information
Authors and Affiliations
Corresponding authors
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Lozhnikov, N., Derczynski, L., Mazzara, M. (2020). Stance Prediction for Russian: Data and Analysis. In: Ciancarini, P., Mazzara, M., Messina, A., Sillitti, A., Succi, G. (eds) Proceedings of 6th International Conference in Software Engineering for Defence Applications. SEDA 2018. Advances in Intelligent Systems and Computing, vol 925. Springer, Cham. https://doi.org/10.1007/978-3-030-14687-0_16
Download citation
DOI: https://doi.org/10.1007/978-3-030-14687-0_16
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-14686-3
Online ISBN: 978-3-030-14687-0
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)