Abstract
Looking at the dynamics of news content and social media content can help us understand the increasingly complex dynamics of the relationship between the media and the public surrounding noteworthy news events. Although topic models such as latent Dirichlet allocation (lda) are valuable tools, they are a poor fit for analyses in which some documents, like news articles, tend to incorporate multiple topics, while others, like tweets, tend to be focused on just one. In this paper, we propose Single Topic lda (st-lda) which jointly models news-type documents as distributions of topics and tweets as having a single topic; the model improves topic discovery in news and tweets within a unified topic space by removing noisy topics that conventional lda tends to assign to tweets. Using st-lda, we focus on the unrest in Ferguson, Missouri after the fatal shooting of Michael Brown on August 9, 2014, looking in particular at the topic dynamics of tweets in and out of St. Louis area, and at differences and relationships between topic coverage in news and tweets.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
To identify locations of tweets, we refer to the geographic boundary file of 2014 TIGER/Line, https://www.census.gov/geo/maps-data/data/tiger-line.html.
- 2.
Tweets from these media sources are filtered from our Twitter data.
- 3.
News tokenization is done by OpenNLP, https://opennlp.apache.org/. Tweet tokenization is done by Twokenizer, http://www.cs.cmu.edu/~ark/TweetNLP/.
- 4.
Code is available at https://github.com/ywwbill/YWWTools#st_lda_cmd.
- 5.
Note that st-lda will not outperform lda on perplexity, since the words in a tweet are generated from the same topic. However, the sacrifice of perplexity brings improvement in topic identification.
References
Breuer, A., Landman, T., Farquhar, D.: Social media and protest mobilization: evidence from the Tunisian revolution. Democratization 22, 764?792 (2014)
Cataldi, M., Di Caro, L., Schifanella, C.: Emerging topic detection on Twitter based on temporal and social terms evaluation. In: Proceedings of Conference on Knowledge Discovery and Data Mining (2010)
Effing, R., van Hillegersberg, J., Huibers, T.: Social media and political participation: are Facebook, Twitter and YouTube democratizing our political systems? In: International Conference on Electronic Participation (2011)
Entman, R.M.: Framing: towards clarification of a fractured paradigm. J. Commun. 43, 51?58 (1993)
Fujita, K., Henderson, M.D., Eng, J., Trope, Y., Liberman, N.: Spatial distance and mental construal of social events. Psycholog. Sci. 17, 278?282 (2006)
Gao, W., Li, P., Darwish, K.: Joint topic modeling for event summarization across news and social media streams. In: Proceedings of the ACM International Conference on Information and Knowledge Management (2012)
González-Bailón, S., Borge-Holthoefer, J., Rivero, A., Moreno, Y.: The dynamics of protest recruitment through an online network. Sci. Rep. (2011)
He, D., Parker, D.S.: Topic dynamics: an alternative model of bursts in streams of topics. In: Proceedings of Conference on Knowledge Discovery and Data Mining (2010)
He, J., Hong, L., Frias-Martinez, V., Torrens, P.: Uncovering social media reaction pattern to protest events: a spatiotemporal dynamics perspective of ferguson unrest. In: International Conference on Social Informatics (2015)
He, Y., Lin, C., Gao, W., Wong, K.F.: Tracking sentiment and topic dynamics from social media. In: Proceedings of International Conference on Weblogs and Social Media (2012)
Hofmann, T.: Probabilistic latent semantic indexing. In: Proceedings of the ACM SIGIR Conference on Research and Development in Information Retrieval (1999)
Hong, L., Davison, B.D.: Empirical study of topic modeling in Twitter. In: Proceedings of Conference on Knowledge Discovery and Data Mining (2010)
Hu, Y., John, A., Wang, F., Kambhampati, S.: ET-LDA: joint topic modeling for aligning events and their Twitter feedback. In: Proceedings of the Association for the Advancement of Artificial Intelligence (2012)
Hua, T., Yue, N., Chen, F., Lu, C.T., Ramakrishnan, N.: Topical analysis of interactions between news and social media. In: Proceedings of the Association for the Advancement of Artificial Intelligence (2016)
Iwata, T., Yamada, T., Sakurai, Y., Ueda, N.: Sequential modeling of topic dynamics with multiple timescales. ACM Trans. Knowl. Discov. Data (TKDD) 5, 19:1?19:27 (2012)
Lau, J.H., Collier, N., Baldwin, T.: On-line trend analysis with topic models: #Twitter trends detection topic model online. In: Proceedings of International Conference on Computational Linguistics (2012)
Leskovec, J., Backstrom, L., Kleinberg, J.: Meme-tracking and the dynamics of the news cycle. In: Proceedings of Conference on Knowledge Discovery and Data Mining (2009)
Mane, K.K., Börner, K.: Mapping topics and topic bursts in PNAS. Proc. Natl. Acad. Sci. 101, 5287?5290 (2004)
Morinaga, S., Yamanishi, K.: Tracking dynamics of topic trends using a finite mixture model. In: Proceedings of Conference on Knowledge Discovery and Data Mining (2004)
Ratkiewicz, J., Conover, M., Meiss, M., Gonçalves, B., Patil, S., Flammini, A., Menczer, F.: Detecting and tracking the spread of astroturf memes in microblog streams. arXiv preprint arXiv:1011.3768 (2010)
Sayre, B., Bode, L., Shah, D., Wilcox, D., Shah, C.: Agenda setting in a digital age: tracking attention to California Proposition 8 in social media, online news and conventional news. Policy & Internet (2010)
Tufekci, Z., Wilson, C.: Social media and the decision to participate in political protest: observations from Tahrir Square. J. Commun. (2012)
Weng, J., Lim, E.P., Jiang, J., He, Q.: Twitterrank: finding topic-sensitive influential Twitterers. In: Proceedings of ACM International Conference on Web Search and Data Mining (2010)
Yan, X., Guo, J., Lan, Y., Cheng, X.: A biterm topic model for short texts. In: Proceedings of World Wide Web Conference (2013)
Zhao, W.X., Jiang, J., Weng, J., He, J., Lim, E.P., Yan, H., Li, X.: Comparing twitter and traditional media using topic models. In: Proceedings of the European Conference on Information Retrieval (2011)
Zuo, Y., Zhao, J., Xu, K.: Word network topic model: a simple but general solution for short and imbalanced texts. Knowl. Inf. Syst. 48, 379398 (2014)
Acknowledgement
We thank anonymous reviewers for their insightful comments.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing AG
About this paper
Cite this paper
Hong, L., Yang, W., Resnik, P., Frias-Martinez, V. (2016). Uncovering Topic Dynamics of Social Media and News: The Case of Ferguson. In: Spiro, E., Ahn, YY. (eds) Social Informatics. SocInfo 2016. Lecture Notes in Computer Science(), vol 10046. Springer, Cham. https://doi.org/10.1007/978-3-319-47880-7_15
Download citation
DOI: https://doi.org/10.1007/978-3-319-47880-7_15
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-47879-1
Online ISBN: 978-3-319-47880-7
eBook Packages: Computer ScienceComputer Science (R0)