Abstract
Twitter’s increasing popularity as a source of up to date news and information about current events has spawned a body of research on event detection techniques for social media data streams. Although all proposed approaches provide some evidence as to the quality of the detected events, none relate this task-based performance to their run-time performance in terms of processing speed or data throughput. In particular, neither a quantitative nor a comparative evaluation of these aspects has been performed to date. In this paper, we study the run-time and task-based performance of several state-of-the-art event detection techniques for Twitter. In order to reproducibly compare run-time performance, our approach is based on a general-purpose data stream management system, whereas task-based performance is automatically assessed based on a series of novel measures.
Chapter PDF
References
Abadi, D.J., Ahmad, Y., Balazinska, M., Çetintemel, U., Cherniack, M., Hwang, J., Lindner, W., Maskey, A., Rasin, A., Ryvkina, E., Tatbul, N., Xing, Y., Zdonik, S.B.: The design of the borealis stream processing engine. In: Proc. Intl. Conf. on Innovative Data Systems Research (CIDR), pp. 277–289 (2005)
Abadi, D.J., Carney, D., Çetintemel, U., Cherniack, M., Convey, C., Lee, S., Stonebraker, M., Tatbul, N., Zdonik, S.: Aurora: A New Model and Architecture for Data Stream Management. The VLDB Journal 12(2), 120–139 (2003)
Allan, J.: Topic Detection and Tracking: Event-based Information Organization. Kluwer Academic Publishers (2002)
Alvanaki, F., Michel, S., Ramamritham, K., Weikum, G.: See what’s enBlogue: real-time emergent topic identification in social media. In: Proc. Intl. Conf. on Extending Database Technology (EDBT), pp. 336–347 (2012)
Arasu, A., Babu, S., Widom, J.: The CQL Continuous Query Language: Semantic Foundations and Query Execution. The VLDB Journal 15(2), 121–142 (2006)
Becker, H., Naaman, M., Gravano, L.: Beyond trending topics: real-world event identification on twitter. In: Proc. Intl. Conf on Weblogs and Social Media (ICWSM), pp. 438–441 (2011)
Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent Dirichlet Allocation. J. Mach. Learn. Res. 3, 993–1022 (2003)
Bontcheva, K., Rout, D.: Making Sense of Social Media Streams through Semantics: a Survey. Semantic Web 5(5), 373–403 (2014)
Cordeiro, M.: Twitter event detection: combining wavelet analysis and topic inference summarization. In: Proc. Doctoral Symposium on Informatics Engineering (DSIE) (2012)
Culotta, A.: Towards detecting influenza epidemics by analyzing twitter messages. In: Proc. Workshop on Social Media Analytics (SOMA), pp. 115–122 (2010)
Farzindar, A., Khreich, W.: A Survey of Techniques for Event Detection in Twitter. Computational Intelligence (2013). http://dx.doi.org/10.1111/coin.12017
Li, J., Maier, D., Tufte, K., Papadimos, V., Tucker, P.A.: No Pane, No Gain: Efficient Evaluation of Sliding-Window Aggregates over Data Streams. SIGMOD Record 34(1), 39–44 (2005)
Li, J., Tufte, K., Shkapenyuk, V., Papadimos, V., Johnson, T., Maier, D.: Out-of-Order Processing: A New Architecture for High-Performance Stream Systems. PVLDB 1(1), 274–288 (2008)
Long, R., Wang, H., Chen, Y., Jin, O., Yu, Y.: Towards effective event detection, tracking and summarization on microblog data. In: Wang, H., Li, S., Oyama, S., Hu, X., Qian, T. (eds.) WAIM 2011. LNCS, vol. 6897, pp. 652–663. Springer, Heidelberg (2011)
Maier, D., Grossniklaus, M., Moorthy, S., Tufte, K.: Capturing episodes: may the frame be with you. In: Proc. Intl. Conf. on Distributed Event-Based Systems (DEBS), pp. 1–11 (2012)
Petrović, S., Osborne, M., Lavrenko, V.: Streaming first story detection with application to twitter. In: Proc. Conf. of the North American Chapter of the Association for Computational Linguistics (HLT), pp. 181–189 (2010)
Sakaki, T., Okazaki, M., Matsuo, Y.: Earthquake shakes twitter users: real-time event detection by social sensors. In: Proc. Intl. Conf. on World Wide Web (WWW), pp. 851–860 (2010)
Sparck Jones, K.: A Statistical Interpretation of Term Specificity and Its Application in Retrieval, pp. 132–142. Taylor Graham Publishing (1988)
Toshniwal, A., Taneja, S., Shukla, A., Ramasamy, K., Patel, J.M., Kulkarni, S., Jackson, J., Gade, K., Fu, M., Donham, J., Bhagat, N., Mittal, S., Ryaboy, D.V.: Storm @Twitter. In: Proc. Intl. Conf. on Management of Data (SIGMOD), pp. 147–156 (2014)
Weiler, A., Grossniklaus, M., Scholl, M.H.: Event identification and tracking in social media streaming data. In: Proc. EDBT Workshop on Multimodal Social Data Management (MSDM), pp. 282–287 (2014)
Weng, J., Lee, B.S.: Event detection in twitter. In: Proc. Intl. Conf on Weblogs and Social Media (ICWSM), pp. 401–408 (2011)
Zimmermann, M., Ntoutsi, I., Siddiqui, Z.F., Spiliopoulou, M., Kriegel, H.P.: Discovering global and local bursts in a stream of news. In: Proc. Symp. on Applied Computing (SAC), pp. 807–812 (2012)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Weiler, A., Grossniklaus, M., Scholl, M.H. (2015). Run-Time and Task-Based Performance of Event Detection Techniques for Twitter. In: Zdravkovic, J., Kirikova, M., Johannesson, P. (eds) Advanced Information Systems Engineering. CAiSE 2015. Lecture Notes in Computer Science(), vol 9097. Springer, Cham. https://doi.org/10.1007/978-3-319-19069-3_3
Download citation
DOI: https://doi.org/10.1007/978-3-319-19069-3_3
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-19068-6
Online ISBN: 978-3-319-19069-3
eBook Packages: Computer ScienceComputer Science (R0)