Skip to main content

An Efficient Online Event Detection Method for Microblogs via User Modeling

  • Conference paper
  • First Online:
Web Technologies and Applications (APWeb 2016)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 9931))

Included in the following conference series:

Abstract

Detecting events in microblog is important but still challenging. As tweet stream is a mixture of user interests and external events, its expensive to distinguish them. Existing methods are ineffective since they ignore user interests or only model interests and events on a fixed dataset without scalability. In this paper, we introduce an online learning model User Modeling Based Interest and Event Topic Model (UMIETM). UMIETM (1) exploits user modeling’s information to discover events, which usually capture attentions from users with different interests, and (2) treats the arriving data as stream and run the detection in online learning style. Furthermore, UMIETM can handle dynamic increased vocabulary in tweet stream. The UMIETM is verified on the real dataset which spans one year and contains 16 million tweets, and it outperforms state-of-the-art models in quantitative.

This research is supported by the Natural Science Foundation of China (Grant No. 61300003, 61572043), and the Specialized Research Fund for the Doctoral Program of Higher Education (Grant No. 20130001120001).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    https://twitter.com/biz.

  2. 2.

    http://weibo.com/hadoopchina.

  3. 3.

    http://en.wikipedia.org/wiki/Sina_Weibo/.

  4. 4.

    http://open.weibo.com/.

  5. 5.

    http://mallet.cs.umass.edu/dist/mallet-2.0.7.tar.gz.

References

  1. Lau, J.H., Collier, N., Baldwin, T.: On-line trend analysis with topic models: #twitter trends detection topic model online. In COLING, 2012

    Google Scholar 

  2. Diao, Q., Jiang, J., Zhu, F., Lim, E.-P.: Finding bursty topics from microblogs. In: ACL (2012)

    Google Scholar 

  3. He, Q., Chang, K., Lim, E.-P.: Analyzing feature trajectories for event detection. In: ACM SIGIR conference on Research and Development in information retrieval, pp. 207–214. ACM (2007)

    Google Scholar 

  4. Weng, J., Lee, B.-S.: Event detection in twitter. In: ICWSM (2011)

    Google Scholar 

  5. Petrović, S., Osborne, M., Lavrenko, V.: Streaming first story detection with application to twitter. In: HLT-NAACL (2010)

    Google Scholar 

  6. McCreadie, R., Macdonald, C., Ounis, I., Osborne, M., Petrovic, S.: Scalable distributed event detection for twitter (2013)

    Google Scholar 

  7. Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent dirichlet allocation. JMLR (2003)

    Google Scholar 

  8. Diao, Q., Jiang, J.: A unified model for topics, events and users on twitter. In: EMNLP (2013)

    Google Scholar 

  9. Yan, X., Guo, J., Lan, Y., Jun, X., Cheng, X.: A Probabilistic Model for Bursty Topic Discovery in Microblogs. AAAI, pp. 353–359 (2015)

    Google Scholar 

  10. Zhao, Y., Wang, G., Yu, P.S., Liu, S., Zhang, S.: Simon: inferring social roles and statuses in social networks. In: SIGKDD. ACM (2013)

    Google Scholar 

  11. Yoshida, T.: Toward finding hidden communities based on user profile. J. Intell. Inf. Syst. 40(2), 189–209 (2013)

    Article  Google Scholar 

  12. Culotta, A., Kumar, N.R., Cutler, J.: Predicting the demographics of twitter users from website traffic data. In: AAAI, pp. 72–78 (2015)

    Google Scholar 

  13. Faralli, S., Stilo, G., Velardi, P.: Large scale homophily analysis in twitter using a twixonomy. In: IJCAI, pp. 2334–2340. AAAI Press (2015)

    Google Scholar 

  14. Blei, D.M., Jordan, M.I.: Modeling annotated data. In: SIGIR (2003)

    Google Scholar 

  15. Griffiths, T.L., Steyvers, M.: Finding scientific topics. PNAS (2004)

    Google Scholar 

  16. Zhao, W.X., Jiang, J., Weng, J., He, J., Lim, E.-P., Yan, H., Li, X.: Comparing twitter and traditional media using topic models. In: Clough, P., Foley, C., Gurrin, C., Jones, G.J.F., Kraaij, W., Lee, H., Mudoch, V. (eds.) ECIR 2011. LNCS, vol. 6611, pp. 338–349. Springer, Heidelberg (2011)

    Chapter  Google Scholar 

  17. Quan, X., Kit, C., Ge, Y., Pan, S.J.: Short and sparse text topic modeling via self-aggregation. IJCAI, pp. 2270–2276 (2015)

    Google Scholar 

  18. Wallach, H.M.: Structured topic models for language. Doctoral dissertation, Univ. of Cambridge (2008)

    Google Scholar 

  19. Wallach, H.M., Murray, I., Salakhutdinov, R., Mimno, D.: Evaluation methods for topic models. In: ICML (2009)

    Google Scholar 

  20. Wallach, H.M., Mimno, D.M., McCallum, A.: Rethinking lda: Why priors matter. In: NIPS, vol. 22, pp. 1973–1981 (2009)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Wei Chen .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing Switzerland

About this paper

Cite this paper

Huang, W., Chen, W., Zhang, L., Wang, T. (2016). An Efficient Online Event Detection Method for Microblogs via User Modeling. In: Li, F., Shim, K., Zheng, K., Liu, G. (eds) Web Technologies and Applications. APWeb 2016. Lecture Notes in Computer Science(), vol 9931. Springer, Cham. https://doi.org/10.1007/978-3-319-45814-4_27

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-45814-4_27

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-45813-7

  • Online ISBN: 978-3-319-45814-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics