Skip to main content

Graphical Models with Content Relevance for Crucial Date Detection in Social Media Event

  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 11062))

Abstract

Crucial date detection from social media text streams of an event aims to select the important dates where happen the important sub-events. The crucial dates are an important part of timeline summarization. The existing work on traditional news gains candidate dates through date expressions, and then selects crucial dates. While the posts in social media usually have fewer date expressions, so this is a new challenge for date detection. We observe that (1) there is an obvious burst when an sub-event happens; (2) the lasting durations of different sub-events are different and there are some content relevances among sub-events. Therefore, we propose a graphical models with Content Relevance for crucial Date Detection in social media event (CRDD). The model treats the publishing dates set of tweet streams as candidate dates set. A graph is constructed by content relevances of tweet streams collected in different candidate dates. The content relevances integrate semantic information of sub-topics and burst property of social media event. Based on the graph, random walk model is used to rank dates. The experiments on Twitter datasets about Arab Spring show that the proposed model is effective.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Binh Tran, G., Alrifai, M., Quoc Nguyen, D.: Predicting relevant news events for timeline summaries. In: Proceedings of the 22nd International Conference on World Wide Web, pp. 91–92. ACM (2013)

    Google Scholar 

  2. Blei, D.M., Lafferty, J.D.: Dynamic topic models. In: Proceedings of the 23rd International Conference on Machine Learning, pp. 113–120. ACM (2006)

    Google Scholar 

  3. Chang, Y., Tang, J., Yin, D., Yamada, M., Liu, Y.: Timeline summarization from social media with life cycle models. In: IJCAI, pp. 3698–3704 (2016)

    Google Scholar 

  4. Chang, Y., Yamada, M., Ortega, A., Liu, Y.: Ups and downs in buzzes: life cycle modeling for temporal pattern discovery. In: 2014 IEEE International Conference on Data Mining (ICDM), pp. 749–754. IEEE (2014)

    Google Scholar 

  5. Gao, D., Li, W., Cai, X., Zhang, R.: Sequential summarization: a full view of Twitter trending topics. IEEE/ACM Trans. Audio Speech Lang. Process. (TASLP) 22, 293–302 (2014)

    Article  Google Scholar 

  6. Kessler, R., Tannier, X., Hagege, C., Moriceau, V.: Finding salient dates for building thematic timelines. In: Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Long Papers, vol. 1, pp. 730–739. Association for Computational Linguistics (2012)

    Google Scholar 

  7. Marcus, A., Bernstein, M.S., Badar, O., Karger, D.R., Madden, S., Miller, R.C.: TwitInfo: aggregating and visualizing microblogs for event exploration. In: Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, pp. 227–236. ACM (2011)

    Google Scholar 

  8. Mihalcea, R., Tarau, P.: TextRank: bringing order into texts. In: EMNLP, pp. 404–411 (2004)

    Google Scholar 

  9. Mikolov, T., Chen, K., Corrado, G., Dean, J.: Efficient estimation of word representations in vector space. arXiv preprint arXiv:1301.3781 (2013)

  10. Page, L., Brin, S., Motwani, R., Winograd, T.: The PageRank citation ranking: bringing order to the web. Technical report, Stanford InfoLab (1999)

    Google Scholar 

  11. Ramos, J., et al.: Using TF-IDF to determine word relevance in document queries. In: Proceedings of the First Instructional Conference on Machine Learning, vol. 242, pp. 133–142 (2003)

    Google Scholar 

  12. Reed, J.W., Jiao, Y., Potok, T.E., Klump, B.A., Elmore, M.T., Hurson, A.R.: TF-ICF: a new term weighting scheme for clustering dynamic data streams. In: 2006 5th International Conference on Machine Learning and Applications, ICMLA 2006, pp. 258–263. IEEE (2006)

    Google Scholar 

  13. Tran, G., Herder, E., Markert, K.: Joint graphical models for date selection in timeline summarization. In: Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing: Long Papers, vol. 1, pp. 1598–1607 (2015)

    Google Scholar 

  14. Zhao, W.X., et al.: Comparing Twitter and traditional media using topic models. In: Clough, P., et al. (eds.) ECIR 2011. LNCS, vol. 6611, pp. 338–349. Springer, Heidelberg (2011). https://doi.org/10.1007/978-3-642-20161-5_34

    Chapter  Google Scholar 

Download references

Acknowledgements

This work was supported by the National Natural Science Foundation of China (61472277). We also thank the anonymous reviewers for their valuable comments.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ruifang He .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

He, R., Ding, D. (2018). Graphical Models with Content Relevance for Crucial Date Detection in Social Media Event. In: Liu, W., Giunchiglia, F., Yang, B. (eds) Knowledge Science, Engineering and Management. KSEM 2018. Lecture Notes in Computer Science(), vol 11062. Springer, Cham. https://doi.org/10.1007/978-3-319-99247-1_37

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-99247-1_37

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-99246-4

  • Online ISBN: 978-3-319-99247-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics