Skip to main content

User Message Model: A New Approach to Scalable User Modeling on Microblog

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 8870))

Abstract

Modeling users’ topical interests on microblog is an important but challenging task. In this paper, we propose User Message Model (UMM), a hierarchical topic model specially designed for user modeling on microblog. In UMM, users and their messages are modeled by a hierarchy of topics. Thus, it has the ability to 1) deal with both the data sparseness and the topic diversity problems which previous methods suffer from, and 2) jointly model users and messages in a unified framework. Furthermore, UMM can be easily distributed to handle large-scale datasets. Experimental results on both Sina Weibo and Twitter datasets show that UMM can effectively model users’ interests on microblog. It can achieve better results than previous methods in topic discovery and message recommendation. Experimental results on a large-scale Twitter dataset, containing about 2 million users and 50 million messages, further demonstrate the scalability and efficiency of distributed UMM.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Abel, F., Gao, Q., Houben, G.-J., Tao, K.: Analyzing User Modeling on Twitter for Personalized News Recommendations. In: Konstan, J.A., Conejo, R., Marzo, J.L., Oliver, N. (eds.) UMAP 2011. LNCS, vol. 6787, pp. 1–12. Springer, Heidelberg (2011)

    Chapter  Google Scholar 

  2. Ahmed, A., Low, Y., Aly, M., Josifovski, V., Smola, A.J.: Scalable Distributed Inference of Dynamic User Interests for Behavioral Targeting. In: SIGKDD, pp. 114–122 (2011)

    Google Scholar 

  3. Blei, D., Ng, Y., Jordan, I.: Latent Dirichlet Allocation. Mach. Learn. Res. (2003)

    Google Scholar 

  4. Chen, K., Chen, T., Zheng, G., Jin, O., Yao, E., Yu, Y.: Collaborative Personalized Tweet Recommendation. In: SIGIR, pp. 661–670 (2012)

    Google Scholar 

  5. Diao, Q., Jiang, J.: A Unified Model for Topics, Events and Users on Twitter. In: EMNLP (2013)

    Google Scholar 

  6. Diao, Q., Jiang, J., Zhu, F., Lim, E.P.: Finding Bursty Topics from Microblogs. In: ACL (2012)

    Google Scholar 

  7. Grant, C., George, C.P., Jenneisch, C., Wilson, J.N.: Online Topic Modeling for Real-time Twitter Search. In: TREC (2011)

    Google Scholar 

  8. Griffiths, T.L., Steyvers, M.: Finding scientific topics. In: Proc. Natl. Acad. Sci., USA (2004)

    Google Scholar 

  9. Hong, L., Davison, B.D.: Empirical Study of Topic Modeling in Twitter. In: SIGKDD Workshop, pp. 80–88 (2010)

    Google Scholar 

  10. Hong, L., Doumith, A.S., Davison, B.D.: Co-factorization Machines: Modeling User Interests and Predicting Individual Decisions in Twitter. In: WSDM, pp. 557–566 (2013)

    Google Scholar 

  11. Hu, Y., John, A., Wang, F., Kambhampati, S.: Et-lda: Joint Topic Modeling for Aligning Events and their Twitter Feedback. In: AAAI (2012)

    Google Scholar 

  12. Järvelin, K., Kekääinen, J.: Cumulated Gain-based Evaluation of IR Techniques. ACM Trans. Inf. Sys. (2002)

    Google Scholar 

  13. Joachims, T.: Optimizing Search Engines using Clickthrough Data. In: SIGKDD, pp. 133–142 (2002)

    Google Scholar 

  14. Li, H.: Learning to Rank for Information Retrieval and Natural Language Processing (2011)

    Google Scholar 

  15. Michelson, M., Macskassy, S.A.: Discovering Users Topics of Interest on Twitter: A First Look. In: CIKM Workshop, pp. 73–80 (2010)

    Google Scholar 

  16. Newman, D., Asuncion, A., Smyth, P., Welling, M.: Distributed Inference for Latent Dirichlet Allocation. In: NIPS (2007)

    Google Scholar 

  17. Pennacchiotti, M., Gurumurthy, S.: Investigating Topic Models for Social Media User Recommendation. In: WWW, pp. 101–102 (2011)

    Google Scholar 

  18. Ramage, D., Dumais, S.T., Liebling, D.J.: Characterizing Microblogs with Topic Models. In: AAAI (2010)

    Google Scholar 

  19. Ren, Z., Liang, S., Meij, E., de Rijke, M.: Personalized Time-aware Tweets Summarization. In: SIGIR, pp. 513–522 (2013)

    Google Scholar 

  20. Steyvers, M., Smyth, P., Rosen-Zvi, M., Griffiths, T.: Probabilistic Author-topic Models for Information Discovery. In: SIGKDD, pp. 306–315 (2004)

    Google Scholar 

  21. Teh, Y.W., Jordan, M.I., Beal, M.J., Blei, D.M.: Hierarchical Dirichlet Processes. J. Am. Stat. Assoc. (2006)

    Google Scholar 

  22. Wen, Z., Lin, C.Y.: On the Quality of Inferring Interests from Social Neighbors. In: SIGKDD, pp. 373–382 (2010)

    Google Scholar 

  23. Weng, J., Lim, E.P., Jiang, J., He, Q.: TwitterRank: Finding Topic-sensitive Influential Twitterers. In: WSDM, pp. 261–270 (2010)

    Google Scholar 

  24. Wu, W., Zhang, B., Ostendorf, M.: Automatic Generation of Personalized Annotation Tags for Twitter Users. In: NAACL-HLT, pp. 689–692 (2010)

    Google Scholar 

  25. Xu, Z., Lu, R., Xiang, L., Yang, Q.: Discovering User Interest on Twitter with a Modified Author-topic Model. In: WI-IAT, pp. 422–429 (2011)

    Google Scholar 

  26. Xu, Z., Zhang, Y., Wu, Y., Yang, Q.: Modeling User Posting Behavior on Social Media. In: SIGIR (2012)

    Google Scholar 

  27. Yuan, Q., Cong, G., Ma, Z., Sun, A., Magnenat-Thalmann, N.: Who, where, when and what: Discover Spatio-temporal Topics for Twitter Users. In: SIGKDD, pp. 605–613 (2013)

    Google Scholar 

  28. Zhao, W.X., Jiang, J., Weng, J., He, J., Lim, E.-P., Yan, H., Li, X.: Comparing Twitter and Traditional Media using Topic Models. In: Clough, P., Foley, C., Gurrin, C., Jones, G.J.F., Kraaij, W., Lee, H., Mudoch, V. (eds.) ECIR 2011. LNCS, vol. 6611, pp. 338–349. Springer, Heidelberg (2011)

    Chapter  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

Wang, Q., Xu, J., Li, H. (2014). User Message Model: A New Approach to Scalable User Modeling on Microblog. In: Jaafar, A., et al. Information Retrieval Technology. AIRS 2014. Lecture Notes in Computer Science, vol 8870. Springer, Cham. https://doi.org/10.1007/978-3-319-12844-3_18

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-12844-3_18

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-12843-6

  • Online ISBN: 978-3-319-12844-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics