Skip to main content

Detecting Keyphrases in Micro-blogging with Graph Modeling of Information Diffusion

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 8862))

Abstract

The rapid increasing popularity of micro-blogging has made it an important information seeking channel. Keyphrase extraction is an effective way for summarizing and analyzing micro-blogging content, which can help users gain insights into internet hotspots. Existing methods for keyphrase extraction usually unilaterally consider phrase frequency or user retweet count as key factors. However, those methods may neglect the relationships between different phrases and the importance of user influence to further information diffusion. Generally, phrases shown in the influential users’ micro-blogs are more likely to attract other users’ interest, making them more likely to be diffused in the near future. Besides, phrases may have relations with each other, and some phrases usually have similar diffusion paths and attract the attention of the same population. In this paper, by comprehensively considering all the above mentioned factors to detect micro-blogging keyphrases, we proposed a novel model. The proposed model first detect high frequency term from abundant micro-blogs as candidate keyphrases, then construct a relation graph about them with user interest and user following web. Finally, we rank those candidates with graph models for realizing keyphrases detection. Experiments show this model is very effective for micro-blogging keyphrase extraction.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Aral, S., Brynjolfsson, E., Alstyne, M.V.: Productivity Effects of Information Diffusion in Networks, The MIT Center for Digital Business, paper 234 (2007)

    Google Scholar 

  2. Barker, K., Cornacchia, N.: Using Noun Phrase Heads to Extract Document Keyphrases. In: Hamilton, H.J. (ed.) Canadian AI 2000. LNCS (LNAI), vol. 1822, pp. 40–52. Springer, Heidelberg (2000)

    Chapter  Google Scholar 

  3. Bellaachia, A., Al-Dhelaan, M.: NE-Rank: A Novel Graph-Based Keyphrase Extraction in Twitter. In: Proceedings of the 2012 IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intelligent Agent Technology, pp. 372–379 (2012)

    Google Scholar 

  4. Brin, S., Page, L.: The anatomy of a large-scale hypertextual Web search engine. In: Proceedings of the Seventh International Conference on World Wide Web, pp. 107–117 (1998)

    Google Scholar 

  5. Celli, F., Di Lascio, F.M.L., Magnani, M., Pacelli, B., Rossi, L.: Social network data and practices: The case of FriendFeed. In: Chai, S.-K., Salerno, J.J., Mabry, P.L. (eds.) SBP 2010. LNCS, vol. 6007, pp. 346–353. Springer, Heidelberg (2010)

    Chapter  Google Scholar 

  6. Cha, M., Haddadi, H., Benevenuto, F., Gummadi, K.P.: Measuring user influence in Twitter: the million follower fallacy. In: Proceedings of the 4th International AAAI Conference on Weblogs and Social Media, pp. 10–17 (2010)

    Google Scholar 

  7. Cheng, J., Sun, A., Hu, D., Zeng, D.: An information diffusion based recommendation framework for micro-blogging. Journal of the Association for Information Systems 12(7), 463–486 (2011)

    Google Scholar 

  8. Choudhury, M.D., Lin, Y.-R., Sundaram, H.: How Does the Data Sampling Strategy Impact the Discovery of Information Diffusion in Social Media? In: Proceedings of the 4th International AAAI Conference on Weblogs and Social Media, pp. 34–41 (2010)

    Google Scholar 

  9. Ding, Z., Zhang, Q., Huang, X.: Keyphrase Extraction from Online News Using Binary Integer Programming. In: Proceedings of the 5th International Joint Conference on Natural Language Processing, pp. 165–173 (2011)

    Google Scholar 

  10. Haveliwala, T.: Topic Sensitive PageRank. In: Proceedings of the 11th International World Wide Web Conference, pp. 517–526 (2002)

    Google Scholar 

  11. Hussey, R., Williams, S., Mitchell, R., Field, I.: A Comparison of Automated Keyphrase Extraction Techniques and of Automatic Evaluation vs. Human Evaluation. International Journal on Advances in Life Sciences 4(3&4), 136–153 (2012)

    Google Scholar 

  12. Java, A., Song, X., Finin, T., Tseng, B.: Why we twitter: understanding microblogging usage and communities. In: Proceedings of the 9th WebKDD and 1st SNA-KDD 2007 Workshop on Web Mining and Social Network Analysis, pp. 56–65 (2007)

    Google Scholar 

  13. Nakagawa, H., Mori, T.: A simple but powerful automatic term extraction method. In: Proceedings of COLING 2002 on COMPUTERM 2002: Second International Workshop on Computational Terminology, pp. 1–7 (2002)

    Google Scholar 

  14. Li, X., Liu, B., Yu, P.: Time sensitive ranking with application to publication search. In: Proceedings of the 2008 Eighth IEEE International Conference on Data Mining, pp. 893–898 (2008)

    Google Scholar 

  15. Liu, Z., Huang, W., Zheng, Y., Sun, M.: Automatic keyphrase extraction via topic decomposition. In: Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, pp. 366–376 (2010)

    Google Scholar 

  16. Lui, M., Baldwin, T.: Cross-domain Feature Selection for Language Identification. In: Proceedings of the Fifth International Joint Conference on Natural Language Processing, pp. 553–561 (2011)

    Google Scholar 

  17. Yang, M., Lee, J., Lee, S., Rim, H.: Finding interesting posts in Twitter based on retweet graph analysis. In: Proceedings of the 35th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 1073–1074 (2012)

    Google Scholar 

  18. Mei, Q., Shen, X., Zhai, C.: Automatic labeling of multinomial topic model. In: Proceedings of the 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 490–499 (2007)

    Google Scholar 

  19. Mori, J., Ishizuka, M., Matsuo, Y.: Extracting Keyphrases to Represent Relations in Social Networks from Web. In: Proceedings of the Twentieth International Joint Conference on Artificial Intelligence, pp. 2820–2827 (2007)

    Google Scholar 

  20. Paukkeri, M., Nieminen, I., Pöllä, M., Honkela, T.: A Language-independent Approach to Keyphrase Extraction and Evaluation. In: Proceedings of the 22nd International Conference on Computational Linguistics, pp. 237–252 (2008)

    Google Scholar 

  21. Song, S., Li, Q., Zheng, N.: A spatio-temporal framework for related topic search in micro-blogging. In: An, A., Lingras, P., Petty, S., Huang, R. (eds.) AMT 2010. LNCS, vol. 6335, pp. 63–73. Springer, Heidelberg (2010)

    Chapter  Google Scholar 

  22. Song, S., Li, Q., Zheng, X.: Detecting popular topics in micro-blogging based on a user interest-based model. In: Proceedings of the 2012 International Joint Conference on Neural Networks, pp. 1–8 (2012)

    Google Scholar 

  23. Wan, X., Xiao, J.: Single Document Keyphrase Extraction Using Neighborhood Knowledge. In: Proceedings of the 23rd AAAI Conference on Artificial Intelligence, pp. 855–860 (2008)

    Google Scholar 

  24. Weng, J., Lim, E.-P., Jiang, J., He, Q.: TwitterRank: finding topic-sensitive influential twitterers. In: Proceedings of the 3rd ACM International Conference on Web Search and Data Mining, pp. 261–270 (2010)

    Google Scholar 

  25. Wu, W., Zhang, B., Ostendorf, M.: Automatic generation of personalized annotation tags for twitter users. In: Proceedings of the 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics, pp. 689–692 (2010)

    Google Scholar 

  26. Zhao, W., Jiang, J., He, J., Song, Y., Achananuparp, P., Lim, E.P., Li, X.: Topical Keyphrase Extraction from Twitter. In: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, pp. 379–388 (2011)

    Google Scholar 

  27. Cho, V., Esfahbod, B., Mansouri, M.: City of New York on Twitter:@ NYCGov. In: Proceedings of the 13th Annual International Conference on Digital Government Research, pp. 274–275 (2012)

    Google Scholar 

  28. Macdonald, C., Ounis, I.: Voting Techniques for Expert Search. J. Knowledge and Information Systems, 259–280 (2008)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

Song, S., Meng, Y., Sun, J. (2014). Detecting Keyphrases in Micro-blogging with Graph Modeling of Information Diffusion. In: Pham, DN., Park, SB. (eds) PRICAI 2014: Trends in Artificial Intelligence. PRICAI 2014. Lecture Notes in Computer Science(), vol 8862. Springer, Cham. https://doi.org/10.1007/978-3-319-13560-1_3

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-13560-1_3

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-13559-5

  • Online ISBN: 978-3-319-13560-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics