Skip to main content

Defining Semantic Meta-hashtags for Twitter Classification

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 7824))

Abstract

Given the wide spread of social networks, research efforts to retrieve information using tagging from social networks communications have increased. In particular, in Twitter social network, hashtags are widely used to define a shared context for events or topics. While this is a common practice often the hashtags freely introduced by the user become easily biased. In this paper, we propose to deal with this bias defining semantic meta-hashtags by clustering similar messages to improve the classification. First, we use the user-defined hashtags as the Twitter message class labels. Then, we apply the meta-hashtag approach to boost the performance of the message classification.

The meta-hashtag approach is tested in a Twitter-based dataset constructed by requesting public tweets to the Twitter API. The experimental results yielded by comparing a baseline model based on user-defined hashtags with the clustered meta-hashtag approach show that the overall classification is improved. It is concluded that by incorporating semantics in the meta-hashtag model can have impact in different applications, e.g. recommendation systems, event detection or crowdsourcing.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Efron, M.: Hashtag retrieval in a microblogging environment. In: SIGIR, pp. 787–788 (2010)

    Google Scholar 

  2. Abel, F., Gao, Q., Houben, G.-J., Tao, K.: Semantic enrichment of twitter posts for user profile construction on the social web. In: Antoniou, G., Grobelnik, M., Simperl, E., Parsia, B., Plexousakis, D., De Leenheer, P., Pan, J. (eds.) ESWC 2011, Part II. LNCS, vol. 6644, pp. 375–389. Springer, Heidelberg (2011)

    Chapter  Google Scholar 

  3. Sakaki, T., Okazaki, M., Matsuo, Y.: Earthquake shakes twitter users: real-time event detection by social sensors. In: Proceedings of the 19th International Conference on World Wide Web, pp. 851–860 (2010)

    Google Scholar 

  4. Popescu, A.-M., Pennacchiotti, M.: Detecting controversial events from twitter. In: CIKM, pp. 1873–1876 (2010)

    Google Scholar 

  5. Becker, H., Naaman, M., Gravano, L.: Beyond trending topics: Real-world event identification on twitter. In: ICWSM (2011)

    Google Scholar 

  6. Ovadia, S.: Exploring the Potential of Twitter as a Research Tool. Behavioral & Social Sciences Librarian 28(4), 202–205 (2009)

    Article  Google Scholar 

  7. Rowe, M., Stankovic, M.: Mapping tweets to conference talks: A goldmine for semantics. In: Workshop on Social Data on the Web, SDoW (2010)

    Google Scholar 

  8. Weller, K., Puschmann, C.: Twitter for Scientific Communication: How Can Citations/References be Identified and Measured?, pp. 1–4 (2011), http://journal.webscience.org/500/

  9. Abel, F., Gao, Q., Houben, G.-J., Tao, K.: Analyzing user modeling on twitter for personalized news recommendations. In: Proceedings of the 19th International Conference on User Modeling, Adaption, and Personalization, pp. 1–12 (2011)

    Google Scholar 

  10. Tumasjan, A., Sprenger, T.O., Sandner, P.G., Welpe, I.M.: Predicting elections with twitter: What 140 characters reveal about political sentiment. In: Proceedings of the Fourth International AAAI Conference on Weblogs and Social Media, pp. 178–185 (2010)

    Google Scholar 

  11. O’Connor, B., Balasubramanyan, R., Routledge, B.R., Smith, N.A.: From Tweets to Polls: Linking Text Sentiment to Public Opinion Time Series. In: Proceedings of the International AAAI Conference on Weblogs and Social Media (2010)

    Google Scholar 

  12. Huang, J., Thornton, K.M., Efthimiadis, E.N.: Conversational tagging in twitter. In: Proceedings of the 21st ACM Conference on Hypertext and Hypermedia, pp. 173–178 (2010)

    Google Scholar 

  13. Merriam-webster’s dictionary (October 2012), www.merriam-webster.com/

  14. Zappavigna, M.: Ambient affiliation: A linguistic perspective on Twitter. New Media & Society 13(5), 788–806 (2011)

    Article  Google Scholar 

  15. Johnson, S.: How twitter will change the way we live. Time Magazine 173, 23–32 (2009)

    Google Scholar 

  16. Tsur, O., Rappoport, A.: “What’s in a hashtag?: content based prediction of the spread of ideas in microblogging communities”. In: Proceedings of the Fifth ACM International Conference on Web Search and Data Mining, WSDM 2012, pp. 643–652 (2012)

    Google Scholar 

  17. Yang, L., Sun, T., Zhang, M., Mei, Q.: We know what @you #tag: does the dual role affect hashtag adoption? In: Proceedings of the 21st International Conference on World Wide Web, WWW 2012, pp. 261–270 (2012)

    Google Scholar 

  18. Chang, H.-C.: A new perspective on twitter hashtag use: diffusion of innovation theory. In: Proceedings of the 73rd ASIS&T Annual Meeting on Navigating Streams in an Information Ecosystem, ASIS&T 2010, vol. 47, pp. 85:1–85:4 (2010)

    Google Scholar 

  19. Doerr, B., Fouz, M., Friedrich, T.: Why rumors spread so quickly in social networks. Commun. ACM 55(6), 70–75 (2012)

    Article  Google Scholar 

  20. Tantipathananandh, C., Berger-Wolf, T.Y.: Finding Communities in Dynamic Social Networks. In: 2011 IEEE 11th International Conference on Data Mining, pp. 1236–1241 (2011)

    Google Scholar 

  21. Treiber, M., Schall, D., Dustdar, S., Scherling, C.: Tweetflows: flexible workflows with twitter. In: Proceedings of the 3rd International Workshop on Principles of Engineering Service-Oriented Systems, pp. 1–7 (2011)

    Google Scholar 

  22. Zangerle, E., Gassler, W., Specht, G.: Recommending #-tags in twitter. In: Proceedings of the Workshop on Semantic Adaptive Social Web, in Connection with the 19th International Conference on User Modeling, Adaptation and Personalization, UMAP 2011, pp. 67-78 (2011)

    Google Scholar 

  23. Oguztuzun, H., Ozdikis, O., Senkul, P.: Semantic Expansion of Hashtags for Enhanced Event Detection in Twitter. In: VLDB The First International Workshop on Online Social Systems (2012)

    Google Scholar 

  24. Koerich, A.: Improving classification performance using metaclasses. In: IEEE International Conference on Systems, Man and Cybernetics, pp. 717–722 (2003)

    Google Scholar 

  25. Vapnik, V.: The Nature of Statistical Learning Theory. Springer (1999)

    Google Scholar 

  26. Joachims, T.: Learning Text Classifiers with Support Vector Machines. Kluwer Academic Publishers, Dordrecht (2002)

    Book  Google Scholar 

  27. Tong, S., Koller, D.: Support vector machine active learning with applications to text classification. The Journal of Machine Learning Research 2, 45–66 (2002)

    MATH  Google Scholar 

  28. Costa, J., Silva, C., Antunes, M., Ribeiro, B.: On using crowdsourcing and active learning to improve classification performance. In: Proceedings of 11th International Conference on Intelligent Systems Design and Applications (ISDA), pp. 469–474 (2011)

    Google Scholar 

  29. van Rijsbergen, C.: Information Retrieval. Butterworths (1979)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Costa, J., Silva, C., Antunes, M., Ribeiro, B. (2013). Defining Semantic Meta-hashtags for Twitter Classification. In: Tomassini, M., Antonioni, A., Daolio, F., Buesser, P. (eds) Adaptive and Natural Computing Algorithms. ICANNGA 2013. Lecture Notes in Computer Science, vol 7824. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-37213-1_24

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-37213-1_24

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-37212-4

  • Online ISBN: 978-3-642-37213-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics