Skip to main content

Distinguishing the Popularity between Topics: A System for Up-to-Date Opinion Retrieval and Mining in the Web

  • Conference paper
Computational Linguistics and Intelligent Text Processing (CICLing 2013)

Abstract

The constantly increasing amount of opinionated texts found in the Web had a significant impact in the development of sentiment analysis. So far, the majority of the comparative studies in this field focus on analyzing fixed (offline) collections from certain domains, genres, or topics. In this paper, we present an online system for opinion mining and retrieval that is able to discover up-to-date web pages on given topics using focused crawling agents, extract opinionated textual parts from web pages, and estimate their polarity using opinion mining agents. The evaluation of the system on real-world case studies, demonstrates that is appropriate for opinion comparison between topics, since it provides useful indications on the popularity based on a relatively small amount of web pages. Moreover, it can produce genre-aware results of opinion retrieval, a valuable option for decision-makers.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Chen, X., Zhang, X.: Hawk: A focused crawler with content and link analysis. In: Proc. of the International Conference on e-Business Engineering (ICEBE), Xi’an Jiaotong Xian, China (2008)

    Google Scholar 

  2. Gerani, S., Carman, M.J., Crestani, F.: Proximity-based opinion retrieval. In: 33rd International Conference on Research and Development in Information Retrieval (SIGIR), Geneva, Switzerland (2010)

    Google Scholar 

  3. Hati, D., Sahoo, B., Kumar, A.: Adaptive focused crawling based on link analysis. In: Proc. of 2nd International Conference on Education Technology and Computer (ICETC), Shanghai, China (2010)

    Google Scholar 

  4. Jia, L., Yu, C., Zhang, W.: Uic at trec 2008 blog track. In: Proc. of The 17th Text Retrieval Conference (TREC), Gaithersburg, USA (2008)

    Google Scholar 

  5. Liu, S., Liu, F., Yu, C., Meng, W.: An effective approach to document retrieval via utilizing wordnet and recognizing phrases. In: Proc. of the 27th International Conference on Research and Development in Information Retrieval (SIGIR), Sheffield, United Kingdom (2004)

    Google Scholar 

  6. Wiegand, D.K.M.: Bootstrapping supervised machine-learning polarity classifiers with rule-based classification. In: Proceedings of the 1st ECAI-Workshop on Computational Approaches to Subjectivity and Sentiment Analysis (WASSA), Lisbon, Portugal (2009)

    Google Scholar 

  7. Macdonald, C., Ounis, I., Soboroff, I.: Overview of trec-2009 blog track. In: Proc. of The 17th Text Retrieval Conference (TREC), Gaithersburg, USA (2009)

    Google Scholar 

  8. Manning, C.D., Raghavan, P., Schtze, H.: Introduction to Information Retrieval. Cambridge University Press (2008)

    Google Scholar 

  9. Mishne, G.: Multiple ranking strategies for opinion retrieval in blogs. In: Proc. of the 15th Text Retrieval Conference (TREC), Gaithersburg, USA (2006)

    Google Scholar 

  10. Ounis, I., de Rijke, M., Macdonald, C., Mishne, G., Soboroff, I.: Overview of the trec-2006 blog track. In: Proc. of TREC, Gaithersburg, USA (2006)

    Google Scholar 

  11. Pal, A., Tomar, D.S., Shrivastava, S.C.: Effective Focused Crawling Based on Content and Link Structure Analysis. Journal of Computer Science 2(1) (2009)

    Google Scholar 

  12. Pang, B., Lee, L.: Opinion mining and sentiment analysis. Foundations and Trends in Information Retrieval 2(1-2), 1–135 (2008)

    Article  Google Scholar 

  13. Pappas, N., Katsimpras, G., Stamatatos, E.: An agent-based focused crawling framework for topic- and genre-related web document discovery. In: Proc. of the 24th IEEE International Conference on Tools with Artificial Intelligence (ICTAI), Athens, Greece (2012)

    Google Scholar 

  14. Pappas, N., Katsimpras, G., Stamatatos, E.: Extracting informative textual parts from web pages containing user-generated content. In: Proc. of the 12th International Conference on Knowledge Management and Knowledge Technologies (i-Know), Graz, Austria (2012)

    Google Scholar 

  15. Riloff, E., Wiebe, J.: Learning extraction patterns for subjective expressions. In: Proc. of the International Conference on Empirical Methods in Natural Language Processing (EMNLP), Sapporo, Japan (2003)

    Google Scholar 

  16. Turney, P.D.: Thumbs up or thumbs down?: semantic orientation applied to unsupervised classification of reviews. In: Proc. of the 40th Annual Meeting on Association for Computational Linguistics (ACL), Philadelphia, USA (2002)

    Google Scholar 

  17. Wiebe, J., Wilson, T., Bruce, R., Bell, M., Martin, M.: Learning subjective language. Computational Linguistics 30(3), 277–308 (2004)

    Article  Google Scholar 

  18. Wilson, T., Wiebe, J., Hoffmann, P.: Recognizing contextual polarity in phrase-level sentiment analysis. In: International Conference on Human Language Technology and Empirical Methods in Natural Language Processing (HLT/EMNLP), Vancouver, Canada (2005)

    Google Scholar 

  19. Yang, K.: Widit in trec 2008 blog track: Leveraging multiple sources of opinion evidence. In: Proc. of The 17th Text Retrieval Conference (TREC), Gaithersburg, USA (2009)

    Google Scholar 

  20. Yang, K., Yu, N., Valerio, R., Zhang, H.: Widit in trec-2006 blog track. In: Proc. of The 14th Text Retrieval Conference (TREC), Gaithersburg, USA (2006)

    Google Scholar 

  21. Zhang, M., Ye, X.: A generation model to unify topic relevance and lexicon-based sentiment for opinion retrieval. In: Proc. of the 31st International Conference on Research and Development in Information Retrieval (SIGIR), Singapore (2008)

    Google Scholar 

  22. Zhang, W., Yu, C., Meng, W.: Opinion retrieval from blogs. In: Proc. of the 16th International Conference on Information and Knowledge Management (CIKM), Lisbon, Portugal (2007)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Pappas, N., Katsimpras, G., Stamatatos, E. (2013). Distinguishing the Popularity between Topics: A System for Up-to-Date Opinion Retrieval and Mining in the Web. In: Gelbukh, A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2013. Lecture Notes in Computer Science, vol 7817. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-37256-8_17

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-37256-8_17

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-37255-1

  • Online ISBN: 978-3-642-37256-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics