Phrase-Level Sentiment Polarity Classification Using Rule-Based Typed Dependencies and Additional Complex Phrases Consideration

  • Regular Paper
Journal of Computer Science and Technology

Abstract

The advent of Web 2.0 has led to an increase in user-generated content on the Web. This has provided an extensive collection of free-style texts with opinion expressions that could influence the decisions and actions of their readers. Providers of such content exert a certain level of influence on the receivers, as is evident from the effect blog sites have on their readers’ purchase decisions, political viewpoints, financial planning, and other choices. By detecting the opinions expressed, we can identify the sentiments on the topics discussed and the influence exerted on the readers. In this paper, we introduce an automatic approach to deriving polarity pattern rules for detecting sentiment polarity at the phrase level, and we additionally consider the effects of more complex relationships between words on sentiment polarity classification. Recent sentiment analysis research has focused on the functional relations of words using typed dependency parsing, which provides a refined analysis of the grammar and semantics of textual data. Heuristics are typically used to determine the typed dependency polarity patterns, but they may not comprehensively identify all possible rules. We study the use of class sequential rules (CSRs) to automatically learn the typed dependency patterns, and benchmark the performance of CSR against a heuristic method. Preliminary results show that CSR further improves classification performance, achieving F1 scores of over 80% in the test cases. In addition, we observe more complex relationships between words that could influence phrase sentiment polarity, and discuss possible approaches to handling the effects of these relationships.

Author information

Corresponding author

Correspondence to Luke Kien-Weng Tan.

About this article

Cite this article

Tan, L.KW., Na, JC., Theng, YL. et al. Phrase-Level Sentiment Polarity Classification Using Rule-Based Typed Dependencies and Additional Complex Phrases Consideration. J. Comput. Sci. Technol. 27, 650–666 (2012). https://doi.org/10.1007/s11390-012-1251-y

  • DOI: https://doi.org/10.1007/s11390-012-1251-y
