A Machine Learning Approach for Subjectivity Classification Based on Positional and Discourse Features

Chenlo, Jose M.; Losada, David E.

doi:10.1007/978-3-642-41057-4_3

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 8201)

Included in the following conference series:

Information Retrieval Facility Conference

Abstract

In recent years, several machine learning methods have been proposed to detect subjective (opinionated) expressions within on-line documents. This task is important in many Opinion Mining and Sentiment Analysis applications. However, the opinion extraction process is often done with rough content-based features. In this paper, we study the role of structural features in guiding sentence-level subjectivity classification. More specifically, we combine classical n-gram features with novel features defined from positional information and from the discourse structure of the sentences. Our experiments show that these new features are beneficial in the classification of subjective sentences.
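To make the feature combination concrete, the following is a minimal sketch of how n-gram, positional, and discourse features for one sentence might be merged into a single feature dictionary. It is not the authors' implementation: the abstract does not specify the exact features, so every function name, the feature keys, and the `DISCOURSE_CUES` table are assumptions made for illustration. In particular, the cue-word lookup is only a crude stand-in for discourse relations, which a real system would obtain from a discourse parser.

```python
from collections import Counter

# Hypothetical cue words standing in for discourse relations (assumption);
# a real system would take relations from a discourse parser instead.
DISCOURSE_CUES = {
    "but": "contrast",
    "however": "contrast",
    "because": "cause",
    "although": "concession",
    "if": "condition",
}


def ngram_features(tokens, n_max=2):
    """Count word n-grams of length 1..n_max, keyed as 'ng=<w1>_<w2>'."""
    feats = Counter()
    for n in range(1, n_max + 1):
        for i in range(len(tokens) - n + 1):
            feats["ng=" + "_".join(tokens[i:i + n])] += 1
    return feats


def positional_features(index, total):
    """Describe where the sentence sits inside its document."""
    relative = index / (total - 1) if total > 1 else 0.0
    return {
        "pos=relative": relative,
        "pos=first": 1.0 if index == 0 else 0.0,
        "pos=last": 1.0 if index == total - 1 else 0.0,
    }


def discourse_features(tokens):
    """Flag each (hypothetical) discourse relation signalled by a cue word."""
    feats = {}
    for tok in tokens:
        relation = DISCOURSE_CUES.get(tok)
        if relation is not None:
            feats["disc=" + relation] = 1.0
    return feats


def sentence_features(sentences, index):
    """Merge the three feature families for the sentence at `index`."""
    tokens = sentences[index].lower().split()
    feats = dict(ngram_features(tokens))
    feats.update(positional_features(index, len(sentences)))
    feats.update(discourse_features(tokens))
    return feats


# Toy document (invented for illustration): one sentence per list entry.
doc = [
    "The camera has a 12 megapixel sensor",
    "However I think the battery is terrible",
    "It ships with a charger",
]
features = sentence_features(doc, 1)
```

A dictionary of this kind could then be vectorized and passed to a sentence-level classifier trained on subjective and objective examples. Prefixing keys by family (`ng=`, `pos=`, `disc=`) keeps the three groups separable, so each can be switched on or off when measuring its contribution.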

Author information

Authors and Affiliations

Centro de Investigación en Tecnoloxías da Información (CITIUS), Universidad de Santiago de Compostela, Spain
Jose M. Chenlo & David E. Losada

Editor information

Editors and Affiliations

Institute of Software Technology and Interactive Systems, Vienna University of Technology, Favoritenstraße 9-11/188, 1040, Vienna, Austria
Mihai Lupu
Google Inc., Brandschenkestraße 110, 8002, Zurich, Switzerland
Evangelos Kanoulas
Department of Multimedia and Graphic Arts, Cyprus University of Technology, 30 Archbishop Kyprianou Street, 3036, Limassol, Cyprus
Fernando Loizides

About this paper

Cite this paper

Chenlo, J.M., Losada, D.E. (2013). A Machine Learning Approach for Subjectivity Classification Based on Positional and Discourse Features. In: Lupu, M., Kanoulas, E., Loizides, F. (eds) Multidisciplinary Information Retrieval. IRFC 2013. Lecture Notes in Computer Science, vol 8201. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-41057-4_3

DOI: https://doi.org/10.1007/978-3-642-41057-4_3
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-41056-7
Online ISBN: 978-3-642-41057-4
eBook Packages: Computer Science; Computer Science (R0)
