Sentence to Document Level Emotion Tagging – A Coarse-Grained Study on Bengali Blogs

  • Dipankar Das
  • Sivaji Bandyopadhyay
Part of the Lecture Notes in Computer Science book series (LNCS, volume 6256)


This paper presents the identification of document level emotions from the sentential emotions obtained at word level granularity. Each of the Bengali blog documents consists of a topic and corresponding user comments. Sense weight based average scoring technique for assigning sentential emotion tag follows the word level emotion tagging using Support Vector Machine (SVM) approach. Cumulative summation of sentential emotion scores is assigned to each document considering the combinations of some heuristic features. An average F-Score of 59.32% with respect to all emotion classes is achieved on 95 documents on the development set by incorporating the best feature combination into account. Instead of assigning a single emotion tag to a document, each document is assigned with the best two emotion tags according to the ordered emotion scores obtained. The best two system assigned emotion tags of each document are compared against best two human annotated emotion tags. Evaluation of 110 test documents yields an average F-Score of 59.50% with respect to all emotion classes.


Document SVM Emotion Tagging Heuristic Features 


  1. 1.
    Esuli, A., Sebastiani, F.: SENTIWORDNET: A Publicly Available Lexical Resource for Opinion Mining. In: LREC-06 (2006)Google Scholar
  2. 2.
    McCallum, A., Pereira, F., Lafferty, J.: Conditional Random Fields: Probabilistic Models for Segmenting and labeling Sequence Data. ISBN, 282–289 (2001)Google Scholar
  3. 3.
    Vincent, B., Xu, L., Chesley, P., Srhari, R.K.: Using verbs and adjectives to automatically classify blog sentiment. In: Proceedings of AAAI-CAAW-06, the Spring Symposia (2006)Google Scholar
  4. 4.
    Pang, B., Lee, L.: Foundations and Trends in Information Retrieval 2, vol. 1-2, pp. 1–135 (2008)Google Scholar
  5. 5.
    Cortes, C., Vapnik, V.: Support-Vector Network. Machine Learning 20, 273–297 (1995)zbMATHGoogle Scholar
  6. 6.
    Claire, C., Janyce, W., Theresa, W., Litman Diane, J.: Combining Low-Level and Summary Representations of Opinions for Multi-Perspective Question Answering. New Directions in Question Answering, 20–27 (2003)Google Scholar
  7. 7.
    Alm, C.O., Roth, D., Sproat, R.: Emotions from text: machine learning for text-based emotion prediction. In: Proceedings of the Conference on HLT (EMNP) Vancouver, British Columbia, Canada, pp. 579–586 (2005)Google Scholar
  8. 8.
    Yang, C., Lin, K.H.-Y., Chen, H.-H.: Emotion classification Using Web Blog Corpora. In: IEEE, WIC, ACM International Conference on Web Intelligence, pp. 275–278 (2007)Google Scholar
  9. 9.
    Das, D., Bandyopadhyay, S.: Emotion Tagging – A Comparative Study on Bengali and English Blogs. In: ICON-09, India, pp. 177–184 (2009)Google Scholar
  10. 10.
    Das, D., Bandyopadhyay, S.: Word to Sentence Level Emotion Tagging for Bengali Blogs. In: ACL-IJCNLP 2009, Suntec, Singapore, pp. 149–152 (2009)Google Scholar
  11. 11.
    Das, D., Bandyopadhyay, S.: Developing Bengali WordNet Affect for Analyzing Emotion. In: ICCPOL-2010, California, USA (2010)Google Scholar
  12. 12.
    Sebastiani, F.: Machine learning in automated text categorization. ACM Computing Surveys 34(1) (2002)Google Scholar
  13. 13.
    Mishne, G.: Experiments with Mood Classification in Blog Posts. In: Proceedings of 1st Workshop on Stylistic Analysis of Text for Information Access (2005)Google Scholar
  14. 14.
    Mishne, G., de Rijke, M.: Capturing Global Mood Levels using Blog Posts. In: Proceedings of AAAI, Spring Symposium on Computational Approaches to Analysing Weblogs, pp. 145–152 (2006)Google Scholar
  15. 15.
    Lin, K.H.-Y., Yang, C., Chen, H.-H.: What Emotions News Articles Trigger in Their Readers? In: Proceedings of SIGIR, pp. 733–734 (2007)Google Scholar
  16. 16.
    Ku, L.-W., Liang, Y.-T., Chen, H.-H.: Opinion extraction, summarization and tracking in news and blog corpora. In: AAAI-2006 Spring Symposium on Computational Approaches to Analyzing Weblogs, AAAI Technical Report, pp. 100–107 (2006)Google Scholar
  17. 17.
    Ekman, P.: Facial expression and emotion. American Psychologist 48(4), 384–392 (1993)CrossRefGoogle Scholar
  18. 18.
    Turney, P.D.: Thumbs Up or Thumbs Down? Semantic Orientation Applied to Unsupervised Classification of Reviews. In: Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics, pp. 417–424 (2002)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2010

Authors and Affiliations

  • Dipankar Das
    • 1
  • Sivaji Bandyopadhyay
    • 1
  1. 1.Department of Computer Science and EngineeringJadavpur UniversityIndia

Personalised recommendations