N-gram Based Approach for Opinion Mining of Punjabi Text

Kaur, Amandeep; Gupta, Vishal

doi:10.1007/978-3-319-13365-2_8

Amandeep Kaur²³ &
Vishal Gupta²³

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 8875))

Included in the following conference series:

International Workshop on Multi-disciplinary Trends in Artificial Intelligence

816 Accesses
7 Citations

Abstract

Opinion mining is the process of analyzing views, attitude or opinions of a writer or a speaker. Research in this particular area involves the detection of opinions from the text of any language. Vast amount of work has been done for the English language. In spite of lack of resources for Indian languages, work has been done for Telugu, Bengali and Hindi language. In this paper, we proposed a hybrid research approach for the emotion/opinion mining of the Punjabi text. Hybrid technique is the combination of Naïve Bayes and N-grams. As the part of presented research, we have extracted the features of N-grams model which are used to train Naïve Bayes. The trained model is then validated using the testing data. Results obtained are also compared with already existing approaches and the accuracy of the results shows the better efficacy of the proposed method.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Biadsy, F., Mckeown, K., Agarwal, A.: Contextual phrase-level polarity analysis using lexical affect scoring and syntactic n-grams (2009)
Google Scholar
Esuli, A., Sebastiani, F., Baccianella, S.: Sentiwordnet 3.0: An enhanced lexical resource for sentiment analysis and opinion mining. In: Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC 2010), Valletta, Malta (2010)
Google Scholar
Das, A.: Opinion Extraction and Summarization from Text Documents in Bengali. Doctoral Thesis. UMI Order Number: UMI Order No. GAX95-09398. Jadavpur University (2011)
Google Scholar
Wiebe, J., Banea, C., Mihalcea, R.: A bootstrapping method for building subjectivity lexicons for languages with scarce resources. In: Proceedings of the Sixth International Language Resources and Evaluation (LREC 2008), Marrakech, Morocco (2008)
Google Scholar
Bandyopadhyay, S., Das, A.: SentiWordNet for Bangla (2010)
Google Scholar
Bandyopadhyay, S., Das, A.: SentiWordNet for Indian Languages (2010)
Google Scholar
Arora, P.: Sentiment Analysis for Hindi Language. Masters thesis, IIT, Hyderabad (2013)
Google Scholar
Sebastiani, F., Esuli, A.: Sentiwordnet: A publicly available lexical resource for opinion mining. In: Proceedings of the 5th Conference on Language Resources and Evaluation (LREC 2006), p. 54 (2006)
Google Scholar
McKeown, K.R., Hatzivassiloglou, V.: Predicting the semantic orientation of adjectives. In: Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics and Eighth Conference of the European Chapter of the Association for Computational Linguistics, ACL 1998, Stroudsburg, PA, USA, p. 174 (1997)
Google Scholar
Liu, B., Hu, M.: Mining and summarizing customer reviews. In: KDD, p. 168 (2004)
Google Scholar
Wilson, T., Intelligent, T.W.: Annotating opinions in the world press. In: SIGdial 2003, p. 13 (2003)
Google Scholar
Bhattacharyya, P., Joshi, A., Balamurali, A.R.: A fall-back strategy for sentiment analysis in Hindi: A case study (2010)
Google Scholar
Rijke, M.D., Kamps, J., Marx, M., Mokken, R.J.: Using wordnet to measure semantic orientation of adjectives. National Institute for, p. 1115 (2004)
Google Scholar
Indian Institute of Technology, Hyderabad, http://www.iith.ac.in/
Kim, S.: Determining the sentiment of opinions. In: Proceedings of COLING, p. 1367 (2004)
Google Scholar
Hovy, E., Kim, S.: Identifying and analyzing judgment opinions. In: Proceedings of HLT/NAACL 2006, p. 200 (2006)
Google Scholar
Vaithyanathan, S., Pang, B., Lee, L.: Thumbs up? Sentiment classification using machine learning techniques. In: Proceedings of the 2002 Conference on Empirical Methods in Natural Language Processing (EMNLP), p. 79 (2002)
Google Scholar
Ravichandran, D., Rao, D.: Semi-supervised polarity lexicon induction. In: Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics, EACL 2009, Stroudsburg, PA, USA, p. 675 (2009)
Google Scholar
Dunphy, D.C., Stone, P.J., Ogilvie, D.M., Smith, M.S.: The General Inquirer: A Computer Approach to Content Analysis. MIT Press, Cambridge (1966)
Google Scholar
Turney, P.: Thumbs up or thumbs down? Semantic orientation applied to unsupervised classification of reviews (2002)
Google Scholar
Wiebe, J.M., O’Hara, T.P., Bruce, R.E.: Development and use of a gold-standard data set for subjectivity classifications. In: Proceedings of the 37th Annual Meeting of the Association for Computational Linguistics on Computational Linguistics, ACL 1999, Stroudsburg, PA, USA, p. 246 (1999)
Google Scholar
Wilson, T.: Recognizing contextual polarity in phrase-level sentiment analysis. In: Proceedings of HLT-EMNLP, pp. 347–354 (2005)
Google Scholar
Hatzivassiloglou, V., Yu, H.: Towards answering opinion questions: Separating facts from opinions and identifying the polarity of opinion sentences. In: Proceedings of the 2003 Conference on Empirical Methods in Natural Language Processing, EMNLP 2003, Stroudsburg, PA, USA, p. 129 (2003)
Google Scholar
Gupta, V.: Automatic Stemming of Words for Punjabi Language. In: Thampi, S.M., Gelbukh, A., Mukhopadhyay, J. (eds.) Advances in Signal Processing and Intelligent Recognition Systems. AISC, vol. 264, pp. 73–84. Springer, Heidelberg (2014)
Chapter Google Scholar
http://www.punjabitribuneonline.com
http://www.indotimes.com.au
http://www.beta.ajitjalandhar.com
http://www.rozanaspokesman.com
http://www.dailypunjabtimes.com
http://www.deshsewak.in
http://www.nawanzamana.in
http://www.dailyjanjagriti.com
http://www.punjabpost.in
http://www.seapunjab.com
http://www.ajdiawaaz.com
http://www.malwapost.com
http://www.punjabinfoline.com
http://www.chardhikala.com
http://www.deshvideshtimes.com
http://www.punjab-screen.blogspot.in
http://www.parchanve.wordpress.com
http://www.punjabiaarsi.blogspot.in
http://www.kamalkang.blogspot.in
http://www.shabadsanjh.com
http://www.cs.waikato.ac.nz/ml/weka/

Download references

Author information

Authors and Affiliations

U.I.E.T., Panjab University, Chandigarh, India
Amandeep Kaur & Vishal Gupta

Authors

Amandeep Kaur
View author publications
You can also search for this author in PubMed Google Scholar
Vishal Gupta
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Computer Science and Automation, Indian Institute of Science, 560 012, Bangalore, India
M. Narasimha Murty
Faculty of Engineering and Information Technology, University of Technology Sydney, 2007, NSW, Australia
Xiangjian He
Department of Computer and Information Sciences, University of Hyderabad, 500046, Hyderabad, India
Raghavendra Rao Chillarige
LIP6, University Pierre et Marie Curie, 4 Place Jussieu, 75005, Paris, France
Paul Weng

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Kaur, A., Gupta, V. (2014). N-gram Based Approach for Opinion Mining of Punjabi Text. In: Murty, M.N., He, X., Chillarige, R.R., Weng, P. (eds) Multi-disciplinary Trends in Artificial Intelligence. MIWAI 2014. Lecture Notes in Computer Science(), vol 8875. Springer, Cham. https://doi.org/10.1007/978-3-319-13365-2_8

Download citation

DOI: https://doi.org/10.1007/978-3-319-13365-2_8
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-13364-5
Online ISBN: 978-3-319-13365-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics