Comparative Analysis of Machine Learning Algorithms for Hybrid Sources of Textual Data: In Development of Domain Adaptable Sentiment Analysis Model

Arya, Vaishali; Agrawal, Rashmi

doi:10.1007/978-981-15-7527-3_16

Vaishali Arya¹⁹ &
Rashmi Agrawal¹⁹

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 1254))

1152 Accesses

Abstract

Sentiment classification is the task of categorizing the text into different opinionated categories positive, negative and neutral depending upon the user’s post on social media in a particular domain. In social media, Twitter is the most popular platform for classification but with the limitations of number of words to be posted by individual which produces the inaccurate classification of dataset. Hence, in this paper, we are trying to increase the performance of the sentiment classification model by collecting the textual data on the same domain from different sources. To validate the same, different state-of-the-art machine learning classification algorithms have been applied and analyzed that shows the better results for hybrid data in comparison with the single-source Twitter (microblog) data and also articulated the most suitable algorithms for the hybrid textual sources of data.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Kušen E, Strembeck M (2018) Politics, sentiments, and misinformation: an analysis of the Twitter discussion on the 2016 Austrian presidential elections. Online Soc Netw Media 5:37–50. https://doi.org/10.1016/j.osnem.2017.12.002
Article Google Scholar
Sun T, Wang J, Zhang P, Cao Y, Liu B, Wang D (2017) Predicting stock price returns using microblog sentiment for chinese stock market. In: Proceedings 2017 3rd International conference big data computer communications BigCom 2017, pp 87–96. https://doi.org/10.1109/BIGCOM.2017.59
Ghiassi M, Zimbra D, Lee S (2016) Targeted twitter sentiment analysis for brands using supervised feature engineering and the dynamic architecture for artificial neural networks. J Manag Inf Syst 33:1034–1058. https://doi.org/10.1080/07421222.2016.1267526
Article Google Scholar
Asghar MZ, Khan A, Ahmad S, Qasim M, Khan IA (2017) Lexicon-enhanced sentiment analysis framework using rule-based classification scheme. PLoS ONE 12:1–22. https://doi.org/10.1371/journal.pone.0171649
Article Google Scholar
Asghar MZ, Kundi FM, Ahmad S, Khan A, Khan F (2018) T-SAF: twitter sentiment analysis framework using a hybrid classification scheme. Expert Syst 35:1–19. https://doi.org/10.1111/exsy.12233
Article Google Scholar
Baccianella S, Esuli A, Sebastiani F (2010) SENTIWORDNET 3.0: an enhanced lexical resource for sentiment analysis and opinion mining. In: Proceedings of the 7th international conference on language resources and evaluation, LREC 2010. pp 2200–2204
Google Scholar
Hamdan H, Bellot P, Bechet F (2015) Sentiment lexicon-based features for sentiment analysis in short text. In: Conference 16th international conference on intelligent text processing and computational linguistics, pp 217–226
Google Scholar
Strapparava C, Valitutti A (2004) WordNet-affect: an affective extension of WordNet. In: Proceedings of the 4th international conference on language resources and evaluation, LREC 2004, pp 1083–1086
Google Scholar
Appel O, Chiclana F, Carter J, Fujita H (2018) Successes and challenges in developing a hybrid approach to sentiment analysis. Appl Intell 48:1176–1188. https://doi.org/10.1007/s10489-017-0966-4
Article Google Scholar
Kolchyna O, Souza TTP, Treleaven P, Aste T (2015) Twitter sentiment analysis: lexicon method, machine learning method and their combination
Google Scholar
Zainuddin N, Selamat A, Ibrahim R (2018) Hybrid sentiment classification on twitter aspect-based sentiment analysis. Appl Intell 48:1218–1232. https://doi.org/10.1007/s10489-017-1098-6
Article Google Scholar
Bollegala D, Weir D, Carroll J (2011) Using multiple sources to construct a sentiment sensitive thesaurus for cross-domain sentiment classification. In: ACL-HLT 2011—Proceedings of the 49th annual meeting of the association for computational linguistics human language technologies, vol 1, pp 132–141
Google Scholar
Chan WN, Thein T (2018) A comparative study of machine learning techniques for real-time multi-tier sentiment analysis. In: 1st IEEE international conference on knowledge innovation and invention, ICKII 2018, Institute of Electrical and Electronics Engineers Inc, pp 90–93. https://doi.org/10.1109/ICKII.2018.8569169
Mansour R, Hady MFA, Hosam E, Amr H, Ashour A (2015) Feature selection for twitter sentiment analysis: an experimental study. In: Lecture notes in computer science (including subseries Lecture notes in artificial intelligence and Lecture notes in bioinformatics), Springer, pp 92–103. https://doi.org/10.1007/978-3-319-18117-2_7
Ghiassi M, Skinner J, Zimbra D (2013) Twitter brand sentiment analysis: a hybrid system using n-gram analysis and dynamic artificial neural network. Expert Syst Appl 40:6266–6282. https://doi.org/10.1016/j.eswa.2013.05.057
Article Google Scholar
Liu M, Song Y, Zou H, Zhang T (2019) Reinforced training data selection for domain adaptation. In: Proceedings of the 57th annual meeting of the association for computational linguistics, pp 1957–1968
Google Scholar
Ducange P, Fazzolari M, Petrocchi M, Vecchio M (2019) Engineering applications of artificial intelligence an effective decision support system for social media listening based on cross-source sentiment analysis models. Eng Appl Artif Intell 78:71–85. https://doi.org/10.1016/j.engappai.2018.10.014
Article Google Scholar
Hassan F, Usman K, Saba Q (2018) Enhanced cross-domain sentiment classification utilizing a multi-source transfer learning approach. Soft Comput https://doi.org/10.1007/s00500-018-3187-9
Article Google Scholar
Sanders NJ (2011) Twitter sentiment corpus. Sanders analytics. Sanders analytics LLC Web 16 Nov 2013
Google Scholar

Download references

Author information

Authors and Affiliations

Manav Rachna International Institute of Research and Studies, Faridabad, India
Vaishali Arya & Rashmi Agrawal

Authors

Vaishali Arya
View author publications
You can also search for this author in PubMed Google Scholar
Rashmi Agrawal
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Vaishali Arya .

Editor information

Editors and Affiliations

Department of Computer Science and Engineering, GIET University, Gunupur, Odisha, India
Raghvendra Kumar
Institute of Engineering and Technology, Thu Dau Mot University, Binh Duong, Vietnam
Nguyen Ho Quang
CMR Institute of Technology, Hyderabad, Telangana, India
Vijender Kumar Solanki
Universidad Don Bosco, Antiguo Cuscatlán, El Salvador
Manuel Cardona
KIIT Deemed to be University, Bhubaneswar, Odisha, India
Prasant Kumar Pattnaik

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Arya, V., Agrawal, R. (2021). Comparative Analysis of Machine Learning Algorithms for Hybrid Sources of Textual Data: In Development of Domain Adaptable Sentiment Analysis Model. In: Kumar, R., Quang, N.H., Kumar Solanki, V., Cardona, M., Pattnaik, P.K. (eds) Research in Intelligent and Computing in Engineering. Advances in Intelligent Systems and Computing, vol 1254. Springer, Singapore. https://doi.org/10.1007/978-981-15-7527-3_16

Download citation

DOI: https://doi.org/10.1007/978-981-15-7527-3_16
Published: 05 January 2021
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-7526-6
Online ISBN: 978-981-15-7527-3
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics