Skip to main content

Comparative Analysis of Machine Learning Algorithms for Hybrid Sources of Textual Data: In Development of Domain Adaptable Sentiment Analysis Model

  • Conference paper
  • First Online:
Research in Intelligent and Computing in Engineering

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 1254))

  • 1152 Accesses

Abstract

Sentiment classification is the task of categorizing the text into different opinionated categories positive, negative and neutral depending upon the user’s post on social media in a particular domain. In social media, Twitter is the most popular platform for classification but with the limitations of number of words to be posted by individual which produces the inaccurate classification of dataset. Hence, in this paper, we are trying to increase the performance of the sentiment classification model by collecting the textual data on the same domain from different sources. To validate the same, different state-of-the-art machine learning classification algorithms have been applied and analyzed that shows the better results for hybrid data in comparison with the single-source Twitter (microblog) data and also articulated the most suitable algorithms for the hybrid textual sources of data.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Kušen E, Strembeck M (2018) Politics, sentiments, and misinformation: an analysis of the Twitter discussion on the 2016 Austrian presidential elections. Online Soc Netw Media 5:37–50. https://doi.org/10.1016/j.osnem.2017.12.002

    Article  Google Scholar 

  2. Sun T, Wang J, Zhang P, Cao Y, Liu B, Wang D (2017) Predicting stock price returns using microblog sentiment for chinese stock market. In: Proceedings 2017 3rd International conference big data computer communications BigCom 2017, pp 87–96. https://doi.org/10.1109/BIGCOM.2017.59

  3. Ghiassi M, Zimbra D, Lee S (2016) Targeted twitter sentiment analysis for brands using supervised feature engineering and the dynamic architecture for artificial neural networks. J Manag Inf Syst 33:1034–1058. https://doi.org/10.1080/07421222.2016.1267526

    Article  Google Scholar 

  4. Asghar MZ, Khan A, Ahmad S, Qasim M, Khan IA (2017) Lexicon-enhanced sentiment analysis framework using rule-based classification scheme. PLoS ONE 12:1–22. https://doi.org/10.1371/journal.pone.0171649

    Article  Google Scholar 

  5. Asghar MZ, Kundi FM, Ahmad S, Khan A, Khan F (2018) T-SAF: twitter sentiment analysis framework using a hybrid classification scheme. Expert Syst 35:1–19. https://doi.org/10.1111/exsy.12233

    Article  Google Scholar 

  6. Baccianella S, Esuli A, Sebastiani F (2010) SENTIWORDNET 3.0: an enhanced lexical resource for sentiment analysis and opinion mining. In: Proceedings of the 7th international conference on language resources and evaluation, LREC 2010. pp 2200–2204

    Google Scholar 

  7. Hamdan H, Bellot P, Bechet F (2015) Sentiment lexicon-based features for sentiment analysis in short text. In: Conference 16th international conference on intelligent text processing and computational linguistics, pp 217–226

    Google Scholar 

  8. Strapparava C, Valitutti A (2004) WordNet-affect: an affective extension of WordNet. In: Proceedings of the 4th international conference on language resources and evaluation, LREC 2004, pp 1083–1086

    Google Scholar 

  9. Appel O, Chiclana F, Carter J, Fujita H (2018) Successes and challenges in developing a hybrid approach to sentiment analysis. Appl Intell 48:1176–1188. https://doi.org/10.1007/s10489-017-0966-4

    Article  Google Scholar 

  10. Kolchyna O, Souza TTP, Treleaven P, Aste T (2015) Twitter sentiment analysis: lexicon method, machine learning method and their combination

    Google Scholar 

  11. Zainuddin N, Selamat A, Ibrahim R (2018) Hybrid sentiment classification on twitter aspect-based sentiment analysis. Appl Intell 48:1218–1232. https://doi.org/10.1007/s10489-017-1098-6

    Article  Google Scholar 

  12. Bollegala D, Weir D, Carroll J (2011) Using multiple sources to construct a sentiment sensitive thesaurus for cross-domain sentiment classification. In: ACL-HLT 2011—Proceedings of the 49th annual meeting of the association for computational linguistics human language technologies, vol 1, pp 132–141

    Google Scholar 

  13. Chan WN, Thein T (2018) A comparative study of machine learning techniques for real-time multi-tier sentiment analysis. In: 1st IEEE international conference on knowledge innovation and invention, ICKII 2018, Institute of Electrical and Electronics Engineers Inc, pp 90–93. https://doi.org/10.1109/ICKII.2018.8569169

  14. Mansour R, Hady MFA, Hosam E, Amr H, Ashour A (2015) Feature selection for twitter sentiment analysis: an experimental study. In: Lecture notes in computer science (including subseries Lecture notes in artificial intelligence and Lecture notes in bioinformatics), Springer, pp 92–103. https://doi.org/10.1007/978-3-319-18117-2_7

  15. Ghiassi M, Skinner J, Zimbra D (2013) Twitter brand sentiment analysis: a hybrid system using n-gram analysis and dynamic artificial neural network. Expert Syst Appl 40:6266–6282. https://doi.org/10.1016/j.eswa.2013.05.057

    Article  Google Scholar 

  16. Liu M, Song Y, Zou H, Zhang T (2019) Reinforced training data selection for domain adaptation. In: Proceedings of the 57th annual meeting of the association for computational linguistics, pp 1957–1968

    Google Scholar 

  17. Ducange P, Fazzolari M, Petrocchi M, Vecchio M (2019) Engineering applications of artificial intelligence an effective decision support system for social media listening based on cross-source sentiment analysis models. Eng Appl Artif Intell 78:71–85. https://doi.org/10.1016/j.engappai.2018.10.014

    Article  Google Scholar 

  18. Hassan F, Usman K, Saba Q (2018) Enhanced cross-domain sentiment classification utilizing a multi-source transfer learning approach. Soft Comput https://doi.org/10.1007/s00500-018-3187-9

    Article  Google Scholar 

  19. Sanders NJ (2011) Twitter sentiment corpus. Sanders analytics. Sanders analytics LLC Web 16 Nov 2013

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Vaishali Arya .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Arya, V., Agrawal, R. (2021). Comparative Analysis of Machine Learning Algorithms for Hybrid Sources of Textual Data: In Development of Domain Adaptable Sentiment Analysis Model. In: Kumar, R., Quang, N.H., Kumar Solanki, V., Cardona, M., Pattnaik, P.K. (eds) Research in Intelligent and Computing in Engineering. Advances in Intelligent Systems and Computing, vol 1254. Springer, Singapore. https://doi.org/10.1007/978-981-15-7527-3_16

Download citation

Publish with us

Policies and ethics