Robust sentiment fusion on distribution of news

  • Mohammad Kamel
  • Farzaneh Namdar Siuky
  • Hadi Sadoghi YazdiEmail author


In recent years, since technology has revolutionized human life, plenty of data and information are published on the Internet each day. These shared data contain sentiments (i.e., positive, neutral or negative) toward various topics. Hence, there is a growing need for some techniques and tools which make anticipations about the sentiment of documents. Some sentiment analysis tools extract sentiments for words or sentences separately; moreover, these tools might generate noisy results. Moreover, some documents may contain noisy sentences(not relevant sentences to the context of the document). Therefore, we present some fusion methods to fuse all separated sentiments to have a noise robust value as the whole document sentiment. The proposed robust sentiment analysis tool is called RSF. Initially, the sentiment of sentences are extracted by CoreNLP; afterward, due to the histogram of extracted sentence sentiments, documents are divided into two separated groups. The first group consists of documents which have one main concept and some noisy signals. On the other hand, the second group consists of documents with two main concepts. Afterward, the separated sentiments are fused by using non-linear and linear operator with different loss functions. Moreover, an approach is proposed to evaluate the different loss functions utilized in the fusion step. This approach is based on spaCy and neural network which calculate the train and test errors of different fusion methods to determine the most efficient loss function. Eventually, a news corpus of English news of ISNA is introduced in this paper.


Natural language processing Sentiment analysis Fusion method Neural network News corpus ISNA-set 



  1. 1.
    Aggarwal CC, Zhai C (2012) Mining text data. Springer, New York. CrossRefGoogle Scholar
  2. 2.
    Ashkezari-Toussi S, Sadoghi-Yazdi H (2019) Robust diffusion LMS over adaptive networks. Signal Process 158:201–209. CrossRefGoogle Scholar
  3. 3.
    Boiy E, Moens M-F (2009) A machine learning approach to sentiment analysis in multilingual web texts. Inf Retr 12(5):526–558. CrossRefGoogle Scholar
  4. 4.
    Braunstein SL (1992) How large a sample is needed for the maximum likelihood estimator to be approximately gaussian?. J Phys A Math Gen 25(13):3813. MathSciNetCrossRefGoogle Scholar
  5. 5.
    Cai X, Hu S, Lin X (2012) Feature extraction using restricted boltzmann machine for stock price prediction. In: 2012 IEEE international conference on computer science and automation engineering (CSAE), IEEE.
  6. 6.
    Čapek D, Metscher BD, Müller GB (2002) Thumbs up or thumbs down?: semantic orientation applied to unsupervised classication of reviews. In: Proceedings of the 40th annual meeting on association for computational linguistics 322, 1–12.
  7. 7.
    Chatzakou D, Vakali A (2015) Harvesting opinions and emotions from social media textual resources. IEEE Internet Comput 19:46–50. CrossRefGoogle Scholar
  8. 8.
    Choi JD, Tetreault JR, Stent A (2015) It depends: Dependency parser comparison using a web-based evaluation tool. In: ACLGoogle Scholar
  9. 9.
    Dasarathy B (1997) Sensor fusion potential exploitation-innovative architectures and illustrative applications. Proc IEEE 85(1):24–38. CrossRefGoogle Scholar
  10. 10.
    Deonna J, Teroni F (2012) The emotions: a philosophical introduction, Vol. 69 RoutledgeGoogle Scholar
  11. 11.
    Durrant-Whyte H (1988) Sensor models and multisensor integration. Int J Robot Res 7(6):97–113. CrossRefGoogle Scholar
  12. 12.
    Durrant-whyte H, Stevens M Data fusion in decentralized sensing networks, Information Fusion - INFFUSGoogle Scholar
  13. 13.
    Duan X, Banchs RE, Zhang M, Li H, Kumara A (2016) Proceedings of news 2016 the sixth named entities workshopGoogle Scholar
  14. 14.
    Gan Q, Harris C (2001) Comparison of two measurement fusion methods for kalman-filter-based multisensor data fusion. IEEE Trans Aerosp Electron Syst 37 (1):273–279. CrossRefGoogle Scholar
  15. 15.
    Gao L, Guo Z, Zhang H, Xu X, Shen HT (2017) Video captioning with attention-based LSTM and semantic consistency. IEEE Trans Multimed 19(9):2045–2055. CrossRefGoogle Scholar
  16. 16.
    Greene D, Cunningham P (2006) Practical solutions to the problem of diagonal dominance in kernel document clustering. In: Proceedings of the 23rd international conference on machine learning, ICML ’06. ACM, New York, pp 377–384
  17. 17.
    Honnibal M, Johnson M (2015) An improved non-monotonic transition system for dependency parsing. In: Proceedings of the 2015 conference on empirical methods in natural language processing. Association for Computational Linguistics, Lisbon, pp 1373–1378.
  18. 18.
    Hussein DME-DM A survey on sentiment analysis challenges., King Suad University.
  19. 19.
    Hu M, Liu B (2004) Mining and summarizing customer reviews. In: Proceedings of the tenth ACM SIGKDD international conference on knowledge discovery and data mining, KDD ’04. ACM, New York, pp 168–177
  20. 20.
    Intarapaiboon P (2015) A framework for text classification using intuitionistic fuzzy sets.
  21. 21.
    Kadhim AI, Cheah Y-N, Ahamed NH Text document preprocessing and dimension reduction techniques for text document clustering.
  22. 22.
    Liang H, Sun X, Sun Y, Gao Y (2017) Text feature extraction based on deep learning: a review. EURASIP Journal on Wireless Communications and Networking, 1.
  23. 23.
    Liu B (2012) Sentiment analysis and opinion mining, Morgan & Claypool PublishersGoogle Scholar
  24. 24.
    Luo R, Yih C-C, Su KL (2002) Multisensor fusion and integration: approaches, applications, and future research directions. IEEE Sensors J 2(2):107–119. CrossRefGoogle Scholar
  25. 25.
    Manyika J, Durrant-Whyte H (1994) Data Fusion and Sensor Management: A Decentralized Information-Theoretic Approach (Ellis Horwood Series in Electrical and Electronic Engineering), Prentice Hall.
  26. 26.
    Medhat W, Hassan A, Korashy H (2014) Sentiment analysis algorithms and applications: a survey. A in Shams Eng J 5:1093–1113. CrossRefGoogle Scholar
  27. 27.
    Pang B, Lee L, Vaithyanathan S (2002) Thumbs up?: sentiment classification using machine learning techniques. In: Proceedings of the ACL-02 Conference on Empirical Methods in Natural Language Processing - Volume 10, EMNLP ’02. Association for Computational Linguistics, Stroudsburg, pp 79–86
  28. 28.
    Pradhan L, Taneja NA, Dixit C, Suhag M (2017) Comparison of text classifiers on news articles. Procedia Technology 66:S104. Google Scholar
  29. 29.
    Qin ZL (2013) Sparse automatic encoder application in text categorization research. In: Science technology and engineeringGoogle Scholar
  30. 30.
    Rumelhart DE, Hinton GE, Williams RJ (1986) Learning internal representations by error propagation. In: Rumelhart DE, McClelland JL (eds) Parallel distributed processing. reprinted in (26). MIT Press, Cambridge, pp 318–362Google Scholar
  31. 31.
    Serrano-Guerrero J, Olivas JA, Romero FP, Herrera-Viedma E (2015) Sentiment analysis: a review and comparative analysis of web services. Inf Sci 311:18–38. CrossRefGoogle Scholar
  32. 32.
    Shaikh MA, Prendinger H, Mitsuru I (2007) Assessing sentiment of text by semantic dependency and contextual valence analysis. In: Proceedings of the 2nd international conference on affective computing and intelligent interaction, ACII ’07. Springer, Berlin, pp 191–202.
  33. 33.
    Shing J, Jang R (1993) Anfis: adaptive-network-based fuzzy inference system. IEEE Trans Syst Man Cybern 23:665–685CrossRefGoogle Scholar
  34. 34.
    Shopon M, Mohammed N, Abedin MA (2016) Bangla handwritten digit recognition using autoencoder and deep convolutional neural network. In: 2016 international workshop on computational intelligence (IWCI), IEEE.
  35. 35.
    Snyder B, Barzilay R (2007) Multiple aspect ranking using the good grief algorithm, In: Proceedings of the human language technology conference of the north american chapter of the association of computational linguistics (HLT-NAACL, pp. 300–307Google Scholar
  36. 36.
    Socher R, Perelygin A, Wu JY, Chuang J, Manning CD, Ng AY, Potts C Recursive deep models for semantic compositionality over a sentiment treebankGoogle Scholar
  37. 37.
    Song J, Yang Y, Huang Z, Shen HT, Luo J (2013) Effective multiple feature hashing for large-scale near-duplicate video retrieval. IEEE Trans Multimed 15(8):1997–2008. CrossRefGoogle Scholar
  38. 38.
    Song J, Guo Y, Gao L, Li X, Hanjalic A, Shen HT (2018) From deterministic to generative: Multimodal stochastic RNNs for video captioning. IEEE Trans Neural Netw Learn Syst 2018:1–12. Google Scholar
  39. 39.
    Sun X, Ni Y (2006) Recurrent neural network with kernel feature extraction for stock prices forecasting. In: 2006 international conference on computational intelligence and security, IEEE.
  40. 40.
    Sun X, Li C, Xu W, Ren F (2014) Chinese microblog sentiment classification based on deep belief nets with extended multi-modality features. In: 2014 IEEE international conference on data mining workshop, IEEE.
  41. 41.
    Taboada M, Anthony C, Voll K (2006) Methods for creating semantic orientation dictionaries. In: Conference on language resources and evaluation (LREC), pp 427–432Google Scholar
  42. 42.
    Takamura H, Inui T, Okumura M (2005) Extracting semantic orientations of words using spin model. In: Proceedings of the 43rd annual meeting on association for computational linguistics, ACL ’05. Association for Computational Linguistics, Stroudsburg, pp 133–140
  43. 43.
    Tellez ES, Miranda-Jiménez S, Graff M, Moctezuma D, Siordia OS, Villaseñor EA (2017) A case study of spanish text transformations for twitter sentiment analysis. Expert System with Application 81:457–471. CrossRefGoogle Scholar
  44. 44.
    Titov I, McDonald R (2008) A joint model of text and aspect ratings for sentiment summarization. In: Proceedings of ACL-08: HLT. Association for Computational Linguistics, Columbus, pp 308–316
  45. 45.
    Vaseghi SV (2006) Advanced digital signal processing and noise reduction. Wiley, New YorkGoogle Scholar
  46. 46.
    Vilares D, Alonso MA, Gómez-Rodríguez C (2017) Supervised sentiment analysis in multilingual environments. Inf Process Manag 53:595–607. CrossRefGoogle Scholar
  47. 47.
    Wang Z, Ziou D, Armenakis C, Li D, Li Q (2005) A comparative analysis of image fusion methods. IEEE Trans Geosci Remote Sens 43(6):1391–1402. CrossRefGoogle Scholar
  48. 48.
    Wei W, Gulla JA (2010) Sentiment learning on product reviews via sentiment ontology tree. In: Proceedings of the 48th annual meeting of the association for computational linguistics, ACL ’10. Association for Computational Linguistics, Stroudsburg, pp 404–413.
  49. 49.
    Wiebe J, Mihalcea R (2006) Word sense and subjectivity. In: Proceedings of the 21st international conference on computational linguistics and the 44th annual meeting of the association for computational linguistics, ACL-44. Association for Computational Linguistics, Stroudsburg, pp 1065–1072
  50. 50.
    Wilson T, Wiebe J, Hoffmann P (2005) Recognizing contextual polarity in phrase-level sentiment analysis. In: Proceedings of the conference on human language technology and empirical methods in natural language processing, HLT ’05. Association for Computational Linguistics, Stroudsburg, pp 347–354.
  51. 51.
    Xiong Y, Wang D, Zhang Y, Feng S, Wang G (2014) Multimodal data fusion in text-image heterogeneous graph for social media recommendation. In: Web-Age information management, Springer International Publishing, pp 96–99.
  52. 52.
    Zhang S, He F, Ren W, Yao J Joint learning of image detail and transmission map for single image dehazing, The Visual Computer.

Copyright information

© Springer Science+Business Media, LLC, part of Springer Nature 2019

Authors and Affiliations

  1. 1.Center of Excellence on Soft Computing and Intelligent Information ProcessingFerdowsi University of MashhadMashhadIran

Personalised recommendations