Advertisement

Applied Intelligence

, Volume 49, Issue 5, pp 1688–1707 | Cite as

Improved whale optimization algorithm for feature selection in Arabic sentiment analysis

  • Mohammad Tubishat
  • Mohammad A. M. Abushariah
  • Norisma IdrisEmail author
  • Ibrahim Aljarah
Article
  • 140 Downloads

Abstract

To help individuals or companies make a systematic and more accurate decisions, sentiment analysis (SA) is used to evaluate the polarity of reviews. In SA, feature selection phase is an important phase for machine learning classifiers specifically when the datasets used in training is huge. Whale Optimization Algorithm (WOA) is one of the recent metaheuristic optimization algorithm that mimics the whale hunting mechanism. However, WOA suffers from the same problem faced by many other optimization algorithms and tend to fall in local optima. To overcome these problems, two improvements for WOA algorithm are proposed in this paper. The first improvement includes using Elite Opposition-Based Learning (EOBL) at initialization phase of WOA. The second improvement involves the incorporation of evolutionary operators from Differential Evolution algorithm at the end of each WOA iteration including mutation, crossover, and selection operators. In addition, we also used Information Gain (IG) as a filter features selection technique with WOA using Support Vector Machine (SVM) classifier to reduce the search space explored by WOA. To verify our proposed approach, four Arabic benchmark datasets for sentiment analysis are used since there are only a few studies in sentiment analysis conducted for Arabic language as compared to English. The proposed algorithm is compared with six well-known optimization algorithms and two deep learning algorithms. The comprehensive experiments results show that the proposed algorithm outperforms all other algorithms in terms of sentiment analysis classification accuracy through finding the best solutions, while its also minimizes the number of selected features.

Keywords

Arabic sentiment analysis Support vector machine Information gain Whale optimization algorithm 

References

  1. 1.
    Schouten K, Frasincar F (2016) Survey on aspect-level sentiment analysis. IEEE Trans Knowl Data Eng 28(3):813–830CrossRefGoogle Scholar
  2. 2.
    Rana TA, Cheah YN (2016) Aspect extraction in sentiment analysis: comparative analysis and survey. Artif Intell Rev 46(4):459–483CrossRefGoogle Scholar
  3. 3.
    Ahmad SR, Bakar AA, Yaakub MR (2015) Metaheuristic algorithms for feature selection in sentiment analysis. In Science and Information Conference (SAI), 2015 (pp. 222–226). IEEEGoogle Scholar
  4. 4.
    Liu H, Yu L (2005) Toward integrating feature selection algorithms for classification and clustering. IEEE Trans Knowl Data Eng 17(4):491–502CrossRefGoogle Scholar
  5. 5.
    Mirjalili S, Lewis A (2016) The whale optimization algorithm. Adv Eng Softw 95:51–67CrossRefGoogle Scholar
  6. 6.
    Aljarah I, Faris H, Mirjalili S (2018) Optimizing connection weights in neural networks using the whale optimization algorithm. Soft Comput 22(1):1–15CrossRefGoogle Scholar
  7. 7.
    Wang J, Du P, Niu T, Yang W (2017) A novel hybrid system based on a new proposed algorithm—multi-objective whale optimization algorithm for wind speed forecasting. Appl Energy 208:344–360CrossRefGoogle Scholar
  8. 8.
    Prakash DB, Lakshminarayana C (2017) Optimal siting of capacitors in radial distribution network using whale optimization algorithm. Alexandria Eng J 56(4):499–509CrossRefGoogle Scholar
  9. 9.
    Reddy PDP, Reddy VV, Manohar TG (2017) Whale optimization algorithm for optimal sizing of renewable resources for loss reduction in distribution systems. Renewables: Wind Water Solar 4(1):3CrossRefGoogle Scholar
  10. 10.
    Mostafa A, Hassanien AE, Houseni M, Hefny H (2017) Liver segmentation in MRI images based on whale optimization algorithm. Multimed Tools Appl 76(23):24931–24954CrossRefGoogle Scholar
  11. 11.
    Zhou X, Wu Z, Wang H (2012) Elite opposition-based differential evolution for solving large-scale optimization problems and its implementation on GPU. In Parallel and Distrib Comput, Applications and Technologies (PDCAT), 2012 13th International Conference on (pp. 727–732). IEEEGoogle Scholar
  12. 12.
    Sarkhel R, Das N, Saha AK, Nasipuri M (2018) An improved harmony search algorithm embedded with a novel piecewise opposition based learning algorithm. Eng Appl Artif Intell 67:317–330CrossRefGoogle Scholar
  13. 13.
    Shan X, Liu K, Sun PL (2016) Modified Bat Algorithm Based on Lévy Flight and Opposition Based Learning Scientific Programming, 2016Google Scholar
  14. 14.
    Mahdavi S, Rahnamayan S, Deb K (2017) Opposition based learning: a literature review. Swarm and Evolutionary ComputationGoogle Scholar
  15. 15.
    Wang H, Wu Z, Liu Y, Wang J, Jiang D, Chen L (2009) Space transformation search: a new evolutionary technique. Proceedings of the first ACM/SIGEVO Summit on Genetic and Evol Comput (pp. 537–544). ACMGoogle Scholar
  16. 16.
    Wang H, Wu Z, Rahnamayan S, Liu Y, Ventresca M (2011) Enhancing particle swarm optimization using generalized opposition-based learning. Inf Sci 181(20):4699–4714MathSciNetCrossRefGoogle Scholar
  17. 17.
    Guo Z, Wang S, Yue X, Yang H (2017) Global harmony search with generalized opposition-based learning. Soft Comput 21(8):2129–2137CrossRefGoogle Scholar
  18. 18.
    Wang H, Wang W, Zhu H, Sun H (2013) Generalised opposition-based differential evolution for frequency modulation parameter optimisation. Int J Model Identif Control 18(4):372–379CrossRefGoogle Scholar
  19. 19.
    Si T, De A, Bhattacharjee AK (2014) Particle swarm optimization with generalized opposition based learning in particle's pbest position. In Circuit, Power and Computing Technologies (ICCPCT), 2014 International Conference on (pp. 1662–1667). IEEEGoogle Scholar
  20. 20.
    Wang H, Wang W, Sun H (2015) Firefly algorithm with generalised opposition-based learning. Int J Wirel Mob Comput 9(4):370–376CrossRefGoogle Scholar
  21. 21.
    Zhou X, Wu Z, Deng C, Peng H (2015) Enhancing artificial bee colony algorithm with generalised opposition-based learning. Int J Comput Sci Math 6(3):297–309MathSciNetCrossRefGoogle Scholar
  22. 22.
    Chen X, Yu K, Du W, Zhao W, Liu G (2016) Parameters identification of solar cell models using generalized oppositional teaching learning-based optimization. Energy 99:170–180CrossRefGoogle Scholar
  23. 23.
    Zhou Y, Wang R, Luo Q (2016) Elite opposition-based flower pollination algorithm. Neurocomputing 188:294–310CrossRefGoogle Scholar
  24. 24.
    Zhang S, Luo Q, Zhou Y (2017) Hybrid grey wolf optimizer using elite opposition-based learning strategy and simplex method. Int J Comput Intell Appl 16(02):1750012CrossRefGoogle Scholar
  25. 25.
    Huang K, Zhou Y, Wu X, Luo Q (2016) A cuckoo search algorithm with elite opposition-based strategy. J Intell Syst 25(4):567–593Google Scholar
  26. 26.
    Storn R, Price K (1997) Differential evolution–a simple and efficient heuristic for global optimization over continuous spaces. J Glob Optim 11(4):341–359MathSciNetCrossRefzbMATHGoogle Scholar
  27. 27.
    Jadon SS, Tiwari R, Sharma H, Bansal JC (2017) Hybrid artificial bee colony algorithm with differential evolution. Appl Soft Comput 58:11–24CrossRefGoogle Scholar
  28. 28.
    Zheng YJ, Xu XL, Ling HF, Chen SY (2015) A hybrid fireworks optimization method with differential evolution operators. Neurocomputing 148:75–82CrossRefGoogle Scholar
  29. 29.
    Awad NH, Ali MZ, Suganthan PN, Reynolds RG (2017) CADE: a hybridization of cultural algorithm and differential evolution for numerical optimization. Inf Sci 378:215–241CrossRefGoogle Scholar
  30. 30.
    Wang G, Sun J, Ma J, Xu K, Gu J (2014) Sentiment classification: the contribution of ensemble learning. Decis Support Syst 57:77–93CrossRefGoogle Scholar
  31. 31.
    Moraes R, Valiati JF, Neto WPG (2013) Document-level sentiment classification: an empirical comparison between SVM and ANN. Expert Syst Appl 40(2):621–633CrossRefGoogle Scholar
  32. 32.
    Ye Q, Zhang Z, Law R (2009) Sentiment classification of online reviews to travel destinations by supervised machine learning approaches. Expert Syst Appl 36(3):6527–6535CrossRefGoogle Scholar
  33. 33.
    Saleh MR, Martín-Valdivia MT, Montejo-Ráez A, Ureña-López LA (2011) Experiments with SVM to classify opinions in different domains. Expert Syst Appl 38(12):14799–14804CrossRefGoogle Scholar
  34. 34.
    Aliane AA, Aliane H, Ziane M, Bensaou N (2016) A genetic algorithm feature selection based approach for Arabic sentiment classification, IEEEGoogle Scholar
  35. 35.
    Duwairi RM, Qarqaz I (2016) A framework for Arabic sentiment analysis using supervised classification. Int J Data Mining Model Manag 8(4):369–381Google Scholar
  36. 36.
    Al-Radaideh QA, Al-Qudah GY (2017) Application of rough set-based feature selection for Arabic sentiment analysis. Cogn Comput, 1–10Google Scholar
  37. 37.
    Sabri B, Saad S (2016) Arabic sentiment analysis with optimal combination of features selection and machine learning approaches. Res J Appl Sci Eng Technol 13(5):386–393CrossRefGoogle Scholar
  38. 38.
    Rushdi-Saleh M, Teresa M-VM, Urea-Lopez AL, Perea-Ortega JM (2011) OCA: opinion corpus for Arabic. J Am Soc Inf Sci 62:2045–2054CrossRefGoogle Scholar
  39. 39.
    Bahassine S, Madani A, Kissi M (2016) An improved Chi-sqaure feature selection for Arabic text classification using decision Trees In Intelligent Systems: Theories and Applications (SITA), 2016 11th International Conference on (pp. 1–5). IEEEGoogle Scholar
  40. 40.
    Al-Kabi MN, Abdulla NA, Al-Ayyoub M (2013) An analytical study of arabic sentiments: Maktoob case study. In Internet Technology and Secured Transactions (ICITST), 2013 8th International Conference for (pp. 89–94). IEEEGoogle Scholar
  41. 41.
    Abdulla NA, Ahmed NA, Shehab MA, Al-Ayyoub M (2013) Arabic sentiment analysis: Lexicon-based and corpus-based. In Applied Electr Eng and Computing Technologies (AEECT), 2013 IEEE Jordan Conference on (pp. 1–6). IEEEGoogle Scholar
  42. 42.
    Alomari KM, ElSherif HM, Shaalan K (2017) Arabic Tweets Sentimental Analysis Using Mach Learn In International Conference on Industrial, Engineering and Other Applications of Applied Intelligent Systems (pp. 602–610). Springer, ChamGoogle Scholar
  43. 43.
    Abdulla NA, Al-Ayyoub M, Al-Kabi MN (2014) An extended analytical study of Arabic sentiments. Int J Big Data Intell 1 1(1–2):103–113CrossRefGoogle Scholar
  44. 44.
    Omar N, Albared M, Al-Moslmi T, Al-Shabi A (2014) A comparative study of feature selection and Mach Learn algorithms for Arabic sentiment classification. In Asia Inf Retr symposium (pp. 429–443). Springer, ChamGoogle Scholar
  45. 45.
    Bostani H, Sheikhan M (2017) Hybrid of binary gravitational search algorithm and mutual information for feature selection in intrusion detection systems. Soft Comput 21(9):2307–2324CrossRefGoogle Scholar
  46. 46.
    Zhang L, Shan L, Wang J (2017) Optimal feature selection using distance-based discrete firefly algorithm with mutual information criterion. Neural Comput & Applic 28(9):2795–2808CrossRefGoogle Scholar
  47. 47.
    Wang H, Niu B (2017) A novel bacterial algorithm with randomness control for feature selection in classification. Neurocomputing 228:176–186CrossRefGoogle Scholar
  48. 48.
    Barani F, Mirhosseini M, Nezamabadi-Pour H (2017) Application of binary quantum-inspired gravitational search algorithm in feature subset selection. Appl Intell 47(2):304–318CrossRefGoogle Scholar
  49. 49.
    Hancer E, Xue B, Zhang M, Karaboga D, Akay B (2018) Pareto front feature selection based on artificial bee colony optimization. Inf Sci 422:462–479CrossRefGoogle Scholar
  50. 50.
    Wolpert DH, Macready WG (1997) No free lunch theorems for optimization. IEEE Trans Evol Comput 1(1):67–82CrossRefGoogle Scholar
  51. 51.
    Tizhoosh HR (2005) Opposition-based learning: a new scheme for machine intelligence. In Computational intelligence for modelling, control and automation, 2005 and international conference on intelligent agents, web technologies and internet commerce, international conference on (Vol. 1, pp. 695–701). IEEEGoogle Scholar
  52. 52.
    Song Q, Ni J, Wang G (2013) A fast clustering-based feature subset selection algorithm for high-dimensional data. IEEE Trans Knowl Data Eng 25(1):1–14CrossRefGoogle Scholar
  53. 53.
    Abdulla N, Mahyoub N, Shehab M, Al-Ayyoub M (2013) Arabic sentiment analysis: Corpus-based and lexicon-based. In Proceedings of The IEEE conference on Applied Electrical Engineering and Computing Technologies (AEECT)Google Scholar
  54. 54.
    Al-Moslmi T, Albared M, Al-Shabi A, Omar N, Abdullah S (2017) Arabic senti-lexicon: constructing publicly available language resources for arabic sentiment analysis. J Inf Sci, 0165551516683908Google Scholar
  55. 55.
    Saremi S, Mirjalili S, Lewis A (2017) Grasshopper optimisation algorithm: theory and application. Adv Eng Softw 105:30–47CrossRefGoogle Scholar
  56. 56.
    Mirjalili S (2015) The ant lion optimizer. Adv Eng Softw 83:80–98CrossRefGoogle Scholar
  57. 57.
    Bahassine S, Madani A, Al-Sarem M, Kissi M (2018) Feature selection using an improved chi-square for Arabic text classification. Journal of King Saud University-Computer and Information SciencesGoogle Scholar
  58. 58.
    Al-Smadi M, Al-Ayyoub M, Jararweh Y, Qawasmeh O (2018) Enhancing aspect-based sentiment analysis of Arabic hotels’ reviews using morphological, syntactic and semantic features. Inf Process ManagGoogle Scholar
  59. 59.
    Abdul-Mageed M, Diab M, Kübler S (2014) SAMAR: subjectivity and sentiment analysis for Arabic social media. Comput Speech Lang 28(1):20–37CrossRefGoogle Scholar
  60. 60.
    Soliman TH, Elmasry MA, Hedar A, Doss MM (2014) Sentiment analysis of Arabic slang comments on facebook. Int J Comput Technol 12(5):3470–3478CrossRefGoogle Scholar
  61. 61.
    Nasser A, Dinçer K, Sever H (2016) Investigation of the feature selection problem for sentiment analysis in Arabic language. Res Comput Sci 110:41–54Google Scholar
  62. 62.
    Al-Rubaiee H, Qiu R, Li D (2016). Identifying Mubasher software products through sentiment analysis of Arabic tweets. In Industrial Informatics and Computer Systems (CIICS), 2016 International Conference on (pp. 1–6). IEEEGoogle Scholar
  63. 63.
    Cherif W, Madani A, Kissi M (2015) A new modeling approach for Arabic opinion mining recognition. In 2015 Intelligent Systems and Computer Vision (ISCV) (pp. 1–6). IEEEGoogle Scholar
  64. 64.
    Ibrahim HS, Abdou SM, Gheith M (2015) Sentiment analysis for modern standard Arabic and colloquial. arXiv preprint arXiv:1505.03105Google Scholar

Copyright information

© Springer Science+Business Media, LLC, part of Springer Nature 2018

Authors and Affiliations

  • Mohammad Tubishat
    • 1
  • Mohammad A. M. Abushariah
    • 2
  • Norisma Idris
    • 1
    Email author
  • Ibrahim Aljarah
    • 3
  1. 1.Department of Artificial Intelligence, Faculty of Computer Science and Information TechnologyUniversity of MalayaKuala LumpurMalaysia
  2. 2.Computer Information Systems Department, King Abdullah II School of Information TechnologyThe University of JordanAmmanJordan
  3. 3.Business Information Technology Department, King Abdullah II School for Information TechnologyThe University of JordanAmmanJordan

Personalised recommendations