A Big Data Experiment to Assess the Effectiveness of Deep Learning Neural Networks in the Mining of Sustainable Aspects of the Hotels Clients Opinions

  • Thiago de Oliveira Lima
  • Methanias Colaço Júnior
  • Kleber H. de J. Prado
  • Adalberto dos S. Júnior
Conference paper
Part of the Advances in Intelligent Systems and Computing book series (AISC, volume 800)

Abstract

Context: Opinions posted by hotel clients on tourism social networks can be a rich source of knowledge in the Big Data context, including knowledge about the sustainability aspects of hotels. Objective: To evaluate the performance and quality of deep learning neural networks, specifically the Target-Connection LSTM (TC-LSTM) and Attention-based LSTM (AT-LSTM) algorithms, in mining and classifying opinions posted on the TripAdvisor and Booking social networks with respect to sustainability aspects. Method: A controlled experiment was carried out to compare the efficiency and efficacy of the classifiers. Results: The AT-LSTM algorithm presented the best results, with an accuracy of 74.58%, a precision of 95.54%, an F-measure of 85.37%, an average training time of 7.3 s, and an average classification time of 1.12 s. Conclusion: The AT-LSTM algorithm was clearly more effective than TC-LSTM, making it an option to consider for mining opinions on specific aspects of tourism and particular market niches.
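
The quality metrics named in the abstract (accuracy, precision, F-measure) are conventionally derived from a binary confusion matrix. The sketch below uses the standard textbook definitions. It is not the authors' evaluation code, and the class name, function names, and counts are hypothetical values chosen only for illustration; they are not results from the experiment.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ConfusionCounts:
    """Binary confusion-matrix counts (hypothetical container, not from the paper)."""
    tp: int  # true positives
    fp: int  # false positives
    fn: int  # false negatives
    tn: int  # true negatives


def accuracy(c: ConfusionCounts) -> float:
    """Share of all predictions that are correct."""
    total = c.tp + c.fp + c.fn + c.tn
    return (c.tp + c.tn) / total if total else 0.0


def precision(c: ConfusionCounts) -> float:
    """Share of positive predictions that are truly positive."""
    predicted_pos = c.tp + c.fp
    return c.tp / predicted_pos if predicted_pos else 0.0


def recall(c: ConfusionCounts) -> float:
    """Share of truly positive items that were predicted positive."""
    actual_pos = c.tp + c.fn
    return c.tp / actual_pos if actual_pos else 0.0


def f_measure(c: ConfusionCounts) -> float:
    """Harmonic mean of precision and recall (F1)."""
    p, r = precision(c), recall(c)
    return 2 * p * r / (p + r) if (p + r) else 0.0


# Illustrative counts only; they do NOT come from the experiment.
example = ConfusionCounts(tp=90, fp=10, fn=30, tn=70)

if __name__ == "__main__":
    print(f"accuracy  = {accuracy(example):.4f}")
    print(f"precision = {precision(example):.4f}")
    print(f"recall    = {recall(example):.4f}")
    print(f"f-measure = {f_measure(example):.4f}")
```

Because the F-measure is a harmonic mean, it always lies between precision and recall, so a precision well above the F-measure implies a lower recall.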

Copyright information

© Springer Nature Switzerland AG 2019

Authors and Affiliations

  • Thiago de Oliveira Lima (1)
  • Methanias Colaço Júnior (1)
  • Kleber H. de J. Prado (1, 2)
  • Adalberto dos S. Júnior (1, 2)
  1. Postgraduate Program in Computer Science (PROCC), Federal University of Sergipe (UFS), Aracaju, Brazil
  2. Universidade Federal de Pelotas, Rio Grande do Sul, Pelotas, Brazil