Abstract
Often customers refer to product specifications to know if a product fulfills their requirement, but in case they do not find answer to their queries they turn to the online reviews. A knowledge-based system tries to answer user queries using product specifications alone, but specifications available can be insufficient. An idea is introduced of combining question answer and review datasets to find the most relevant reviews corresponding to the question using semantic and statistical similarity measures and optimizing their weights. This is accomplished by using a large volume of already answered queries to find weights of four different similarity metrics (Cosine, BM25, WordNet and Word Embedding) used to find similarity between question and reviews. Similarity measure weights are optimized using PSO-based weight optimization technique where the fitness function is evaluated in terms of how best the sentiment extracted from top reviews agrees with answer of question in Q/A dataset. Results achieved surpass baseline in three out of four domains.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
Links to amazon dataset:
References
Larsen, B., Aone, C.: Fast and effective text mining using linear-time document clustering. In: Proceedings of the fifth ACM SIGKDD international conference on Knowledge discovery and data mining. ACM (1999)
Venugopalan, M., Gupta, D.: Exploring sentiment analysis on twitter data. In: 2015 Eighth International Conference on Contemporary Computing (IC3). IEEE (2015)
Pang, B., Lee, L.: Opinion mining and sentiment analysis. Found. Trends® Inf. Retr. 2(1–2), 1–135 (2008)
Venugopalan, M., Gupta, D.: An enhanced polarity lexicon by learning-based method using related domain knowledge. Int. J. Inf. Process. Manag. 6(2), 61 (2015)
Vishnu, K.S., Apoorva, T., Gupta, D.: Learning domain-specific and domain-independent opinion oriented lexicons using multiple domain knowledge. In: 2014 Seventh International Conference on Contemporary Computing (IC3). IEEE (2014)
Sarawgi, K., Pathak, V.: Opinion mining: aspect level sentiment analysis using SentiWordNet and amazon web services. Int. J. Comput. Appl. 158(6), 31–36 (2017). https://doi.org/10.5120/ijca2017912830
Daelemans, W., du Plessis, T., Snyman, C., Teck, L. (eds.) Text mining with information extraction. In: Proceedings of the 4th International MIDP Colloquium, September 2003, pp. 141–160
Chahal, P., et al.: Ranking of web documents using semantic similarity. In: 2013 International Conference on Information Systems and Computer Networks (2013). https://doi.org/10.1109/iciscon.2013.6524191
Botev, V., et al.: Word importance-based similarity of documents metric (WISDM). In: Proceedings of the 6th International Workshop on Mining Scientific Publications—WOSP 2017, (2017). https://doi.org/10.1145/3127526.3127530
Kumar Gupta D., Srikanth R.K., Shweta, E.A.: PSOASent: feature selection using particle swarm optimization for aspect based sentiment analysis. In: Biemann, C., Handschuh, S., Freitas, A., Meziane, F., Métais, E. (eds.) Natural Language Processing and Information Systems. NLDB 2015. Lecture Notes in Computer Science, vol. 9103. Springer, Cham (2015)
Akhtar, MS., et al.: Feature Selection and ensemble construction: a two-step method for aspect based sentiment analysis. Knowl.-Based Syst. 125, 116–135 (2017) https://doi.org/10.1016/j.knosys.2017.03.020
Shi, Y., Eberhart, R. (n.d.). A modified particle swarm optimizer. In: 1998 IEEE International Conference on Evolutionary Computation Proceedings. IEEE World Congress on Computational Intelligence (Cat. No. 98TH8360). https://doi.org/10.1109/icec.1998.699146
Davis, L. (ed.): Handbook of Genetic Algorithms. Van Nostrand Reinhold, New York, NY (1991)
Al-Chalabi, H., Ray, S., Shaalan, K.: Semantic based query expansion for arabic question answering systems. In: 2015 First International Conference on Arabic Computational Linguistics (ACLing) (2015). https://doi.org/10.1109/acling.2015.25
Sneiders, E.: Automated FAQ answering with question-specific knowledge representation for web self-service. In: 2009 2nd Conference on Human System Interactions (2009). https://doi.org/10.1109/hsi.2009.5090996
Moghaddam, S., Ester, M.: AQA: Aspect-based Opinion Question Answering. In: 2011 IEEE 11th International Conference on Data Mining Workshops (2011). https://doi.org/10.1109/icdmw.2011.34
Mcauley, J., Yang, A.: Addressing complex and subjective product-related queries with customer reviews. In: Proceedings of the 25th International Conference on World Wide Web-WWW ‘16, (2016). https://doi.org/10.1145/2872427.2883044
Wan, M., Mcauley, J.: Modeling ambiguity, subjectivity, and diverging viewpoints in opinion question answering systems. In: 2016 IEEE 16th International Conference on Data Mining (ICDM) (2016). https://doi.org/10.1109/icdm.2016.0060
Lahitani, A.R., Permanasari, A.E., Setiawan, N.A.: Cosine similarity to determine similarity measure: study case in online essay assessment. In: 2016 4th International Conference on Cyber and IT Service Management (2016). https://doi.org/10.1109/citsm.2016.7577578
Hong-Minh, T., Smith, D.: Word similarity in WordNet. In: Modeling, Simulation and Optimization of Complex Processes, pp. 293–302 (2008). https://doi.org/10.1007/978-3-540-79409-7_19
Xiang, L., Yu, J., Yang, C., Zeng, D., Shen, X.: A Word-Embedding-Based Steganalysis Method For Linguistic Steganography Via Synonym Substitution. IEEE Access 6, 64131–64141 (2018). https://doi.org/10.1109/access.2018.2878273
Lv, Y., Zhai, C.: Lower-bounding term frequency normalization. In: Proceedings of the 20th ACM International Conference on Information and Knowledge Management—CIKM 11, (2011) https://doi.org/10.1145/2063576.2063584
Eberhart, Y.S.: Particle Swarm optimization: developments, applications and resources. Proceedings of the 2001 Congress on Evolutionary Computation (IEEE Cat. No.01TH8546), https://doi.org/10.1109/cec.2001.934374
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Dwivedi, G., Venugopalan, M., Gupta, D. (2020). A Statistical-Semantic PSO Model for Customer Reviews-Based Question Answering Systems. In: Reddy, V., Prasad, V., Wang, J., Reddy, K. (eds) Soft Computing and Signal Processing. ICSCSP 2019. Advances in Intelligent Systems and Computing, vol 1118. Springer, Singapore. https://doi.org/10.1007/978-981-15-2475-2_13
Download citation
DOI: https://doi.org/10.1007/978-981-15-2475-2_13
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-2474-5
Online ISBN: 978-981-15-2475-2
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)