Semantic Textual Similarity and Factorization Machine Model for Retrieval of Question-Answering

  • Nivid LimbasiyaEmail author
  • Prateek Agrawal
Conference paper
Part of the Communications in Computer and Information Science book series (CCIS, volume 1046)


Question and Answering (QA) in many collaborative social networks such as Yahoo!-answers, Stack Overflow have attracted copious users to post and transfer knowledge between users. This paper proposes an Adaptive global T-max Long Short-Term Memory-Convolutional Neural Network (ALSTM-CNN) method to retrieve semantically matching questions from historical questions and forecast the best answers by saving their effort and time. Moreover, a novel Field-aware Factorization Machine (FFM) classifier is adapted to rank the high-quality answers from large sparse data. This method has certain advantages include: (a) effectively learns the similarity based on simple pertained models with various multiple dimensions, (b) does not uses handcrafted features. This algorithm shows robust performance for various tasks (i.e., measuring textual similarity and paraphrase identification), when it employs on datasets such as Semantic Textual Similarity (STS) benchmark, Sentence Involving Compositional Knowledge (SICK), Microsoft Research Paraphrase Corpus (MRPC) and Wikipedia Question Answer dataset. The performance of our proposed method is compared with different classifiers and the result shows a better accuracy measure than other state-of-the-art methods.


Convolutional neural network Factorization machine Long short-term memory networks Machine learning Natural language processing Question-answering Semantic textual similarity 


  1. 1.
    Xue, X., Jeon, J., Croft, W.B.: Retrieval models for question and answer archives. In: Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 475–482 (2008)Google Scholar
  2. 2.
    Chen, M., Li, L., Xie, Q.: Translation language model enhancement for community question retrieval using user adoption answer. In: Asia-Pacific Web (APWeb) and Web-Age Information Management (WAIM) Joint Conference on Web and Big Data, pp. 251–265 (2017)CrossRefGoogle Scholar
  3. 3.
    Wang, P., Zhang, Y., Ji, L., Yan, J., Jin, L.: Concept embedded convolutional semantic model for question retrieval. In: Proceedings of the Tenth ACM International Conference on Web Search and Data Mining, pp. 395–403 (2017)Google Scholar
  4. 4.
    Zhou, G., Zhou, Y., He, T., Wu, W.: Learning semantic representation with neural networks for community question answering retrieval. Knowl.-Based Syst. 93, 75–83 (2016)CrossRefGoogle Scholar
  5. 5.
    Zhou, G., He, T., Zhao, J., Hu, P.: Learning continuous word embedding with metadata for question retrieval in community question answering. In: Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing, pp. 250–259 (2015)Google Scholar
  6. 6.
    Chen, L., Jose, J.M., Yu, H., Yuan, F.: A hybrid approach for question retrieval in community question answering. Comput. J. 60(7), 1019–1031 (2016)Google Scholar
  7. 7.
    Dai, F., Feng, C., Wang, Z., Pei, Y., Huang, H.: Intent identification for knowledge base question answering. In: 2017 Conference on Technologies and Applications of Artificial Intelligence (TAAI), pp. 96–99. IEEE (2017)Google Scholar
  8. 8.
    Zhang, W.N., Ming, Z.Y., Zhang, Y., Liu, T., Chua, T.S.: Capturing the semantics of key phrases using multiple languages for question retrieval. IEEE Trans. Knowl. Data Eng. 28(4), 888–900 (2016)CrossRefGoogle Scholar
  9. 9.
    Zhou, G., Xie, Z., He, T., Zhao, J., Hu, X.T.: Learning the multilingual translation representations for question retrieval in community question answering via non-negative matrix factorization. IEEE/ACM Trans. Audio Speech Lang. Process. (TASLP) 24(7), 1305–1314 (2016)CrossRefGoogle Scholar
  10. 10.
    Wu, Y., Hori, C., Kashioka, H., Kawai, H.: Leveraging social Q&A collections for improving complex question answering. Comput. Speech Lang. 29(1), 1–19 (2015)CrossRefGoogle Scholar
  11. 11.
    Wu, H., Huang, H., Jian, P., Guo, Y., Su, C.: BIT at SemEval-2017 task 1: using semantic information space to evaluate semantic textual similarity. In: Proceedings of the 11th International Workshop on Semantic Evaluation (SemEval 2017), pp. 77–84 (2017)Google Scholar
  12. 12.
    Ji, Y., Eisenstein, J.: Discriminative improvements to distributional sentence similarity. In: Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, pp. 891–896 (2013)Google Scholar
  13. 13.
    Shao, Y.: HCTI at SemEval-2017 task 1: use convolutional neural network to evaluate semantic textual similarity. In: Proceedings of the 11th International Workshop on Semantic Evaluation (SemEval 2017), pp. 130–133 (2017)Google Scholar
  14. 14.
    Elalfy, D., Gad, W., Ismail, R.: A hybrid model to predict best answers in question answering communities. Egypt. Inform. J. 19(1), 21–31 (2018)CrossRefGoogle Scholar
  15. 15.
    Fang, H., Wu, F., Zhao, Z., Duan, X., Zhuang, Y., Ester, M.: Community-based question answering via heterogeneous social network learning. In: Thirtieth AAAI Conference on Artificial Intelligence (2016)Google Scholar
  16. 16.
    Tian, J., Zhou, Z., Lan, M., Wu, Y.: ECNU at SemEval-2017 task 1: leverage kernel-based traditional NLP features and neural networks to build a universal model for multilingual and cross-lingual semantic textual similarity. In: Proceedings of the 11th International Workshop on Semantic Evaluation (SemEval 2017), pp. 191–197 (2017)Google Scholar
  17. 17.
    He, H., Gimpel, K., Lin, J.: Multi-perspective sentence similarity modeling with convolutional neural networks. In: Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, pp. 1576–1586 (2015)Google Scholar
  18. 18.
    Hu, Z., Zhang, Z., Yang, H., Chen, Q., Zuo, D.: A deep learning approach for predicting the quality of online health expert question-answering services. J. Biomed. Inform. 71, 241–253 (2017)CrossRefGoogle Scholar
  19. 19.
    Agirre, E., Cer, D., Diab, M., Gonzalez-Agirre, A., Guo, W.: *SEM 2013 shared task: semantic textual similarity. In: Second Joint Conference on Lexical and Computational Semantics (*SEM). Proceedings of the Main Conference and the Shared Task: Semantic Textual Similarity, Proceedings of *SEM, vol. 1, pp. 32–43 (2013)Google Scholar
  20. 20.
    Marelli, M., Menini, S., Baroni, M., Bentivogli, L., Bernardi, R., Zamparelli, R.: A SICK cure for the evaluation of compositional distributional semantic models. In: LREC, pp. 216–223 (2014)Google Scholar
  21. 21.
    Fernando, S., Stevenson, M.: A semantic similarity approach to paraphrase detection. In: Proceedings of the 11th Annual Research Colloquium of the UK Special Interest Group for Computational Linguistics, pp. 45–52 (2008)Google Scholar
  22. 22.
    Smith, N.A., Heilman, M., Hwa, R.: Question generation as a competitive undergraduate course project. In: Proceedings of the NSF Workshop on the Question Generation Shared Task and Evaluation Challenge (2008)Google Scholar
  23. 23.
    Kiros, R., et al.: Skip-thought vectors. In: Advances in Neural Information Processing Systems, pp. 3294–3302 (2015)Google Scholar
  24. 24.
    Liu, M., Fang, Y., Choulos, A.G., Park, D.H., Hu, X.: Product review summarization through question retrieval and diversification. Inf. Retrieval J. 20(6), 575–605 (2017)CrossRefGoogle Scholar

Copyright information

© Springer Nature Singapore Pte Ltd. 2019

Authors and Affiliations

  1. 1.Department of Computer Science and EngineeringLovely Professional UniversityJalandharIndia

Personalised recommendations