Semantic Question Matching in Data Constrained Environment

  • Anutosh Maitra
  • Shubhashis Sengupta
  • Abhisek MukhopadhyayEmail author
  • Deepak Gupta
  • Rajkumar Pujari
  • Pushpak Bhattacharya
  • Asif Ekbal
  • Tom Geo Jain
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 11107)


Machine comprehension of various forms of semantically similar questions with same or similar answers has been an ongoing challenge. Especially in many industrial domains with limited set of questions, it is hard to identify proper semantic match for a newly asked question having the same answer but presented in different lexical form. This paper proposes a linguistically motivated taxonomy for English questions and an effective approach for question matching by combining deep learning models for question representations with general taxonomy based features. Experiments performed on short datasets demonstrate the effectiveness of the proposed approach as better matching classification was observed by coupling the standard distributional features with knowledge-based methods.


Question answering Semantic matching Taxonomy 


  1. 1.
    Achananuparp, P., Hu, X., Sheng, X.: The evaluation of sentence similarity measures. In: Proceedings of 10th International Conference on Data Warehousing and Knowledge Discovery, pp. 305–316 (2008)Google Scholar
  2. 2.
    Bunescu, R., Huang, Y.: Towards a general model of answer typing: question focus identification. In: CICLing (2010)Google Scholar
  3. 3.
    Burke, R.D.: Question answering from frequently asked question files: in experiences with the FAQ finder system. AI Mag. 18(2), 57 (1997)Google Scholar
  4. 4.
    Chang, C.-C., Lin, C.-J.: LIBSVM: a library for support vector machines. ACM Trans. Intell. Syst. Technol. (TIST) 2(3), 27 (2011)Google Scholar
  5. 5.
    Cortes, C., Vapnik, V.: Support-vector networks. Mach. Learn. 20(3), 273–297 (1995)zbMATHGoogle Scholar
  6. 6.
    Feng, M., Xiang, B., Glass, M.R., Wang, L., Zhou, B.: Applying deep learning to answer selection: a study and an open task. In: 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), pp. 813–820. IEEE (2015)Google Scholar
  7. 7.
    Gunther, F.: LSAfun - an R package for computations based on latent semantic analysis. Behav. Res. Methods 47(4), 930–944 (2015)CrossRefGoogle Scholar
  8. 8.
    Jeon, J., Croft, W.B., Lee, J.H.: Finding similar questions in large question and answer archives. In: Proceedings of the 14th ACM international conference on Information and Knowledge Management, pp. 84–90. ACM (2005)Google Scholar
  9. 9.
    Lei, T., et al.: Semi-supervised question retrieval with gated convolutions. In: Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp. 1279–1289. Association for Computational Linguistics, San Diego (2016)Google Scholar
  10. 10.
    Li, S., Manandhar, S.: Improving question recommendation by exploiting information need. In: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, vol. 1, pp. 1425–1434. Association for Computational Linguistics (2011)Google Scholar
  11. 11.
    M‘arquez, L., Glass, J., Magdy, W., Moschitti, A., Nakov, P., Randeree, B.: Semeval-2015 task 3: answer selection in community question answering. In: Proceedings of the 9th International Workshop on Semantic Evaluation (SemEval 2015) (2015)Google Scholar
  12. 12.
    Mlynarczyk, S., Lytinen, S.: FAQFinder question answering improvements using question/answer matching. In: Proceedings of L&T-2005-Human Language Technologies as a Challenge for Computer Science and Linguistics (2005)Google Scholar
  13. 13.
    Moldovan, D., et al.: Lasso: a tool for surfing the answer net. In: Proceedings 8th Text Retrieval Conference (TREC-8) (2000)Google Scholar
  14. 14.
    Nakov, P., et al.: SemEval-2016 task 3: community question answering. In: Proceedings of the 10th International Workshop on Semantic Evaluation, vol. 16 (2016)Google Scholar
  15. 15.
    Al-Harbi, O., Jusoh, S., Norwawi, N.M.: Lexical disambiguation in natural language questions. Int. J. Comput. Sci. Issues 8(4), 143–150 (2011)Google Scholar
  16. 16.
    Rajpurkar, P., Zhang, J., Lopyrev, K., Liang, P.: Squad: 100,000+ questions for machine comprehension of text. CoRR abs/1606.05250 (2016)Google Scholar
  17. 17.
    Severyn, A., Moschitti, A.: Learning to rank short text pairs with convolutional deep neural networks. In: Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 373–382. ACM (2015)Google Scholar
  18. 18.
    Wang, D., Nyberg, E.: A long short-term memory model for answer sentence selection in question answering. In: Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics, Beijing, China, pp. 707–712 (2015)Google Scholar
  19. 19.
    Wang, K., Ming, Z., Chua, T.-S.: A syntactic tree matching approach to finding similar questions in community-based QA services. In: Proceedings of the 32nd International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 187–194. ACM (2009)Google Scholar
  20. 20.
    Li, X., Roth, D.: Learning question classifiers. In: Proceedings of the 19th International Conference on Computational Linguistics, vol. 1 (2002)Google Scholar
  21. 21.
    LeCun, Y., Bengio, Y., Hinton, G.: Deep learning. In: Nature, vol. 521 (2015)Google Scholar
  22. 22.
    Zhong, Z., Ng, H.T.: It makes sense: a wide coverage word sense disambiguation system for free text. In: Proceedings of the ACL 2010 System Demonstrations, ACL Demos 10, pp. 78–83. ACM, Stroudsburg (2010)Google Scholar
  23. 23.
    Zhou, G., Liu, Y., Liu, F., Zeng, D., Zhao, J.: Improving question retrieval in community question answering using world knowledge. In: IJCAI, vol. 13, pp. 2239–2245 (2013)Google Scholar

Copyright information

© Springer Nature Switzerland AG 2018

Authors and Affiliations

  • Anutosh Maitra
    • 1
  • Shubhashis Sengupta
    • 1
  • Abhisek Mukhopadhyay
    • 1
    Email author
  • Deepak Gupta
    • 2
  • Rajkumar Pujari
    • 2
  • Pushpak Bhattacharya
    • 2
  • Asif Ekbal
    • 2
  • Tom Geo Jain
    • 1
  1. 1.Accenture LabsBangaloreIndia
  2. 2.Indian Institute of TechnologyPatnaIndia

Personalised recommendations