Semantic Question Matching in Data Constrained Environment

Maitra, Anutosh; Sengupta, Shubhashis; Mukhopadhyay, Abhisek; Gupta, Deepak; Pujari, Rajkumar; Bhattacharya, Pushpak; Ekbal, Asif; Jain, Tom Geo

doi:10.1007/978-3-030-00794-2_29

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 11107)

Included in the following conference series:

International Conference on Text, Speech, and Dialogue

Abstract

Machine comprehension of the various forms of semantically similar questions that share the same or similar answers has been an ongoing challenge. In many industrial domains with a limited set of questions, it is especially hard to identify the proper semantic match for a newly asked question that has the same answer but is presented in a different lexical form. This paper proposes a linguistically motivated taxonomy for English questions and an effective approach to question matching that combines deep learning models for question representation with general taxonomy-based features. Experiments performed on short datasets demonstrate the effectiveness of the proposed approach: coupling the standard distributional features with knowledge-based methods yielded better matching classification.
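
As a rough illustration of the idea of coupling a distributional similarity score with a taxonomy-based feature, the following minimal sketch builds a two-element feature vector for a question pair. It is not the authors' method: a bag-of-words cosine stands in for the learned question representation, and the `WH_TYPES` table is a hypothetical, much-simplified substitute for the paper's taxonomy, which the abstract does not describe. All function names are assumptions made for this example.

```python
import math
from collections import Counter

# Hypothetical coarse question types keyed by the leading wh-word.
# This table is an assumption for illustration, not the paper's taxonomy.
WH_TYPES = {
    "who": "person", "where": "location", "when": "time",
    "how": "manner", "why": "reason", "what": "entity", "which": "entity",
}


def tokenize(question):
    """Lowercase the question and strip basic punctuation from each token."""
    tokens = (t.strip("?.,!").lower() for t in question.split())
    return [t for t in tokens if t]


def question_type(question):
    """Return a coarse type from the first token, or 'other' if unknown."""
    tokens = tokenize(question)
    return WH_TYPES.get(tokens[0], "other") if tokens else "other"


def cosine(q1, q2):
    """Bag-of-words cosine similarity (stand-in for a learned representation)."""
    v1, v2 = Counter(tokenize(q1)), Counter(tokenize(q2))
    dot = sum(v1[t] * v2[t] for t in v1)
    norm = (math.sqrt(sum(c * c for c in v1.values()))
            * math.sqrt(sum(c * c for c in v2.values())))
    return dot / norm if norm else 0.0


def pair_features(q1, q2):
    """Feature vector: [distributional similarity, taxonomy agreement]."""
    same_type = 1.0 if question_type(q1) == question_type(q2) else 0.0
    return [cosine(q1, q2), same_type]
```

A vector such as `pair_features("Where is the office?", "Where can I find the office?")` could then be passed to any binary classifier that decides whether the two questions match; the choice of classifier is left open here.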

Author information

Authors and Affiliations

Accenture Labs, Bangalore, India
Anutosh Maitra, Shubhashis Sengupta, Abhisek Mukhopadhyay & Tom Geo Jain
Indian Institute of Technology, Patna, India
Deepak Gupta, Rajkumar Pujari, Pushpak Bhattacharya & Asif Ekbal

Corresponding author

Correspondence to Abhisek Mukhopadhyay.

Editor information

Editors and Affiliations

Faculty of Informatics, Masaryk University, Brno, Czech Republic
Petr Sojka, Aleš Horák, Ivan Kopeček & Karel Pala

About this paper

Cite this paper

Maitra, A. et al. (2018). Semantic Question Matching in Data Constrained Environment. In: Sojka, P., Horák, A., Kopeček, I., Pala, K. (eds) Text, Speech, and Dialogue. TSD 2018. Lecture Notes in Computer Science, vol 11107. Springer, Cham. https://doi.org/10.1007/978-3-030-00794-2_29

DOI: https://doi.org/10.1007/978-3-030-00794-2_29
Published: 08 September 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-00793-5
Online ISBN: 978-3-030-00794-2
eBook Packages: Computer Science, Computer Science (R0)
