A Shallow Convolutional Neural Network Architecture for Open Domain Question Answering

Rosso-Mateus, Andrés; González, Fabio A.; Montes-y-Gómez, Manuel

doi:10.1007/978-3-319-66562-7_35

Andrés Rosso-Mateus¹¹,
Fabio A. González¹¹ &
Manuel Montes-y-Gómez¹²

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 735))

Included in the following conference series:

Colombian Conference on Computing

1711 Accesses

Abstract

This paper addresses the problem of answering a question by choosing the best answer from a set of candidate text fragments. This task requires to identify and measure the semantical relationship between the question and the candidate answers. Unlike previous solutions to this problem based on deep neural networks with million of parameters, we present a novel convolutional neural network approach that despite having a simple architecture is able to capture the semantical relationships between terms in a generated similarity matrix. The method was systematically evaluated over two different standard data sets. The results show that our approach is competitive with state-of-the-art methods despite having a simpler and efficient architecture.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
GitHub passage retrieval code https://github.com/andresrosso/passage_retrieval.

References

Association for Computational Linguistics — ACL: Question Answering (State of the art) (2007)
Google Scholar
Bird, S.: NLTK: the natural language toolkit. In: Proceedings of the COLING/ACL on Interactive Presentation Sessions, pp. 69–72. Association for Computational Linguistics (2006)
Google Scholar
Dong, L., Wei, F., Zhou, M., Xu, K.: Question answering over freebase with multi-column convolutional neural networks. In: ACL, vol. 1, pp. 260–269 (2015)
Google Scholar
Etzioni, O.: Search needs a shake-up. Nature 476(7358), 25–26 (2011)
Article Google Scholar
Goodfellow, I., Bengio, Y., Courville, A.: Deep Learning. MIT Press, Cambridge (2016)
MATH Google Scholar
He, H., Gimpel, K., Lin, J.J.: Multi-perspective sentence similarity modeling with convolutional neural networks. In: EMNLP, pp. 1576–1586 (2015)
Google Scholar
Hua, H., Lin, J.: Pairwise word interaction modeling with deep neural networks for semantic similarity measurement. In: Proceedings of NAACL-HLT, pp. 937–948 (2016)
Google Scholar
Heilman, M., Smith, N.A.: Tree edit models for recognizing textual entailments, paraphrases, and answers to questions. In: Human Language Technologies, pp. 1011–1019. Association for Computational Linguistics (ACL) (2010)
Google Scholar
Hirschman, L., Gaizauskas, R.: Natural language question answering: the view from here. Nat. Lang. Eng. 7(04), 275–300 (2001)
Article Google Scholar
Le, Q.V., Mikolov, T.: Distributed representations of sentences and documents. In: ICML, vol. 14, pp. 1188–1196 (2014)
Google Scholar
Liu, F., Pennell, D., Liu, F., Liu, Y.: Unsupervised approaches for automatic keyword extraction using meeting transcripts. In: Proceedings of Human Language Technologies, pp. 620–628. Association for Computational Linguistics (ACL) (2009)
Google Scholar
Mikolov, T., Chen, K., Corrado, G., Dean, J.: word2vec (2014)
Google Scholar
Rao, J., He, H., Lin, J.: Noise-contrastive estimation for answer selection with deep neural networks. In: Proceedings of the 25th ACM, pp. 1913–1916. ACM (2016)
Google Scholar
Severyn, A., Moschitti, A.: Learning to rank short text pairs with convolutional deep neural networks. In: Proceedings ACM SIGIR Conference, pp. 373–382. ACM (2015)
Google Scholar
Srivastava, N., Hinton, G.E., Krizhevsky, A., Sutskever, I., Salakhutdinov, R.: Dropout: a simple way to prevent neural networks from overfitting. J. Mach. Learn. Res. 15(1), 1929–1958 (2014)
MathSciNet MATH Google Scholar
Talathi, S.S., Vartak, A.: Improving performance of recurrent neural network with relu nonlinearity. CoRR, abs/1511.03771 (2015)
Google Scholar
Wang, C., Kalyanpur, A., Boguraev, B.K.: Relation extraction and scoring in DeepQA. IBM J. Res. Dev. 56(3), 9:1–9:12 (2012)
Article Google Scholar
Wang, M., Manning, C.D.: Probabilistic tree-edit models with structured latent variables for textual entailment and question answering. In: ACL Proceedings, pp. 1164–1172. Association for Computational Linguistics (2010)
Google Scholar
Wang, M., Smith, N.A., Mitamura, T.: What is the Jeopardy Model? A Quasi-Synchronous Grammar for QA, pp. 22–32, June 2007
Google Scholar
Yang, L., Ai, Q., Guo, J., Croft, W.B.: aNNM: ranking short answer texts with attention-based neural matching model. In: Proceedings ACM, pp. 287–296. ACM (2016)
Google Scholar
Yang, Y., Yih, W-T., Meek, C.: WikiQA: a challenge dataset for open-domain question answering. In: EMNLP, pp. 2013–2018. Citeseer (2015)
Google Scholar
Yao, X., Van Durme, B., Callison-Burch, C., Clark, P.: Answer extraction as sequence tagging with tree edit distance. In: HLT-NAACL, pp. 858–867. Citeseer (2013)
Google Scholar
Lei, Y., Hermann, K.M., Blunsom, P., Pulman, S.: Deep learning for answer sentence selection. In: NIPS Deep Learning Workshop (2014)
Google Scholar

Download references

Author information

Authors and Affiliations

MindLab Research Group, Universidad Nacional de Colombia, Bogotá, Colombia
Andrés Rosso-Mateus & Fabio A. González
Computer Science Department, Instituto Nacional de Astrofísica, Óptica y Electrónica, Puebla, Mexico
Manuel Montes-y-Gómez

Authors

Andrés Rosso-Mateus
View author publications
You can also search for this author in PubMed Google Scholar
Fabio A. González
View author publications
You can also search for this author in PubMed Google Scholar
Manuel Montes-y-Gómez
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Andrés Rosso-Mateus .

Editor information

Editors and Affiliations

Universidad Autónoma de Occidente, Cali, Colombia
Andrés Solano
Universidad de San Buenaventura, Cali, Colombia
Hugo Ordoñez

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Rosso-Mateus, A., González, F.A., Montes-y-Gómez, M. (2017). A Shallow Convolutional Neural Network Architecture for Open Domain Question Answering. In: Solano, A., Ordoñez, H. (eds) Advances in Computing. CCC 2017. Communications in Computer and Information Science, vol 735. Springer, Cham. https://doi.org/10.1007/978-3-319-66562-7_35

Download citation

DOI: https://doi.org/10.1007/978-3-319-66562-7_35
Published: 17 August 2017
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-66561-0
Online ISBN: 978-3-319-66562-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics