Translation of Natural Language Queries to SQL that Involve Aggregate Functions, Grouping and Subqueries for a Natural Language Interface to Databases
Currently, huge amounts of information are stored in databases (DBs). In order to facilitate access to information to all users, natural language interfaces to databases (NLIDBs) have been developed. To this end, these interfaces translate natural language queries to a DB query language. For businesses, the main application of NLIDBs is for decision making by facilitating access to information in a flexible manner. For a NLIDB to be considered complete, it must deal with queries that involve aggregate functions: COUNT, MIN, MAX, SUM and AVG. The prototype developed at the Instituto Tecnológico de Cd. Madero (ITCM) can translate queries in natural language to SQL; however, it did not have a module for dealing with aggregate functions, grouping and subqueries. In this paper a new module of this NLIDB for dealing with aggregate functions, grouping and subqueries is described, and experimental results are presented, which show that this interface has a performance (recall) better than that of C-Phrase.
KeywordsNatural language Natural language interfaces to databases aggregate functions Grouping Subqueries
- 1.R. Pazos, M. Aguirre, J. Gonzalez, J. Martínez, J. Pérez, A. Verástegui, Comparative study on the customization of Natural Language Interfaces to Databases. SpringerPlus 5, 553 (2016)Google Scholar
- 3.L. Tang, R. Mooney, Using multiple clause constructors in inductive logic programming for semantic parsing, in Proceedings of the 12th European Conference on Machine Learning, pp. 466–477 (2001)Google Scholar
- 4.Linguistic Data Consortium, “ATIS2”. https://catalog.ldc.upenn.edu/ LDC93S5. Accessed 2017