Advertisement

Translation of Natural Language Queries to SQL that Involve Aggregate Functions, Grouping and Subqueries for a Natural Language Interface to Databases

  • Rodolfo A. Pazos R.Email author
  • Andres A. Verastegui
  • José A. Martínez F.
  • Martin Carpio
  • Juana Gaspar H.
Chapter
Part of the Studies in Computational Intelligence book series (SCI, volume 749)

Abstract

Currently, huge amounts of information are stored in databases (DBs). In order to facilitate access to information to all users, natural language interfaces to databases (NLIDBs) have been developed. To this end, these interfaces translate natural language queries to a DB query language. For businesses, the main application of NLIDBs is for decision making by facilitating access to information in a flexible manner. For a NLIDB to be considered complete, it must deal with queries that involve aggregate functions: COUNT, MIN, MAX, SUM and AVG. The prototype developed at the Instituto Tecnológico de Cd. Madero (ITCM) can translate queries in natural language to SQL; however, it did not have a module for dealing with aggregate functions, grouping and subqueries. In this paper a new module of this NLIDB for dealing with aggregate functions, grouping and subqueries is described, and experimental results are presented, which show that this interface has a performance (recall) better than that of C-Phrase.

Keywords

Natural language Natural language interfaces to databases aggregate functions Grouping Subqueries 

References

  1. 1.
    R. Pazos, M. Aguirre, J. Gonzalez, J. Martínez, J. Pérez, A. Verástegui, Comparative study on the customization of Natural Language Interfaces to Databases. SpringerPlus 5, 553 (2016)Google Scholar
  2. 2.
    M. Minock, C-phrase: a system for building robust natural language interfaces to databases. Data Knowl. Eng. 69, 290–302 (2010)CrossRefGoogle Scholar
  3. 3.
    L. Tang, R. Mooney, Using multiple clause constructors in inductive logic programming for semantic parsing, in Proceedings of the 12th European Conference on Machine Learning, pp. 466–477 (2001)Google Scholar
  4. 4.
    Linguistic Data Consortium, “ATIS2”. https://catalog.ldc.upenn.edu/ LDC93S5. Accessed 2017

Copyright information

© Springer International Publishing AG 2018

Authors and Affiliations

  • Rodolfo A. Pazos R.
    • 1
    Email author
  • Andres A. Verastegui
    • 1
  • José A. Martínez F.
    • 1
  • Martin Carpio
    • 2
  • Juana Gaspar H.
    • 1
  1. 1.Instituto Tecnológico de Ciudad MaderoCiudad MaderoMexico
  2. 2.Instituto Tecnológico de LeónLeónMexico

Personalised recommendations