arXiv:1709.00103 Abstract | arXiv Analytics

arXiv:1709.00103 [cs.CL]Abstract References Reviews Resources

Seq2SQL: Generating Structured Queries from Natural Language using Reinforcement Learning

Victor Zhong, Caiming Xiong, Richard Socher

Published 2017-08-31Version 1

A significant amount of the world's knowledge is stored in relational databases. However, the ability for users to retrieve facts from a database is limited due to a lack of understanding of query languages such as SQL. We propose Seq2SQL, a deep neural network for translating natural language questions to corresponding SQL queries. Our model leverages the structure of SQL queries to significantly reduce the output space of generated queries. Moreover, we use rewards from in-the-loop query execution over the database to learn a policy to generate unordered parts of the query, which we show are less suitable for optimization via cross entropy loss. In addition, we will publish WikiSQL, a dataset of 87726 hand-annotated examples of questions and SQL queries distributed across 26375 tables from Wikipedia. This dataset is required to train our model and is an order of magnitude larger than comparable datasets. By applying policy-based reinforcement learning with a query execution environment to WikiSQL, our model Seq2SQL outperforms attentional sequence to sequence models, improving execution accuracy from 35.9% to 60.3% and logical form accuracy from 23.4% to 49.2%.

Comments: 10 pages, 5 figures

Categories: cs.CL, cs.AI

Keywords: natural language, generating structured queries, reinforcement learning, sql queries, model seq2sql outperforms attentional sequence

Related articles: Most relevant | Search more

arXiv:1711.04436 [cs.CL] (Published 2017-11-13)

SQLNet: Generating Structured Queries From Natural Language Without Reinforcement Learning

Xiaojun Xu, Chang Liu, Dawn Song

arXiv:1807.01670 [cs.CL] (Published 2018-07-04)

Encoding Spatial Relations from Natural Language

Tiago Ramalho et al.

arXiv:1706.08502 [cs.CL] (Published 2017-06-26)

Natural Language Does Not Emerge 'Naturally' in Multi-Agent Dialog

Satwik Kottur, José M. F. Moura, Stefan Lee, Dhruv Batra

arXiv Analytics

arXiv:1709.00103 [cs.CL]Abstract References Reviews Resources

Seq2SQL: Generating Structured Queries from Natural Language using Reinforcement Learning

Links

Toolbox

arXiv:1709.00103 [cs.CL]AbstractReferencesReviewsResources

Seq2SQL: Generating Structured Queries from Natural Language using Reinforcement Learning

Links

Toolbox

arXiv:1709.00103 [cs.CL]Abstract References Reviews Resources