WikiSQL

Dataset Information
Modalities
Texts
Introduced
2017
Homepage

Overview

WikiSQL consists of a corpus of 87,726 hand-annotated SQL query and natural language question pairs. These SQL queries are further split into training (61,297 examples), development (9,145 examples) and test sets (17,284 examples). It can be used for natural language inference tasks related to relational databases.

Source: SQL-to-Text Generation with Graph-to-Sequence Model
Image Source: https://blog.einstein.ai/how-to-talk-to-your-database/

Variants: WikiSQL

Associated Benchmarks

This dataset is used in 4 benchmarks:

Recent Benchmark Submissions

Task Model Paper Date
Question Answering PieTa Piece of Table: A Divide-and-Conquer … 2024-12-10
Question Answering TabSQLify TabSQLify: Enhancing Reasoning Capabilities of … 2024-04-15
Semantic Parsing CABINET CABINET: Content Relevance based Noise … 2024-02-02
Semantic Parsing ReasTAP-Large (weak supervision) ReasTAP: Injecting Table Reasoning Skills … 2022-10-22
Semantic Parsing TAPEX-Large (weak supervision) TAPEX: Table Pre-training via Learning … 2021-07-16
Semantic Parsing TAPAS-Large (weak supervision) TAPAS: Weakly Supervised Table Parsing … 2020-04-05
Code Generation NL2SQL-RULE Content Enhanced BERT-based Text-to-SQL Generation 2019-10-16
Semantic Parsing NL2SQL-BERT Content Enhanced BERT-based Text-to-SQL Generation 2019-10-16
Code Generation Tranx TRANX: A Transition-based Neural Abstract … 2018-10-05
Code Generation TypeSQL+TC (Yu et al., 2018)+ TypeSQL: Knowledge-based Type-Aware Neural Text-to-SQL … 2018-04-25
Code Generation TypeSQL (Yu et al., 2018) TypeSQL: Knowledge-based Type-Aware Neural Text-to-SQL … 2018-04-25
Code Generation STAMP (Sun et al., 2018)+ Semantic Parsing with Syntax- and … 2018-04-23
Code Generation STAMP+RL (Sun et al., 2018)+ Semantic Parsing with Syntax- and … 2018-04-23
SQL-to-Text Graph2Seq-PGE Graph2Seq: Graph to Sequence Learning … 2018-04-03
Code Generation PT-MAML (Huang et al., 2018) Natural Language to Structured Query … 2018-03-02
Code Generation Bidirectional Attention for SQL Generation Bidirectional Attention for SQL Generation 2017-12-30
Code Generation Seq2Seq (Zhong et al., 2017) Seq2SQL: Generating Structured Queries from … 2017-08-31
Code Generation Seq2SQL (Zhong et al., 2017) Seq2SQL: Generating Structured Queries from … 2017-08-31
SQL-to-Text GGS-NN Gated Graph Sequence Neural Networks 2015-11-17

Research Papers

Recent papers with results on this dataset: