Text-To-SQL

Text-to-SQL is a task in natural language processing (NLP) where the goal is to automatically generate SQL queries from natural language text. The task involves converting the text input into a structured representation and then using this representation to generate a semantically correct SQL query that can be executed on a database.

( Image credit: SyntaxSQLNet )

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1–50 of 424 papers

Title	Date	Tasks	Status	Hype
Demonstration of DB-GPT: Next Generation Data Interaction System Empowered by Large Language Models	Apr 16, 2024	Data InteractionText to SQL	CodeCode Available	11
A Survey of Text-to-SQL in the Era of LLMs: Where are we, and where are we going?	Aug 9, 2024	Natural Language QueriesText to SQL	CodeCode Available	5
XiYan-SQL: A Novel Multi-Generator Framework For Text-to-SQL	Jul 7, 2025	Text to SQLText-To-SQL	CodeCode Available	4
A Preview of XiYan-SQL: A Multi-Generator Ensemble Framework for Text-to-SQL	Nov 13, 2024	DiversityIn-Context Learning	CodeCode Available	4
Text2SQL is Not Enough: Unifying AI and Databases with TAG	Aug 27, 2024	RAGRetrieval-augmented Generation	CodeCode Available	4
Arctic-Text2SQL-R1: Simple Rewards, Strong Reasoning in Text-to-SQL	May 22, 2025	Natural Language UnderstandingReinforcement Learning (RL)	CodeCode Available	3
Graph-Reward-SQL: Execution-Free Reinforcement Learning for Text-to-SQL via Graph Matching and Stepwise Reward	May 18, 2025	GPUGraph Matching	CodeCode Available	3
ExCoT: Optimizing Reasoning for Text-to-SQL with Execution Feedback	Mar 25, 2025	Text to SQLText-To-SQL	CodeCode Available	3
OmniSQL: Synthesizing High-quality Text-to-SQL Data at Scale	Mar 4, 2025	Text to SQLText-To-SQL	CodeCode Available	3
Cognify: Supercharging Gen-AI Workflows With Hierarchical Autotuning	Feb 12, 2025	RAGText to SQL	CodeCode Available	3
SuffixDecoding: Extreme Speculative Decoding for Emerging AI Applications	Nov 7, 2024	Code GenerationLanguage Modeling	CodeCode Available	3
CHESS: Contextual Harnessing for Efficient SQL Synthesis	May 27, 2024	Large Language ModelPrivacy Preserving	CodeCode Available	3
Reasoning-Table: Exploring Reinforcement Learning for Table Reasoning	Jun 2, 2025	Fact VerificationLanguage Modeling	CodeCode Available	2
CSC-SQL: Corrective Self-Consistency in Text-to-SQL via Reinforcement Learning	May 19, 2025	Text to SQLText-To-SQL	CodeCode Available	2
LinkAlign: Scalable Schema Linking for Real-World Large-Scale Multi-Database Text-to-SQL	Mar 24, 2025	RetrievalText to SQL	CodeCode Available	2
Datrics Text2SQL. A Framework for Natural Language to SQL Query Generation	Mar 15, 2025	Natural Language QueriesRAG	CodeCode Available	2
Automatic database description generation for Text-to-SQL	Feb 28, 2025	Text to SQLText-To-SQL	CodeCode Available	2
SQL-o1: A Self-Reward Heuristic Dynamic Search Method for Text-to-SQL	Feb 17, 2025	Few-Shot LearningHeuristic Search	CodeCode Available	2
RSL-SQL: Robust Schema Linking in Text-to-SQL Generation	Oct 31, 2024	Text to SQLText-To-SQL	CodeCode Available	2
E-SQL: Direct Schema Linking via Question Enrichment in Text-to-SQL	Sep 25, 2024	Natural Language QueriesText to SQL	CodeCode Available	2
Before Generation, Align it! A Novel and Effective Strategy for Mitigating Hallucinations in Text-to-SQL Generation	May 24, 2024	In-Context LearningText to SQL	CodeCode Available	2
Overview of the EHRSQL 2024 Shared Task on Reliable Text-to-SQL Modeling on Electronic Health Records	May 4, 2024	Information RetrievalQuestion Answering	CodeCode Available	2
PET-SQL: A Prompt-Enhanced Two-Round Refinement of Text-to-SQL with Cross-consistency	Mar 13, 2024	In-Context LearningText to SQL	CodeCode Available	2
CodeS: Towards Building Open-source Language Models for Text-to-SQL	Feb 26, 2024	Data AugmentationDiagnostic	CodeCode Available	2
When is Tree Search Useful for LLM Planning? It Depends on the Discriminator	Feb 16, 2024	Mathematical ReasoningRe-Ranking	CodeCode Available	2
MAC-SQL: A Multi-Agent Collaborative Framework for Text-to-SQL	Dec 18, 2023	SQL ParsingText to SQL	CodeCode Available	2
Text-to-SQL Empowered by Large Language Models: A Benchmark Evaluation	Aug 29, 2023	Prompt EngineeringText to SQL	CodeCode Available	2
DIN-SQL: Decomposed In-Context Learning of Text-to-SQL with Self-Correction	Apr 21, 2023	In-Context LearningText to SQL	CodeCode Available	2
RESDSQL: Decoupling Schema Linking and Skeleton Parsing for Text-to-SQL	Feb 12, 2023	DecoderLanguage Modeling	CodeCode Available	2
SeaD: End-to-end Text-to-SQL Generation with Schema-aware Denoising	May 17, 2021	DecoderDenoising	CodeCode Available	2
SeqGenSQL -- A Robust Sequence Generation Model for Structured Query Language	Nov 7, 2020	Text GenerationText to SQL	CodeCode Available	2
TableQA: a Large-Scale Chinese Text-to-SQL Dataset for Table-Aware SQL Generation	Jun 10, 2020	Text to SQLText-To-SQL	CodeCode Available	2
A Pilot Study for Chinese SQL Semantic Parsing	Sep 29, 2019	Cross-Lingual Word EmbeddingsQuestion Answering	CodeCode Available	2
Spider: A Large-Scale Human-Labeled Dataset for Complex and Cross-Domain Semantic Parsing and Text-to-SQL Task	Sep 24, 2018	Semantic ParsingText to SQL	CodeCode Available	2
SQLNet: Generating Structured Queries From Natural Language Without Reinforcement Learning	Nov 13, 2017	Decoderreinforcement-learning	CodeCode Available	2
Schema-R1: A reasoning training approach for schema linking in Text-to-SQL Task	Jun 13, 2025	reinforcement-learningReinforcement Learning	CodeCode Available	1
SEED: Enhancing Text-to-SQL Performance and Practical Usability Through Automatic Evidence Generation	Jun 9, 2025	Natural Language QueriesText to SQL	CodeCode Available	1
ExeSQL: Self-Taught Text-to-SQL Models with Execution-Driven Bootstrapping for SQL Dialects	May 22, 2025	Text to SQLText-To-SQL	CodeCode Available	1
Reward-SQL: Boosting Text-to-SQL via Stepwise Reasoning and Process-Supervised Rewards	May 7, 2025	Text to SQLText-To-SQL	CodeCode Available	1
Task-Circuit Quantization: Leveraging Knowledge Localization and Interpretability for Compression	Apr 10, 2025	MathMMLU	CodeCode Available	1
ELT-Bench: An End-to-End Benchmark for Evaluating AI Agents on ELT Pipelines	Apr 7, 2025	AI AgentText to SQL	CodeCode Available	1
Uncovering the Impact of Chain-of-Thought Reasoning for Direct Preference Optimization: Lessons from Text-to-SQL	Feb 17, 2025	Code GenerationMath	CodeCode Available	1
BASE-SQL: A powerful open source Text-To-SQL baseline approach	Feb 15, 2025	In-Context LearningLarge Language Model	CodeCode Available	1
A Study of In-Context-Learning-Based Text-to-SQL Errors	Jan 16, 2025	In-Context LearningText to SQL	CodeCode Available	1
ROUTE: Robust Multitask Tuning and Collaboration for Text-to-SQL	Dec 13, 2024	In-Context LearningText to SQL	CodeCode Available	1
Towards Automated Cross-domain Exploratory Data Analysis through Large Language Models	Dec 10, 2024	Data VisualizationDomain Generalization	CodeCode Available	1
MSc-SQL: Multi-Sample Critiquing Small Language Models For Text-To-SQL Translation	Oct 16, 2024	Text to SQLText-To-SQL	CodeCode Available	1
FLEX: Expert-level False-Less EXecution Metric for Reliable Text-to-SQL Benchmark	Sep 24, 2024	Text to SQLText-To-SQL	CodeCode Available	1
MAG-SQL: Multi-Agent Generative Approach with Soft Schema Linking and Iterative Sub-SQL Refinement for Text-to-SQL	Aug 15, 2024	In-Context LearningText to SQL	CodeCode Available	1
AMBROSIA: A Benchmark for Parsing Ambiguous Questions into Database Queries	Jun 27, 2024	Text to SQLText-To-SQL	CodeCode Available	1

Show:10 25 50

← PrevPage 1 of 9Next →

All datasets BIRD (BIg Bench for LaRge-scale Database Grounded Text-to-SQL Evaluation)spider Spider 2.0 SParC KaggleDBQA SEDE SQL-Eval Text-To-SQL

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	Human Performance	Execution Accurarcy (Human)	92.96	—	Unverified
2	XiYan-SQL	Execution Accuracy % (Test)	75.63	—	Unverified
3	DSAIR + GPT-4o	Execution Accuracy % (Test)	74.12	—	Unverified
4	CHASE-SQL + Gemini	Execution Accuracy % (Test)	74.06	—	Unverified
5	ExSL + granite-34b-code	Execution Accuracy % (Test)	73.17	—	Unverified
6	OpenSearch-SQL+ v2 + GPT-4o	Execution Accuracy % (Test)	72.28	—	Unverified
7	Distillery + GPT-4o	Execution Accuracy % (Test)	71.83	—	Unverified
8	Insights AI	Execution Accuracy % (Test)	70.26	—	Unverified
9	PURPLE + RED + GPT-4o	Execution Accuracy % (Test)	70.21	—	Unverified
10	MCTS-SQL	Execution Accuracy % (Test)	69.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	XiYan-SQL	Execution Accuracy (Test)	89.65	—	Unverified
2	PET-SQL	Execution Accuracy (Test)	87.6	—	Unverified
3	datagpt-sql-7B + InvalidSQL-Feedback	Execution Accuracy (Dev)	87.2	—	Unverified
4	DAIL-SQL + GPT-4 + Self-Consistency	Execution Accuracy (Test)	86.6	—	Unverified
5	DIN-SQL + GPT-4	Execution Accuracy (Test)	85.3	—	Unverified
6	datagpt-sql-7B	Execution Accuracy (Dev)	84.8	—	Unverified
7	MSc-SQL	Execution Accuracy (Test)	84.7	—	Unverified
8	MARLO + Claude 2.1	Execution Accuracy (Test)	84	—	Unverified
9	C3 + ChatGPT + Zero-Shot	Execution Accuracy (Test)	82.3	—	Unverified
10	code-davinci-002 175B (LEVER)	Execution Accuracy (Dev)	81.9	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Spider-Agent + o1-preview	Success Rate	17.03	—	Unverified
2	Spider-Agent + GPT-4o	Success Rate	10.13	—	Unverified
3	Spider-Agent + Claude-3.5-Sonnect	Success Rate	9.02	—	Unverified
4	Spider-Agent + GPT-4	Success Rate	8.86	—	Unverified
5	Spider-Agent + Qwen2.5-72B	Success Rate	6.17	—	Unverified
6	Spider-Agent + DeepSeek-V2.5	Success Rate	5.22	—	Unverified
7	Spider-Agent + Gemini-Pro-1.5	Success Rate	2.53	—	Unverified
8	Spider-Agent + Llama-3.1-405B	Success Rate	2.21	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	RASAT+PICARD	interaction match accuracy	45.2	—	Unverified
2	RAT-SQL-TC + GAP	interaction match accuracy	43.2	—	Unverified
3	HIE-SQL + GraPPa	interaction match accuracy	42.9	—	Unverified
4	RAT-SQL + SCoRe	interaction match accuracy	38.1	—	Unverified
5	EditSQL + BERT	interaction match accuracy	25.3	—	Unverified
6	GAZP + BERT	interaction match accuracy	23.5	—	Unverified
7	SyntaxSQL-con	interaction match accuracy	5.2	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	RAT-SQL	Exact Match (EM)	26.77	—	Unverified
2	Edit-SQL	Exact Match (EM)	11.73	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	T5-Large	PCM-F1 (dev)	48.2	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	XiYan-SQL	Execution Accuracy	69.86	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Orange-mini	0-shot MRR	74.17	—	Unverified