Text-To-SQL

Text-to-SQL is a task in natural language processing (NLP) where the goal is to automatically generate SQL queries from natural language text. The task involves converting the text input into a structured representation and then using this representation to generate a semantically correct SQL query that can be executed on a database.
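The two stages described above (text to a structured representation, then representation to executable SQL) can be illustrated with a deliberately tiny, rule-based sketch. It is not the method of any listed paper: the `singer` schema, the pattern rules, and the function names `parse_question`, `render_sql`, and `answer` are all hypothetical, and real systems replace the hand-written rules with learned models.

```python
import re
import sqlite3

# Hypothetical toy schema: one table with two columns.
SCHEMA = {"singer": ["name", "age"]}

def parse_question(question):
    """Map a question to a small intermediate representation (a dict)."""
    q = question.lower()
    # Pick the first schema table whose name occurs in the question.
    table = next(t for t in SCHEMA if t in q)
    ir = {
        "select": "COUNT(*)" if q.startswith("how many") else "name",
        "table": table,
        "where": None,
    }
    match = re.search(r"older than (\d+)", q)
    if match:
        ir["where"] = ("age", ">", int(match.group(1)))
    return ir

def render_sql(ir):
    """Turn the intermediate representation into parameterized SQL."""
    sql = f"SELECT {ir['select']} FROM {ir['table']}"
    params = ()
    if ir["where"]:
        column, op, value = ir["where"]
        sql += f" WHERE {column} {op} ?"
        params = (value,)
    return sql, params

def answer(question, conn):
    """Generate SQL for the question and execute it on the database."""
    sql, params = render_sql(parse_question(question))
    return sql, conn.execute(sql, params).fetchall()

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE singer (name TEXT, age INTEGER)")
conn.executemany(
    "INSERT INTO singer VALUES (?, ?)",
    [("Ana", 25), ("Bo", 41), ("Cy", 33)],
)

sql, rows = answer("How many singers are older than 30?", conn)
print(sql)   # SELECT COUNT(*) FROM singer WHERE age > ?
print(rows)  # [(2,)]
```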


Papers


Showing 301–350 of 424 papers

Title	Date	Tasks	Status
STaR-SQL: Self-Taught Reasoner for Text-to-SQL	Feb 19, 2025	Text to SQL, Text-To-SQL	Unverified
StatBot.Swiss: Bilingual Open Data Exploration in Natural Language	Jun 5, 2024	In-Context Learning, Text to SQL	Unverified
Structured Case-based Reasoning for Inference-time Adaptation of Text-to-SQL parsers	Jan 10, 2023	Decoder, Semantic Parsing	Unverified
Structure-Grounded Pretraining for Text-to-SQL	Oct 24, 2020	Text to SQL, Text-To-SQL	Unverified
Structure Guided Large Language Model for SQL Generation	Feb 19, 2024	Language Modeling, Language Modelling	Unverified
Structuring the Unstructured: A Multi-Agent System for Extracting and Querying Financial KPIs and Guidance	May 25, 2025	Natural Language Queries, Retrieval	Unverified
SWE-SQL: Illuminating LLM Pathways to Solve User SQL Issues in Real-World Applications	Jun 23, 2025	Text to SQL, Text-To-SQL	Unverified
Syntactic and Semantic Control of Large Language Models via Sequential Monte Carlo	Apr 17, 2025	Code Generation, Probabilistic Programming	Unverified
SyntaxSQLNet: Syntax Tree Networks for Complex and Cross-Domain Text-to-SQL Task	Oct 1, 2018	Decoder, Semantic Parsing	Unverified
T5QL: Taming language models for SQL generation	Sep 21, 2022	Code Generation, Re-Ranking	Unverified
TARGET: Benchmarking Table Retrieval for Generative Tasks	May 14, 2025	Benchmarking, Representation Learning	Unverified
Text-to-SQL based on Large Language Models and Database Keyword Search	Jan 23, 2025	Text to SQL, Text-To-SQL	Unverified
Text-to-SQL Calibration: No Need to Ask -- Just Rescale Model Probabilities	Nov 23, 2024	Natural Language Queries, Text to SQL	Unverified
The Death of Schema Linking? Text-to-SQL in the Age of Well-Reasoned Language Models	Aug 14, 2024	Natural Language Queries, Text to SQL	Unverified
The Role of Accuracy and Validation Effectiveness in Conversational Business Analytics	Nov 18, 2024	Text to SQL, Text-To-SQL	Unverified
TinySQL: A Progressive Text-to-SQL Dataset for Mechanistic Interpretability Research	Mar 17, 2025	Intent Recognition, Text to SQL	Unverified
Tool-Assisted Agent on SQL Inspection and Refinement in Real-World Scenarios	Aug 30, 2024	Management, Text to SQL	Unverified
Towards Compositionally Generalizable Semantic Parsing in Large Language Models: A Survey	Apr 15, 2024	Information Retrieval, Retrieval	Unverified
Towards Optimizing SQL Generation via LLM Routing	Nov 6, 2024	Text to SQL, Text-To-SQL	Unverified
Towards Robustness of Text-to-SQL Models Against Natural and Realistic Adversarial Table Perturbation	Nov 16, 2021	Text to SQL, Text-To-SQL	Unverified
Towards Understanding the Generalization of Medical Text-to-SQL Models and Datasets	Mar 22, 2023	Data Augmentation, Text to SQL	Unverified
Turing: an Accurate and Interpretable Multi-Hypothesis Cross-Domain Natural Language Database Interface	Jun 8, 2021	Text Generation, Text to SQL	Unverified
UniSAr: A Unified Structure-Aware Autoregressive Language Model for Text-to-SQL	Nov 16, 2021	Language Modeling, Language Modelling	Unverified
UNJOIN: Enhancing Multi-Table Text-to-SQL Generation via Schema Simplification	May 23, 2025	Retrieval, Text to SQL	Unverified
Unmasking Database Vulnerabilities: Zero-Knowledge Schema Inference Attacks in Text-to-SQL Systems	Jun 20, 2024	Language Modelling, Text to SQL	Unverified
Using LLM to select the right SQL Query from candidates	Jan 4, 2024	Code Generation, Text to SQL	Unverified
V-SQL: A View-based Two-stage Text-to-SQL Framework	Dec 17, 2024	Text to SQL, Text-To-SQL	Unverified
Weakly Supervised Text-to-SQL Parsing through Question Decomposition	Jan 16, 2022	SQL Parsing, Text to SQL	Unverified
"What Do You Mean by That?" A Parser-Independent Interactive Approach for Enhancing Text-to-SQL	Nov 1, 2020	Text to SQL, Text-To-SQL	Unverified
You Only Read Once (YORO): Learning to Internalize Database Knowledge for Text-to-SQL	Sep 18, 2024	Text to SQL, Text-To-SQL	Unverified
Valid Text-to-SQL Generation with Unification-based DeepStochLog	Mar 17, 2025	Language Modeling, Language Modelling	Code Available
AraSpider: Democratizing Arabic-to-SQL	Feb 12, 2024	Text to SQL, Text-To-SQL	Code Available
BiomedSQL: Text-to-SQL for Scientific Reasoning on Biomedical Knowledge Bases	May 23, 2025	Causal Inference, scientific discovery	Code Available
Disambiguate First Parse Later: Generating Interpretations for Ambiguity Resolution in Semantic Parsing	Feb 25, 2025	Semantic Parsing, Text to SQL	Code Available
MultiSpider: Towards Benchmarking Multilingual Text-to-SQL Semantic Parsing	Dec 27, 2022	Benchmarking, Semantic Parsing	Code Available
Bertrand-DR: Improving Text-to-SQL using a Discriminative Re-ranker	Feb 3, 2020	Text to SQL, Text-To-SQL	Code Available
MURRE: Multi-Hop Table Retrieval with Removal for Open-Domain Text-to-SQL	Feb 16, 2024	Open-Domain Question Answering, Question Answering	Code Available
PTD-SQL: Partitioning and Targeted Drilling with LLMs in Text-to-SQL	Sep 21, 2024	Math, Text to SQL	Code Available
Towards Generalizable and Robust Text-to-SQL Parsing	Oct 23, 2022	SQL Parsing, Text to SQL	Code Available
NL-EDIT: Correcting semantic parse errors through natural language interaction	Mar 26, 2021	Semantic Parsing, Text to SQL	Code Available
MAGNIFICo: Evaluating the In-Context Learning Ability of Large Language Models to Generalize to Novel Interpretations	Oct 18, 2023	In-Context Learning, Semantic Parsing	Code Available
LogicCat: A Chain-of-Thought Text-to-SQL Benchmark for Multi-Domain Reasoning Challenges	May 24, 2025	Benchmarking, Mathematical Reasoning	Code Available
Towards Knowledge-Intensive Text-to-SQL Semantic Parsing with Formulaic Knowledge	Jan 3, 2023	Semantic Parsing, Text to SQL	Code Available
DataGpt-SQL-7B: An Open-Source Language Model for Text-to-SQL	Sep 24, 2024	Language Modeling, Language Modelling	Code Available
VTS-LLM: Domain-Adaptive LLM Agent for Enhancing Awareness in Vessel Traffic Services through Natural Language	May 2, 2025	Management, Natural Language Queries	Code Available
OpenGrok: Enhancing SNS Data Processing with Distilled Knowledge and Mask-like Mechanisms	Feb 11, 2025	Knowledge Distillation, MMLU	Code Available
SQLformer: Deep Auto-Regressive Query Graph Generation for Text-to-SQL Translation	Oct 27, 2023	Decoder, Graph Generation	Code Available
Benchmarking and Improving Text-to-SQL Generation under Ambiguity	Oct 20, 2023	Benchmarking, Diversity	Code Available
LLM-SQL-Solver: Can LLMs Determine SQL Equivalence?	Dec 16, 2023	Question Answering, Text to SQL	Code Available
Leveraging Prior Experience: An Expandable Auxiliary Knowledge Base for Text-to-SQL	Nov 20, 2024	Continual Learning, In-Context Learning	Code Available



Datasets: BIRD (BIg Bench for LaRge-scale Database Grounded Text-to-SQL Evaluation), Spider, Spider 2.0, SParC, KaggleDBQA, SEDE, SQL-Eval, Text-To-SQL

Benchmark Results

Dataset: BIRD
#	Model	Metric	Claimed	Verified	Status
1	Human Performance	Execution Accuracy (Human)	92.96	—	Unverified
2	XiYan-SQL	Execution Accuracy % (Test)	75.63	—	Unverified
3	DSAIR + GPT-4o	Execution Accuracy % (Test)	74.12	—	Unverified
4	CHASE-SQL + Gemini	Execution Accuracy % (Test)	74.06	—	Unverified
5	ExSL + granite-34b-code	Execution Accuracy % (Test)	73.17	—	Unverified
6	OpenSearch-SQL+ v2 + GPT-4o	Execution Accuracy % (Test)	72.28	—	Unverified
7	Distillery + GPT-4o	Execution Accuracy % (Test)	71.83	—	Unverified
8	Insights AI	Execution Accuracy % (Test)	70.26	—	Unverified
9	PURPLE + RED + GPT-4o	Execution Accuracy % (Test)	70.21	—	Unverified
10	MCTS-SQL	Execution Accuracy % (Test)	69.4	—	Unverified
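
Execution accuracy, the metric reported in the table above, is generally computed by running the predicted and the reference ("gold") query on the same database and checking whether the results agree. The following is a minimal sketch under stated assumptions: results are compared as order-insensitive multisets of rows, a query that fails to execute counts as wrong, and the `singer` table and query pairs are invented for illustration. Official benchmark evaluators may differ in these details.

```python
import sqlite3
from collections import Counter

def run(conn, sql):
    """Execute a query and return its rows as a multiset, or None on error."""
    try:
        return Counter(conn.execute(sql).fetchall())
    except sqlite3.Error:
        return None

def execution_accuracy(conn, pairs):
    """Percentage of (predicted, gold) pairs whose results match."""
    hits = 0
    for predicted, gold in pairs:
        pred_rows = run(conn, predicted)
        gold_rows = run(conn, gold)
        if pred_rows is not None and pred_rows == gold_rows:
            hits += 1
    return 100.0 * hits / len(pairs)

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE singer (name TEXT, age INTEGER)")
conn.executemany(
    "INSERT INTO singer VALUES (?, ?)",
    [("Ana", 25), ("Bo", 41), ("Cy", 33)],
)

pairs = [
    # Different SQL text and row order, same rows: counted as correct.
    ("SELECT name FROM singer WHERE age > 30 ORDER BY name DESC",
     "SELECT name FROM singer WHERE age >= 31"),
    # Executes, but returns a different result: counted as wrong.
    ("SELECT COUNT(*) FROM singer",
     "SELECT COUNT(*) FROM singer WHERE age > 30"),
    # Invalid SQL: counted as wrong.
    ("SELEC name FROM singer",
     "SELECT name FROM singer"),
    # Two formulations of the maximum age: counted as correct.
    ("SELECT MAX(age) FROM singer",
     "SELECT age FROM singer ORDER BY age DESC LIMIT 1"),
]

score = execution_accuracy(conn, pairs)
print(score)  # 50.0
```

Comparing results rather than SQL strings credits queries that are written differently but return the same rows, which is why the first and last pairs count as correct.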

Dataset: Spider
#	Model	Metric	Claimed	Verified	Status
1	XiYan-SQL	Execution Accuracy (Test)	89.65	—	Unverified
2	PET-SQL	Execution Accuracy (Test)	87.6	—	Unverified
3	datagpt-sql-7B + InvalidSQL-Feedback	Execution Accuracy (Dev)	87.2	—	Unverified
4	DAIL-SQL + GPT-4 + Self-Consistency	Execution Accuracy (Test)	86.6	—	Unverified
5	DIN-SQL + GPT-4	Execution Accuracy (Test)	85.3	—	Unverified
6	datagpt-sql-7B	Execution Accuracy (Dev)	84.8	—	Unverified
7	MSc-SQL	Execution Accuracy (Test)	84.7	—	Unverified
8	MARLO + Claude 2.1	Execution Accuracy (Test)	84	—	Unverified
9	C3 + ChatGPT + Zero-Shot	Execution Accuracy (Test)	82.3	—	Unverified
10	code-davinci-002 175B (LEVER)	Execution Accuracy (Dev)	81.9	—	Unverified

Dataset: Spider 2.0
#	Model	Metric	Claimed	Verified	Status
1	Spider-Agent + o1-preview	Success Rate	17.03	—	Unverified
2	Spider-Agent + GPT-4o	Success Rate	10.13	—	Unverified
3	Spider-Agent + Claude-3.5-Sonnet	Success Rate	9.02	—	Unverified
4	Spider-Agent + GPT-4	Success Rate	8.86	—	Unverified
5	Spider-Agent + Qwen2.5-72B	Success Rate	6.17	—	Unverified
6	Spider-Agent + DeepSeek-V2.5	Success Rate	5.22	—	Unverified
7	Spider-Agent + Gemini-Pro-1.5	Success Rate	2.53	—	Unverified
8	Spider-Agent + Llama-3.1-405B	Success Rate	2.21	—	Unverified

Dataset: SParC
#	Model	Metric	Claimed	Verified	Status
1	RASAT+PICARD	interaction match accuracy	45.2	—	Unverified
2	RAT-SQL-TC + GAP	interaction match accuracy	43.2	—	Unverified
3	HIE-SQL + GraPPa	interaction match accuracy	42.9	—	Unverified
4	RAT-SQL + SCoRe	interaction match accuracy	38.1	—	Unverified
5	EditSQL + BERT	interaction match accuracy	25.3	—	Unverified
6	GAZP + BERT	interaction match accuracy	23.5	—	Unverified
7	SyntaxSQL-con	interaction match accuracy	5.2	—	Unverified
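
Interaction match accuracy, the metric in the table above, is stricter than per-question accuracy: in multi-turn benchmarks an interaction typically counts as solved only if every question in it is answered correctly. A minimal sketch, assuming per-turn correctness flags have already been computed by some matching procedure (the flags and function names below are made up for illustration):

```python
def interaction_match_accuracy(interactions):
    """Percentage of interactions in which every turn is correct."""
    solved = sum(1 for turns in interactions if all(turns))
    return 100.0 * solved / len(interactions)

def question_match_accuracy(interactions):
    """Percentage of individual turns that are correct."""
    flags = [flag for turns in interactions for flag in turns]
    return 100.0 * sum(flags) / len(flags)

# Each inner list holds hypothetical per-turn correctness flags
# for one multi-turn interaction.
interactions = [
    [True, True, True],
    [True, False],
    [True],
    [False, True, True, True],
]

im = interaction_match_accuracy(interactions)
qm = question_match_accuracy(interactions)
print(im)  # 50.0
print(qm)  # 80.0
```

A single wrong turn fails the whole interaction, so interaction-level scores sit well below question-level scores on the same predictions.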

Dataset: KaggleDBQA
#	Model	Metric	Claimed	Verified	Status
1	RAT-SQL	Exact Match (EM)	26.77	—	Unverified
2	Edit-SQL	Exact Match (EM)	11.73	—	Unverified

Dataset: SEDE
#	Model	Metric	Claimed	Verified	Status
1	T5-Large	PCM-F1 (dev)	48.2	—	Unverified

Dataset: SQL-Eval
#	Model	Metric	Claimed	Verified	Status
1	XiYan-SQL	Execution Accuracy	69.86	—	Unverified

Dataset: Text-To-SQL
#	Model	Metric	Claimed	Verified	Status
1	Orange-mini	0-shot MRR	74.17	—	Unverified
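
MRR (mean reciprocal rank), the metric in the last table, averages 1/rank of the first correct item over all queries. The sketch below shows the standard definition only. The page does not say what is being ranked for this benchmark, so the example ranks are invented, and scaling to a percentage is an assumption made to match the table's number format.

```python
def mean_reciprocal_rank(ranks):
    """Mean of 1/rank over queries; None means no correct item was ranked."""
    total = sum(1.0 / rank for rank in ranks if rank is not None)
    return total / len(ranks)

# Hypothetical 1-based rank of the first correct candidate per query.
ranks = [1, 2, None, 4]

mrr = mean_reciprocal_rank(ranks)
print(mrr)        # 0.4375
print(100 * mrr)  # 43.75
```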