Text-To-SQL

Text-to-SQL is a task in natural language processing (NLP) where the goal is to automatically generate SQL queries from natural language text. The task involves converting the text input into a structured representation and then using this representation to generate a semantically correct SQL query that can be executed on a database.
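As a rough illustration of the input and output of the task, the sketch below pairs one natural-language question with a SQL query and executes it against an in-memory SQLite database. The schema, the rows, and the names `SCHEMA`, `QUESTION`, `TARGET_SQL`, and `run_query` are all hypothetical and invented for this example; no model is involved, and the query is simply what a text-to-SQL system would be expected to produce for this question.

```python
import sqlite3

# Hypothetical schema and rows, invented purely for illustration.
SCHEMA = """
CREATE TABLE singer (id INTEGER PRIMARY KEY, name TEXT, country TEXT, age INTEGER);
INSERT INTO singer (name, country, age) VALUES
    ('Ana', 'France', 31),
    ('Ben', 'France', 45),
    ('Caro', 'Spain', 28);
"""

# Input: a natural-language question. Output: an executable SQL query.
QUESTION = "How many singers are from France?"
TARGET_SQL = "SELECT COUNT(*) FROM singer WHERE country = 'France'"


def run_query(sql):
    """Build the toy database in memory, run one query, and return all rows."""
    conn = sqlite3.connect(":memory:")
    try:
        conn.executescript(SCHEMA)
        return conn.execute(sql).fetchall()
    finally:
        conn.close()


result = run_query(TARGET_SQL)
print(QUESTION, "->", TARGET_SQL, "->", result)
```

Running the query returns a single row holding the count, which is the answer to the question.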

(Image credit: SyntaxSQLNet)

Papers

Showing 201–250 of 424 papers

Title	Date	Tasks	Status	Score
VTS-LLM: Domain-Adaptive LLM Agent for Enhancing Awareness in Vessel Traffic Services through Natural Language	May 2, 2025	Management, Natural Language Queries	Code Available	5
When Reasoning Beats Scale: A 1.5B Reasoning Model Outranks 13B LLMs as Discriminator	Apr 30, 2025	Text to SQL, Text-To-SQL	Code Available	5
XRICL: Cross-lingual Retrieval-Augmented In-Context Learning for Cross-lingual Text-to-SQL Semantic Parsing	Oct 25, 2022	In-Context Learning, Retrieval	Code Available	5
Zero-shot Text-to-SQL Learning with Auxiliary Task	Aug 29, 2019	Text to SQL, Text-To-SQL	Code Available	5
Learning Metadata-Agnostic Representations for Text-to-SQL In-Context Example Selection	Oct 17, 2024	In-Context Learning, Question Similarity	Unverified	0
Learning to Clarify: Multi-turn Conversations with Action-Based Contrastive Self-Training	May 31, 2024	Machine Reading Comprehension, Question Answering	Unverified	0
Evaluating the Text-to-SQL Capabilities of Large Language Models	Nov 16, 2021	Language Modeling, Language Modelling	Unverified	0
Evaluating LLMs for Text-to-SQL Generation With Complex SQL Workload	Jul 28, 2024	Decision Making, Prompt Engineering	Unverified	0
LEDD: Large Language Model-Empowered Data Discovery in Data Lakes	Feb 21, 2025	Language Modeling, Language Modelling	Unverified	0
Leveraging Adjective-Noun Phrasing Knowledge for Comparison Relation Prediction in Text-to-SQL	Nov 1, 2019	Relation, Relation Prediction	Unverified	0
Leveraging Explicit Lexico-logical Alignments in Text-to-SQL Parsing	May 1, 2022	SQL Parsing, Text to SQL	Unverified	0
Towards Generalizable and Robust Text-to-SQL Parsing	Oct 23, 2022	SQL Parsing, Text to SQL	Unverified	0
Evaluating Cross-Domain Text-to-SQL Models and Benchmarks	Oct 27, 2023	Natural Language Queries, Text to SQL	Unverified	0
EPI-SQL: Enhancing Text-to-SQL Translation with Error-Prevention Instructions	Apr 21, 2024	Natural Language Queries, Text to SQL	Unverified	0
LG AI Research & KAIST at EHRSQL 2024: Self-Training Large Language Models with Pseudo-Labeled Unanswerable Questions for a Reliable Text-to-SQL System on EHRs	May 18, 2024	Decision Making, Misinformation	Unverified	0
Enhancing Text-to-SQL Capabilities of Large Language Models via Domain Database Knowledge Injection	Sep 24, 2024	Hallucination, Semantic Parsing	Unverified	0
Enhancing Few-shot Text-to-SQL Capabilities of Large Language Models: A Study on Prompt Design Strategies	May 21, 2023	Diversity, In-Context Learning	Unverified	0
LLM-Driven Data Generation and a Novel Soft Metric for Evaluating Text-to-SQL in Aviation MRO	Jun 11, 2025	Text to SQL, Text-To-SQL	Unverified	0
LLM-Powered Agents for Navigating Venice's Historical Cadastre	May 22, 2025	Hallucination, Natural Language Queries	Unverified	0
Towards Knowledge-Intensive Text-to-SQL Semantic Parsing with Formulaic Knowledge	Jan 3, 2023	Semantic Parsing, Text to SQL	Unverified	0
Towards Optimizing SQL Generation via LLM Routing	Nov 6, 2024	Text to SQL, Text-To-SQL	Unverified	0
Lucy: Think and Reason to Solve Text-to-SQL	Jul 6, 2024	Text to SQL, Text-To-SQL	Unverified	0
End-to-end Text-to-SQL Generation within an Analytics Insight Engine	Jun 17, 2024	Text to SQL, Text-To-SQL	Unverified	0
End-to-End Cross-Domain Text-to-SQL Semantic Parsing with Auxiliary Task	Jun 17, 2021	Semantic Parsing, Text to SQL	Unverified	0
A Review of Cross-Domain Text-to-SQL Models	Dec 1, 2020	Text to SQL, Text-To-SQL	Unverified	0
EllieSQL: Cost-Efficient Text-to-SQL with Complexity-Aware Routing	Mar 28, 2025	Natural Language Queries, Text to SQL	Unverified	0
Makadi: A Large-Scale Human-Labeled Dataset for Hindi Semantic Parsing	Jun 1, 2022	Diversity, Natural Language Queries	Unverified	0
Making LLMs Work for Enterprise Data Tasks	Jul 22, 2024	Management, Text to SQL	Unverified	0
MCS-SQL: Leveraging Multiple Prompts and Multiple-Choice Selection For Text-to-SQL Generation	May 13, 2024	In-Context Learning, Multiple-choice	Unverified	0
MCTS-SQL: An Effective Framework for Text-to-SQL with Monte Carlo Tree Search	Jan 28, 2025	Natural Language Queries, Text to SQL	Unverified	0
Measuring and Improving Compositional Generalization in Text-to-SQL via Component Alignment	Oct 16, 2021	Sentence, Text to SQL	Unverified	0
DuSQL: A Large-Scale and Pragmatic Chinese Text-to-SQL Dataset	Nov 1, 2020	SQL Parsing, Text to SQL	Unverified	0
Mention Extraction and Linking for SQL Query Generation	Dec 18, 2020	Sentence, slot-filling	Unverified	0
Meta-aware Learning in text-to-SQL Large Language Model	May 25, 2025	Language Modeling, Language Modelling	Unverified	0
MIGA: A Unified Multi-task Generation Framework for Conversational Text-to-SQL	Dec 19, 2022	Text to SQL, Text-To-SQL	Unverified	0
DTS-SQL: Decomposed Text-to-SQL with Small Large Language Models	Feb 2, 2024	Text to SQL, Text-To-SQL	Unverified	0
DrugEHRQA: A Question Answering Dataset on Structured and Unstructured Electronic Health Records For Medicine Related Queries	Nov 16, 2021	Question Answering, Text to SQL	Unverified	0
Domain Adaptation of a State of the Art Text-to-SQL Model: Lessons Learned and Challenges Found	Dec 9, 2023	Domain Adaptation, Language Modeling	Unverified	0
MT-Teql: Evaluating and Augmenting Consistency of Text-to-SQL Models with Metamorphic Testing	Dec 21, 2020	Text to SQL, Text-To-SQL	Unverified	0
Towards Robustness of Text-to-SQL Models Against Natural and Realistic Adversarial Table Perturbation	Nov 16, 2021	Text to SQL, Text-To-SQL	Unverified	0
MultiSpider: Towards Benchmarking Multilingual Text-to-SQL Semantic Parsing	Dec 27, 2022	Benchmarking, Semantic Parsing	Unverified	0
Natural Language Interfaces for Tabular Data Querying and Visualization: A Survey	Oct 27, 2023	Data Interaction, Data Visualization	Unverified	0
DocuT5: Seq2seq SQL Generation with Table Documentation	Nov 11, 2022	Domain Generalization, Language Modeling	Unverified	0
N-Best Hypotheses Reranking for Text-To-SQL Systems	Oct 19, 2022	Reranking, Text to SQL	Unverified	0
Next-Generation Database Interfaces: A Survey of LLM-based Text-to-SQL	Jun 12, 2024	Natural Language Understanding, Text to SQL	Unverified	0
Towards Robustness of Text-to-SQL Models Against Natural and Realistic Adversarial Table Perturbation	Dec 20, 2022	Text to SQL, Text-To-SQL	Unverified	0
Divide and Prompt: Chain of Thought Prompting for Text-to-SQL	Apr 23, 2023	Semantic Parsing, Text to SQL	Unverified	0
Diverse Parallel Data Synthesis for Cross-Database Adaptation of Text-to-SQL Parsers	Oct 29, 2022	Data Augmentation, Natural Language Queries	Unverified	0
On Linearizing Structured Data in Encoder-Decoder Language Models: Insights from Text-to-SQL	Apr 3, 2024	Decoder, Knowledge Graphs	Unverified	0
On the Security Vulnerabilities of Text-to-SQL Models	Nov 28, 2022	Text to SQL, Text-To-SQL	Unverified	0

Datasets: BIRD (BIg Bench for LaRge-scale Database Grounded Text-to-SQL Evaluation), Spider, Spider 2.0, SParC, KaggleDBQA, SEDE, SQL-Eval, Text-To-SQL

Benchmark Results

BIRD

#	Model	Metric	Claimed	Verified	Status
1	Human Performance	Execution Accuracy (Human)	92.96	—	Unverified
2	XiYan-SQL	Execution Accuracy % (Test)	75.63	—	Unverified
3	DSAIR + GPT-4o	Execution Accuracy % (Test)	74.12	—	Unverified
4	CHASE-SQL + Gemini	Execution Accuracy % (Test)	74.06	—	Unverified
5	ExSL + granite-34b-code	Execution Accuracy % (Test)	73.17	—	Unverified
6	OpenSearch-SQL+ v2 + GPT-4o	Execution Accuracy % (Test)	72.28	—	Unverified
7	Distillery + GPT-4o	Execution Accuracy % (Test)	71.83	—	Unverified
8	Insights AI	Execution Accuracy % (Test)	70.26	—	Unverified
9	PURPLE + RED + GPT-4o	Execution Accuracy % (Test)	70.21	—	Unverified
10	MCTS-SQL	Execution Accuracy % (Test)	69.4	—	Unverified

Spider

#	Model	Metric	Claimed	Verified	Status
1	XiYan-SQL	Execution Accuracy (Test)	89.65	—	Unverified
2	PET-SQL	Execution Accuracy (Test)	87.6	—	Unverified
3	datagpt-sql-7B + InvalidSQL-Feedback	Execution Accuracy (Dev)	87.2	—	Unverified
4	DAIL-SQL + GPT-4 + Self-Consistency	Execution Accuracy (Test)	86.6	—	Unverified
5	DIN-SQL + GPT-4	Execution Accuracy (Test)	85.3	—	Unverified
6	datagpt-sql-7B	Execution Accuracy (Dev)	84.8	—	Unverified
7	MSc-SQL	Execution Accuracy (Test)	84.7	—	Unverified
8	MARLO + Claude 2.1	Execution Accuracy (Test)	84	—	Unverified
9	C3 + ChatGPT + Zero-Shot	Execution Accuracy (Test)	82.3	—	Unverified
10	code-davinci-002 175B (LEVER)	Execution Accuracy (Dev)	81.9	—	Unverified

Spider 2.0

#	Model	Metric	Claimed	Verified	Status
1	Spider-Agent + o1-preview	Success Rate	17.03	—	Unverified
2	Spider-Agent + GPT-4o	Success Rate	10.13	—	Unverified
3	Spider-Agent + Claude-3.5-Sonnet	Success Rate	9.02	—	Unverified
4	Spider-Agent + GPT-4	Success Rate	8.86	—	Unverified
5	Spider-Agent + Qwen2.5-72B	Success Rate	6.17	—	Unverified
6	Spider-Agent + DeepSeek-V2.5	Success Rate	5.22	—	Unverified
7	Spider-Agent + Gemini-Pro-1.5	Success Rate	2.53	—	Unverified
8	Spider-Agent + Llama-3.1-405B	Success Rate	2.21	—	Unverified

SParC

#	Model	Metric	Claimed	Verified	Status
1	RASAT+PICARD	interaction match accuracy	45.2	—	Unverified
2	RAT-SQL-TC + GAP	interaction match accuracy	43.2	—	Unverified
3	HIE-SQL + GraPPa	interaction match accuracy	42.9	—	Unverified
4	RAT-SQL + SCoRe	interaction match accuracy	38.1	—	Unverified
5	EditSQL + BERT	interaction match accuracy	25.3	—	Unverified
6	GAZP + BERT	interaction match accuracy	23.5	—	Unverified
7	SyntaxSQL-con	interaction match accuracy	5.2	—	Unverified

KaggleDBQA

#	Model	Metric	Claimed	Verified	Status
1	RAT-SQL	Exact Match (EM)	26.77	—	Unverified
2	Edit-SQL	Exact Match (EM)	11.73	—	Unverified

SEDE

#	Model	Metric	Claimed	Verified	Status
1	T5-Large	PCM-F1 (dev)	48.2	—	Unverified

SQL-Eval

#	Model	Metric	Claimed	Verified	Status
1	XiYan-SQL	Execution Accuracy	69.86	—	Unverified

Text-To-SQL

#	Model	Metric	Claimed	Verified	Status
1	Orange-mini	0-shot MRR	74.17	—	Unverified
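
Several of the tables above report execution accuracy. Broadly, the metric counts a prediction as correct when running it returns the same result as the reference query, rather than when the SQL text matches. The sketch below is a simplified illustration under stated assumptions, not the official evaluator of any benchmark listed here: it compares results as order-insensitive multisets of rows, treats a failing prediction as wrong, and uses a made-up table and made-up query pairs. The helper names `execute` and `execution_accuracy` are hypothetical.

```python
import sqlite3
from collections import Counter


def execute(conn, sql):
    """Return the rows as an order-insensitive multiset, or None if the query fails."""
    try:
        return Counter(conn.execute(sql).fetchall())
    except sqlite3.Error:
        return None


def execution_accuracy(conn, pairs):
    """Percentage of (predicted, gold) pairs whose execution results match."""
    correct = 0
    for predicted_sql, gold_sql in pairs:
        gold = execute(conn, gold_sql)
        predicted = execute(conn, predicted_sql)
        if gold is not None and predicted == gold:
            correct += 1
    return 100.0 * correct / len(pairs)


# Toy database, invented for illustration only.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE singer (name TEXT, country TEXT);
INSERT INTO singer VALUES ('Ana', 'France'), ('Ben', 'France'), ('Caro', 'Spain');
""")

pairs = [
    # Different SQL text, same result: counted as correct.
    ("SELECT COUNT(name) FROM singer WHERE country = 'France'",
     "SELECT COUNT(*) FROM singer WHERE country = 'France'"),
    # Wrong filter: the result differs from the gold result.
    ("SELECT name FROM singer WHERE country = 'Spain'",
     "SELECT name FROM singer WHERE country = 'France'"),
    # Invalid SQL: counted as incorrect.
    ("SELEC name FROM singer", "SELECT name FROM singer"),
]

score = execution_accuracy(conn, pairs)
print(f"Execution accuracy: {score:.2f}%")
```

Real benchmarks differ in how they handle row order, duplicate rows, and value normalization, so scores from a sketch like this are not comparable with the claimed numbers in the tables.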