Text-To-SQL

Text-to-SQL is a task in natural language processing (NLP) where the goal is to automatically generate SQL queries from natural language text. The task involves converting the text input into a structured representation and then using this representation to generate a semantically correct SQL query that can be executed on a database.

( Image credit: SyntaxSQLNet )

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 151–200 of 424 papers

Title	Date	Tasks	Status	Hype
RH-SQL: Refined Schema and Hardness Prompt for Text-to-SQL	Jun 13, 2024	Language ModelingLanguage Modelling	—Unverified	0
Next-Generation Database Interfaces: A Survey of LLM-based Text-to-SQL	Jun 12, 2024	Natural Language UnderstandingText to SQL	—Unverified	0
BookSQL: A Large Scale Text-to-SQL Dataset for Accounting Domain	Jun 12, 2024	Natural Language QueriesText to SQL	CodeCode Available	1
StatBot.Swiss: Bilingual Open Data Exploration in Natural Language	Jun 5, 2024	In-Context LearningText to SQL	—Unverified	0
Learning to Clarify: Multi-turn Conversations with Action-Based Contrastive Self-Training	May 31, 2024	Machine Reading ComprehensionQuestion Answering	—Unverified	0
CHESS: Contextual Harnessing for Efficient SQL Synthesis	May 27, 2024	Large Language ModelPrivacy Preserving	CodeCode Available	3
Before Generation, Align it! A Novel and Effective Strategy for Mitigating Hallucinations in Text-to-SQL Generation	May 24, 2024	In-Context LearningText to SQL	CodeCode Available	2
EHR-SeqSQL : A Sequential Text-to-SQL Dataset For Interactively Exploring Electronic Health Records	May 23, 2024	SQL ParsingText to SQL	CodeCode Available	1
KU-DMIS at EHRSQL 2024:Generating SQL query via question templatization in EHR	May 22, 2024	Language ModelingLanguage Modelling	—Unverified	0
Increasing the LLM Accuracy for Question Answering: Ontologies to the Rescue!	May 20, 2024	Knowledge GraphsQuestion Answering	—Unverified	0
LG AI Research & KAIST at EHRSQL 2024: Self-Training Large Language Models with Pseudo-Labeled Unanswerable Questions for a Reliable Text-to-SQL System on EHRs	May 18, 2024	Decision MakingMisinformation	—Unverified	0
SQL-to-Schema Enhances Schema Linking in Text-to-SQL	May 15, 2024	Text to SQLText-To-SQL	—Unverified	0
PromptMind Team at EHRSQL-2024: Improving Reliability of SQL Generation using Ensemble LLMs	May 14, 2024	Text to SQLText-To-SQL	—Unverified	0
MCS-SQL: Leveraging Multiple Prompts and Multiple-Choice Selection For Text-to-SQL Generation	May 13, 2024	In-Context LearningMultiple-choice	—Unverified	0
Overview of the EHRSQL 2024 Shared Task on Reliable Text-to-SQL Modeling on Electronic Health Records	May 4, 2024	Information RetrievalQuestion Answering	CodeCode Available	2
Open-SQL Framework: Enhancing Text-to-SQL on Open-source Large Language Models	May 4, 2024	Few-Shot LearningText to SQL	—Unverified	0
CoE-SQL: In-Context Learning for Multi-Turn Text-to-SQL with Chain-of-Editions	May 4, 2024	In-Context LearningText to SQL	CodeCode Available	1
ProbGate at EHRSQL 2024: Enhancing SQL Query Generation Accuracy through Probabilistic Threshold Filtering and Error Handling	Apr 25, 2024	Text to SQLText-To-SQL	CodeCode Available	0
EPI-SQL: Enhancing Text-to-SQL Translation with Error-Prevention Instructions	Apr 21, 2024	Natural Language QueriesText to SQL	—Unverified	0
Dubo-SQL: Diverse Retrieval-Augmented Generation and Fine Tuning for Text-to-SQL	Apr 19, 2024	RAGRetrieval	CodeCode Available	1
Demonstration of DB-GPT: Next Generation Data Interaction System Empowered by Large Language Models	Apr 16, 2024	Data InteractionText to SQL	CodeCode Available	11
Towards Compositionally Generalizable Semantic Parsing in Large Language Models: A Survey	Apr 15, 2024	Information RetrievalRetrieval	—Unverified	0
TabSQLify: Enhancing Reasoning Capabilities of LLMs Through Table Decomposition	Apr 15, 2024	Natural Language UnderstandingQuestion Answering	CodeCode Available	1
On Linearizing Structured Data in Encoder-Decoder Language Models: Insights from Text-to-SQL	Apr 3, 2024	DecoderKnowledge Graphs	—Unverified	0
TrustSQL: Benchmarking Text-to-SQL Reliability with Penalty-Based Scoring	Mar 23, 2024	BenchmarkingText to SQL	CodeCode Available	0
Retrieval augmented text-to-SQL generation for epidemiological question answering using electronic health records	Mar 14, 2024	Question AnsweringRAG	CodeCode Available	1
PET-SQL: A Prompt-Enhanced Two-Round Refinement of Text-to-SQL with Cross-consistency	Mar 13, 2024	In-Context LearningText to SQL	CodeCode Available	2
Schema-Aware Multi-Task Learning for Complex Text-to-SQL	Mar 9, 2024	DecoderMulti-Task Learning	—Unverified	0
Benchmarking the Text-to-SQL Capability of Large Language Models: A Comprehensive Evaluation	Mar 5, 2024	BenchmarkingIn-Context Learning	—Unverified	0
DFIN-SQL: Integrating Focused Schema with DIN-SQL for Superior Accuracy in Large-Scale Databases	Mar 1, 2024	In-Context LearningNatural Language Queries	—Unverified	0
CodeS: Towards Building Open-source Language Models for Text-to-SQL	Feb 26, 2024	Data AugmentationDiagnostic	CodeCode Available	2
Ar-Spider: Text-to-SQL in Arabic	Feb 22, 2024	Semantic ParsingText to SQL	—Unverified	0
R^3: "This is My SQL, Are You With Me?" A Consensus-Based Multi-Agent System for Text-to-SQL Tasks	Feb 20, 2024	Text to SQLText-To-SQL	—Unverified	0
Structure Guided Large Language Model for SQL Generation	Feb 19, 2024	Language ModelingLanguage Modelling	—Unverified	0
Archer: A Human-Labeled Text-to-SQL Dataset with Arithmetic, Commonsense and Hypothetical Reasoning	Feb 19, 2024	Text to SQLText-To-SQL	—Unverified	0
Understanding the Effects of Noise in Text-to-SQL: An Examination of the BIRD-Bench Benchmark	Feb 19, 2024	Text to SQLText-To-SQL	CodeCode Available	1
Knowledge-to-SQL: Enhancing SQL Generation with Data Expert LLM	Feb 18, 2024	Text to SQLText-To-SQL	CodeCode Available	0
Decomposition for Enhancing Attention: Improving LLM-based Text-to-SQL through Workflow Paradigm	Feb 16, 2024	Active LearningIn-Context Learning	CodeCode Available	1
Improving Demonstration Diversity by Human-Free Fusing for Text-to-SQL	Feb 16, 2024	DiversityIn-Context Learning	CodeCode Available	0
MURRE: Multi-Hop Table Retrieval with Removal for Open-Domain Text-to-SQL	Feb 16, 2024	Open-Domain Question AnsweringQuestion Answering	CodeCode Available	0
When is Tree Search Useful for LLM Planning? It Depends on the Discriminator	Feb 16, 2024	Mathematical ReasoningRe-Ranking	CodeCode Available	2
Improving Generalization in Semantic Parsing by Increasing Natural Language Variation	Feb 13, 2024	Data AugmentationSemantic Parsing	CodeCode Available	0
Evaluating the Data Model Robustness of Text-to-SQL Systems Based on Real User Queries	Feb 13, 2024	Language ModellingText to SQL	CodeCode Available	0
Investigating the Impact of Data Contamination of Large Language Models in Text-to-SQL Translation	Feb 12, 2024	Instruction FollowingText to SQL	—Unverified	0
AraSpider: Democratizing Arabic-to-SQL	Feb 12, 2024	Text to SQLText-To-SQL	CodeCode Available	0
DTS-SQL: Decomposed Text-to-SQL with Small Large Language Models	Feb 2, 2024	Text to SQLText-To-SQL	—Unverified	0
Analyzing the Effectiveness of Large Language Models on Text-to-SQL Synthesis	Jan 22, 2024	16kProgram Synthesis	CodeCode Available	1
FinSQL: Model-Agnostic LLMs-based Text-to-SQL Framework for Financial Analysis	Jan 19, 2024	Financial AnalysisLanguage Modelling	—Unverified	0
Using LLM to select the right SQL Query from candidates	Jan 4, 2024	Code GenerationText to SQL	—Unverified	0
Semantic Parsing for Complex Data Retrieval: Targeting Query Plans vs. SQL for No-Code Access to Relational Databases	Dec 22, 2023	RetrievalSemantic Parsing	—Unverified	0

Show:10 25 50

← PrevPage 4 of 9Next →

All datasets BIRD (BIg Bench for LaRge-scale Database Grounded Text-to-SQL Evaluation)spider Spider 2.0 SParC KaggleDBQA SEDE SQL-Eval Text-To-SQL

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	Human Performance	Execution Accurarcy (Human)	92.96	—	Unverified
2	XiYan-SQL	Execution Accuracy % (Test)	75.63	—	Unverified
3	DSAIR + GPT-4o	Execution Accuracy % (Test)	74.12	—	Unverified
4	CHASE-SQL + Gemini	Execution Accuracy % (Test)	74.06	—	Unverified
5	ExSL + granite-34b-code	Execution Accuracy % (Test)	73.17	—	Unverified
6	OpenSearch-SQL+ v2 + GPT-4o	Execution Accuracy % (Test)	72.28	—	Unverified
7	Distillery + GPT-4o	Execution Accuracy % (Test)	71.83	—	Unverified
8	Insights AI	Execution Accuracy % (Test)	70.26	—	Unverified
9	PURPLE + RED + GPT-4o	Execution Accuracy % (Test)	70.21	—	Unverified
10	MCTS-SQL	Execution Accuracy % (Test)	69.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	XiYan-SQL	Execution Accuracy (Test)	89.65	—	Unverified
2	PET-SQL	Execution Accuracy (Test)	87.6	—	Unverified
3	datagpt-sql-7B + InvalidSQL-Feedback	Execution Accuracy (Dev)	87.2	—	Unverified
4	DAIL-SQL + GPT-4 + Self-Consistency	Execution Accuracy (Test)	86.6	—	Unverified
5	DIN-SQL + GPT-4	Execution Accuracy (Test)	85.3	—	Unverified
6	datagpt-sql-7B	Execution Accuracy (Dev)	84.8	—	Unverified
7	MSc-SQL	Execution Accuracy (Test)	84.7	—	Unverified
8	MARLO + Claude 2.1	Execution Accuracy (Test)	84	—	Unverified
9	C3 + ChatGPT + Zero-Shot	Execution Accuracy (Test)	82.3	—	Unverified
10	code-davinci-002 175B (LEVER)	Execution Accuracy (Dev)	81.9	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Spider-Agent + o1-preview	Success Rate	17.03	—	Unverified
2	Spider-Agent + GPT-4o	Success Rate	10.13	—	Unverified
3	Spider-Agent + Claude-3.5-Sonnect	Success Rate	9.02	—	Unverified
4	Spider-Agent + GPT-4	Success Rate	8.86	—	Unverified
5	Spider-Agent + Qwen2.5-72B	Success Rate	6.17	—	Unverified
6	Spider-Agent + DeepSeek-V2.5	Success Rate	5.22	—	Unverified
7	Spider-Agent + Gemini-Pro-1.5	Success Rate	2.53	—	Unverified
8	Spider-Agent + Llama-3.1-405B	Success Rate	2.21	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	RASAT+PICARD	interaction match accuracy	45.2	—	Unverified
2	RAT-SQL-TC + GAP	interaction match accuracy	43.2	—	Unverified
3	HIE-SQL + GraPPa	interaction match accuracy	42.9	—	Unverified
4	RAT-SQL + SCoRe	interaction match accuracy	38.1	—	Unverified
5	EditSQL + BERT	interaction match accuracy	25.3	—	Unverified
6	GAZP + BERT	interaction match accuracy	23.5	—	Unverified
7	SyntaxSQL-con	interaction match accuracy	5.2	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	RAT-SQL	Exact Match (EM)	26.77	—	Unverified
2	Edit-SQL	Exact Match (EM)	11.73	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	T5-Large	PCM-F1 (dev)	48.2	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	XiYan-SQL	Execution Accuracy	69.86	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Orange-mini	0-shot MRR	74.17	—	Unverified