Text-To-SQL

Text-to-SQL is a natural language processing (NLP) task whose goal is to automatically generate SQL queries from natural language text. The input text is converted into a structured representation, which is then used to generate a semantically correct SQL query that can be executed on a database.
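The two stages described above (text to structured representation, then representation to SQL) can be illustrated with a deliberately tiny sketch. Everything here is an assumption made for illustration: the single question pattern, the `parse_question` and `render_sql` helpers, and the `employees` table are hypothetical, and real systems typically replace the regex with a learned model.

```python
import re
import sqlite3

def parse_question(question):
    """Stage 1: map the text to a structured representation (a plain dict).

    Only one toy pattern is covered: 'how many <table> have <column> over <n>'.
    """
    m = re.fullmatch(r"how many (\w+) have (\w+) over (\d+)\??",
                     question.strip().lower())
    if m is None:
        raise ValueError("question not covered by this toy pattern")
    table, column, threshold = m.groups()
    return {"agg": "COUNT", "table": table, "column": column,
            "op": ">", "value": int(threshold)}

def render_sql(rep):
    """Stage 2: turn the representation into SQL plus bound parameters."""
    # Identifiers were matched by \w+ above; the literal value is bound with '?'.
    sql = (f'SELECT {rep["agg"]}(*) FROM "{rep["table"]}" '
           f'WHERE "{rep["column"]}" {rep["op"]} ?')
    return sql, (rep["value"],)

def answer(conn, question):
    """Run both stages and execute the generated query on the database."""
    sql, params = render_sql(parse_question(question))
    return conn.execute(sql, params).fetchone()[0]

# Hypothetical in-memory database used only for this example.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE employees (name TEXT, salary INTEGER)")
conn.executemany("INSERT INTO employees VALUES (?, ?)",
                 [("Ann", 120), ("Bob", 80), ("Cy", 150)])

result = answer(conn, "How many employees have salary over 100?")
print(result)  # 2
```

Keeping the intermediate representation separate from the SQL rendering makes it possible to validate the representation against the schema before any query is executed.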

Papers

Showing 251–275 of 424 papers

Title	Date	Tasks	Status
TrustSQL: Benchmarking Text-to-SQL Reliability with Penalty-Based Scoring	Mar 23, 2024	Benchmarking, Text to SQL	Code Available
Schema-Aware Multi-Task Learning for Complex Text-to-SQL	Mar 9, 2024	Decoder, Multi-Task Learning	Unverified
Benchmarking the Text-to-SQL Capability of Large Language Models: A Comprehensive Evaluation	Mar 5, 2024	Benchmarking, In-Context Learning	Unverified
DFIN-SQL: Integrating Focused Schema with DIN-SQL for Superior Accuracy in Large-Scale Databases	Mar 1, 2024	In-Context Learning, Natural Language Queries	Unverified
Ar-Spider: Text-to-SQL in Arabic	Feb 22, 2024	Semantic Parsing, Text to SQL	Unverified
R^3: "This is My SQL, Are You With Me?" A Consensus-Based Multi-Agent System for Text-to-SQL Tasks	Feb 20, 2024	Text to SQL, Text-To-SQL	Unverified
Structure Guided Large Language Model for SQL Generation	Feb 19, 2024	Language Modeling, Language Modelling	Unverified
Archer: A Human-Labeled Text-to-SQL Dataset with Arithmetic, Commonsense and Hypothetical Reasoning	Feb 19, 2024	Text to SQL, Text-To-SQL	Unverified
Knowledge-to-SQL: Enhancing SQL Generation with Data Expert LLM	Feb 18, 2024	Text to SQL, Text-To-SQL	Code Available
MURRE: Multi-Hop Table Retrieval with Removal for Open-Domain Text-to-SQL	Feb 16, 2024	Open-Domain Question Answering, Question Answering	Code Available
Improving Demonstration Diversity by Human-Free Fusing for Text-to-SQL	Feb 16, 2024	Diversity, In-Context Learning	Code Available
Improving Generalization in Semantic Parsing by Increasing Natural Language Variation	Feb 13, 2024	Data Augmentation, Semantic Parsing	Unverified
Evaluating the Data Model Robustness of Text-to-SQL Systems Based on Real User Queries	Feb 13, 2024	Language Modelling, Text to SQL	Code Available
AraSpider: Democratizing Arabic-to-SQL	Feb 12, 2024	Text to SQL, Text-To-SQL	Code Available
Investigating the Impact of Data Contamination of Large Language Models in Text-to-SQL Translation	Feb 12, 2024	Instruction Following, Text to SQL	Unverified
DTS-SQL: Decomposed Text-to-SQL with Small Large Language Models	Feb 2, 2024	Text to SQL, Text-To-SQL	Unverified
FinSQL: Model-Agnostic LLMs-based Text-to-SQL Framework for Financial Analysis	Jan 19, 2024	Financial Analysis, Language Modelling	Unverified
Using LLM to select the right SQL Query from candidates	Jan 4, 2024	Code Generation, Text to SQL	Unverified
Semantic Parsing for Complex Data Retrieval: Targeting Query Plans vs. SQL for No-Code Access to Relational Databases	Dec 22, 2023	Retrieval, Semantic Parsing	Unverified
Data Transformation to Construct a Dataset for Generating Entity-Relationship Model from Natural Language	Dec 21, 2023	Text to SQL, Text-To-SQL	Unverified
dIR -- Discrete Information Retrieval: Conversational Search over Unstructured (and Structured) Data with Large Language Models	Dec 20, 2023	Conversational Search, Information Retrieval	Unverified
LLM-SQL-Solver: Can LLMs Determine SQL Equivalence?	Dec 16, 2023	Question Answering, Text to SQL	Code Available
Decoupling SQL Query Hardness Parsing for Text-to-SQL	Dec 11, 2023	Language Modeling, Language Modelling	Unverified
Domain Adaptation of a State of the Art Text-to-SQL Model: Lessons Learned and Challenges Found	Dec 9, 2023	Domain Adaptation, Language Modeling	Unverified
A Benchmark to Understand the Role of Knowledge Graphs on Large Language Model's Accuracy for Question Answering on Enterprise SQL Databases	Nov 13, 2023	Knowledge Graphs, Question Answering	Unverified

Datasets: BIRD (BIg Bench for LaRge-scale Database Grounded Text-to-SQL Evaluation), Spider, Spider 2.0, SParC, KaggleDBQA, SEDE, SQL-Eval, Text-To-SQL

Benchmark Results

BIRD

#	Model	Metric	Claimed	Verified	Status
1	Human Performance	Execution Accuracy (Human)	92.96	—	Unverified
2	XiYan-SQL	Execution Accuracy % (Test)	75.63	—	Unverified
3	DSAIR + GPT-4o	Execution Accuracy % (Test)	74.12	—	Unverified
4	CHASE-SQL + Gemini	Execution Accuracy % (Test)	74.06	—	Unverified
5	ExSL + granite-34b-code	Execution Accuracy % (Test)	73.17	—	Unverified
6	OpenSearch-SQL+ v2 + GPT-4o	Execution Accuracy % (Test)	72.28	—	Unverified
7	Distillery + GPT-4o	Execution Accuracy % (Test)	71.83	—	Unverified
8	Insights AI	Execution Accuracy % (Test)	70.26	—	Unverified
9	PURPLE + RED + GPT-4o	Execution Accuracy % (Test)	70.21	—	Unverified
10	MCTS-SQL	Execution Accuracy % (Test)	69.4	—	Unverified

Spider

#	Model	Metric	Claimed	Verified	Status
1	XiYan-SQL	Execution Accuracy (Test)	89.65	—	Unverified
2	PET-SQL	Execution Accuracy (Test)	87.6	—	Unverified
3	datagpt-sql-7B + InvalidSQL-Feedback	Execution Accuracy (Dev)	87.2	—	Unverified
4	DAIL-SQL + GPT-4 + Self-Consistency	Execution Accuracy (Test)	86.6	—	Unverified
5	DIN-SQL + GPT-4	Execution Accuracy (Test)	85.3	—	Unverified
6	datagpt-sql-7B	Execution Accuracy (Dev)	84.8	—	Unverified
7	MSc-SQL	Execution Accuracy (Test)	84.7	—	Unverified
8	MARLO + Claude 2.1	Execution Accuracy (Test)	84	—	Unverified
9	C3 + ChatGPT + Zero-Shot	Execution Accuracy (Test)	82.3	—	Unverified
10	code-davinci-002 175B (LEVER)	Execution Accuracy (Dev)	81.9	—	Unverified

Spider 2.0

#	Model	Metric	Claimed	Verified	Status
1	Spider-Agent + o1-preview	Success Rate	17.03	—	Unverified
2	Spider-Agent + GPT-4o	Success Rate	10.13	—	Unverified
3	Spider-Agent + Claude-3.5-Sonnet	Success Rate	9.02	—	Unverified
4	Spider-Agent + GPT-4	Success Rate	8.86	—	Unverified
5	Spider-Agent + Qwen2.5-72B	Success Rate	6.17	—	Unverified
6	Spider-Agent + DeepSeek-V2.5	Success Rate	5.22	—	Unverified
7	Spider-Agent + Gemini-Pro-1.5	Success Rate	2.53	—	Unverified
8	Spider-Agent + Llama-3.1-405B	Success Rate	2.21	—	Unverified

SParC

#	Model	Metric	Claimed	Verified	Status
1	RASAT+PICARD	interaction match accuracy	45.2	—	Unverified
2	RAT-SQL-TC + GAP	interaction match accuracy	43.2	—	Unverified
3	HIE-SQL + GraPPa	interaction match accuracy	42.9	—	Unverified
4	RAT-SQL + SCoRe	interaction match accuracy	38.1	—	Unverified
5	EditSQL + BERT	interaction match accuracy	25.3	—	Unverified
6	GAZP + BERT	interaction match accuracy	23.5	—	Unverified
7	SyntaxSQL-con	interaction match accuracy	5.2	—	Unverified

KaggleDBQA

#	Model	Metric	Claimed	Verified	Status
1	RAT-SQL	Exact Match (EM)	26.77	—	Unverified
2	Edit-SQL	Exact Match (EM)	11.73	—	Unverified

SEDE

#	Model	Metric	Claimed	Verified	Status
1	T5-Large	PCM-F1 (dev)	48.2	—	Unverified

SQL-Eval

#	Model	Metric	Claimed	Verified	Status
1	XiYan-SQL	Execution Accuracy	69.86	—	Unverified

Text-To-SQL

#	Model	Metric	Claimed	Verified	Status
1	Orange-mini	0-shot MRR	74.17	—	Unverified
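
Several of the results above are reported as Execution Accuracy. As a rough illustration of the idea, the sketch below runs each predicted query and its gold query against the same database and counts a hit when both return the same rows. This is a simplified assumption, not any benchmark's official evaluation script: the `run` and `execution_accuracy` helpers, the `singers` table, and the choice to ignore row order are all hypothetical, and official scorers may treat ordering, duplicates, and timeouts differently.

```python
import sqlite3
from collections import Counter

def run(conn, sql):
    """Execute a query and return its rows as a multiset, or None on SQL errors."""
    try:
        return Counter(conn.execute(sql).fetchall())
    except sqlite3.Error:
        return None

def execution_accuracy(conn, pairs):
    """Percentage of (predicted, gold) pairs whose result rows match."""
    hits = 0
    for predicted, gold in pairs:
        gold_rows = run(conn, gold)
        pred_rows = run(conn, predicted)
        # A prediction that fails to execute never counts as a hit.
        if pred_rows is not None and pred_rows == gold_rows:
            hits += 1
    return 100.0 * hits / len(pairs)

# Hypothetical database and query pairs, for illustration only.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE singers (name TEXT, age INTEGER)")
conn.executemany("INSERT INTO singers VALUES (?, ?)",
                 [("A", 25), ("B", 35), ("C", 45)])

pairs = [
    # Different SQL text, same result rows: counted as correct.
    ("SELECT name FROM singers WHERE age >= 30",
     "SELECT name FROM singers WHERE age > 29"),
    ("SELECT COUNT(*) FROM singers",
     "SELECT COUNT(name) FROM singers"),
    # Misspelled column: the predicted query fails to execute.
    ("SELECT nam FROM singers",
     "SELECT name FROM singers"),
    # Executes, but returns different rows.
    ("SELECT name FROM singers WHERE age > 40",
     "SELECT name FROM singers WHERE age > 30"),
]

score = execution_accuracy(conn, pairs)
print(score)  # 50.0
```

Unlike exact-match metrics, this comparison accepts queries that differ textually from the gold query as long as they produce the same result on the evaluation database.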