SOTAVerified

Text-To-SQL

Text-to-SQL is a task in natural language processing (NLP) where the goal is to automatically generate SQL queries from natural language text. The task involves converting the text input into a structured representation and then using this representation to generate a semantically correct SQL query that can be executed on a database.

( Image credit: SyntaxSQLNet )

Papers

Showing 150 of 424 papers

TitleStatusHype
Demonstration of DB-GPT: Next Generation Data Interaction System Empowered by Large Language ModelsCode11
A Survey of Text-to-SQL in the Era of LLMs: Where are we, and where are we going?Code5
XiYan-SQL: A Novel Multi-Generator Framework For Text-to-SQLCode4
A Preview of XiYan-SQL: A Multi-Generator Ensemble Framework for Text-to-SQLCode4
Text2SQL is Not Enough: Unifying AI and Databases with TAGCode4
Arctic-Text2SQL-R1: Simple Rewards, Strong Reasoning in Text-to-SQLCode3
Graph-Reward-SQL: Execution-Free Reinforcement Learning for Text-to-SQL via Graph Matching and Stepwise RewardCode3
ExCoT: Optimizing Reasoning for Text-to-SQL with Execution FeedbackCode3
OmniSQL: Synthesizing High-quality Text-to-SQL Data at ScaleCode3
Cognify: Supercharging Gen-AI Workflows With Hierarchical AutotuningCode3
SuffixDecoding: Extreme Speculative Decoding for Emerging AI ApplicationsCode3
CHESS: Contextual Harnessing for Efficient SQL SynthesisCode3
Reasoning-Table: Exploring Reinforcement Learning for Table ReasoningCode2
CSC-SQL: Corrective Self-Consistency in Text-to-SQL via Reinforcement LearningCode2
LinkAlign: Scalable Schema Linking for Real-World Large-Scale Multi-Database Text-to-SQLCode2
Datrics Text2SQL. A Framework for Natural Language to SQL Query GenerationCode2
Automatic database description generation for Text-to-SQLCode2
SQL-o1: A Self-Reward Heuristic Dynamic Search Method for Text-to-SQLCode2
RSL-SQL: Robust Schema Linking in Text-to-SQL GenerationCode2
E-SQL: Direct Schema Linking via Question Enrichment in Text-to-SQLCode2
Before Generation, Align it! A Novel and Effective Strategy for Mitigating Hallucinations in Text-to-SQL GenerationCode2
Overview of the EHRSQL 2024 Shared Task on Reliable Text-to-SQL Modeling on Electronic Health RecordsCode2
PET-SQL: A Prompt-Enhanced Two-Round Refinement of Text-to-SQL with Cross-consistencyCode2
CodeS: Towards Building Open-source Language Models for Text-to-SQLCode2
When is Tree Search Useful for LLM Planning? It Depends on the DiscriminatorCode2
MAC-SQL: A Multi-Agent Collaborative Framework for Text-to-SQLCode2
Text-to-SQL Empowered by Large Language Models: A Benchmark EvaluationCode2
DIN-SQL: Decomposed In-Context Learning of Text-to-SQL with Self-CorrectionCode2
RESDSQL: Decoupling Schema Linking and Skeleton Parsing for Text-to-SQLCode2
SeaD: End-to-end Text-to-SQL Generation with Schema-aware DenoisingCode2
SeqGenSQL -- A Robust Sequence Generation Model for Structured Query LanguageCode2
TableQA: a Large-Scale Chinese Text-to-SQL Dataset for Table-Aware SQL GenerationCode2
A Pilot Study for Chinese SQL Semantic ParsingCode2
Spider: A Large-Scale Human-Labeled Dataset for Complex and Cross-Domain Semantic Parsing and Text-to-SQL TaskCode2
SQLNet: Generating Structured Queries From Natural Language Without Reinforcement LearningCode2
Schema-R1: A reasoning training approach for schema linking in Text-to-SQL TaskCode1
SEED: Enhancing Text-to-SQL Performance and Practical Usability Through Automatic Evidence GenerationCode1
ExeSQL: Self-Taught Text-to-SQL Models with Execution-Driven Bootstrapping for SQL DialectsCode1
Reward-SQL: Boosting Text-to-SQL via Stepwise Reasoning and Process-Supervised RewardsCode1
Task-Circuit Quantization: Leveraging Knowledge Localization and Interpretability for CompressionCode1
ELT-Bench: An End-to-End Benchmark for Evaluating AI Agents on ELT PipelinesCode1
Uncovering the Impact of Chain-of-Thought Reasoning for Direct Preference Optimization: Lessons from Text-to-SQLCode1
BASE-SQL: A powerful open source Text-To-SQL baseline approachCode1
A Study of In-Context-Learning-Based Text-to-SQL ErrorsCode1
ROUTE: Robust Multitask Tuning and Collaboration for Text-to-SQLCode1
Towards Automated Cross-domain Exploratory Data Analysis through Large Language ModelsCode1
MSc-SQL: Multi-Sample Critiquing Small Language Models For Text-To-SQL TranslationCode1
FLEX: Expert-level False-Less EXecution Metric for Reliable Text-to-SQL BenchmarkCode1
MAG-SQL: Multi-Agent Generative Approach with Soft Schema Linking and Iterative Sub-SQL Refinement for Text-to-SQLCode1
AMBROSIA: A Benchmark for Parsing Ambiguous Questions into Database QueriesCode1
Show:102550
← PrevPage 1 of 9Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Human PerformanceExecution Accurarcy (Human)92.96Unverified
2XiYan-SQLExecution Accuracy % (Test)75.63Unverified
3DSAIR + GPT-4oExecution Accuracy % (Test)74.12Unverified
4CHASE-SQL + GeminiExecution Accuracy % (Test)74.06Unverified
5ExSL + granite-34b-codeExecution Accuracy % (Test)73.17Unverified
6OpenSearch-SQL+ v2 + GPT-4oExecution Accuracy % (Test)72.28Unverified
7Distillery + GPT-4oExecution Accuracy % (Test)71.83Unverified
8Insights AIExecution Accuracy % (Test)70.26Unverified
9PURPLE + RED + GPT-4oExecution Accuracy % (Test)70.21Unverified
10MCTS-SQLExecution Accuracy % (Test)69.4Unverified
#ModelMetricClaimedVerifiedStatus
1XiYan-SQLExecution Accuracy (Test)89.65Unverified
2PET-SQLExecution Accuracy (Test)87.6Unverified
3datagpt-sql-7B + InvalidSQL-FeedbackExecution Accuracy (Dev)87.2Unverified
4DAIL-SQL + GPT-4 + Self-ConsistencyExecution Accuracy (Test)86.6Unverified
5DIN-SQL + GPT-4Execution Accuracy (Test)85.3Unverified
6datagpt-sql-7BExecution Accuracy (Dev)84.8Unverified
7MSc-SQLExecution Accuracy (Test)84.7Unverified
8MARLO + Claude 2.1Execution Accuracy (Test)84Unverified
9C3 + ChatGPT + Zero-ShotExecution Accuracy (Test)82.3Unverified
10code-davinci-002 175B (LEVER)Execution Accuracy (Dev)81.9Unverified
#ModelMetricClaimedVerifiedStatus
1Spider-Agent + o1-previewSuccess Rate17.03Unverified
2Spider-Agent + GPT-4oSuccess Rate10.13Unverified
3Spider-Agent + Claude-3.5-SonnectSuccess Rate9.02Unverified
4Spider-Agent + GPT-4Success Rate8.86Unverified
5Spider-Agent + Qwen2.5-72BSuccess Rate6.17Unverified
6Spider-Agent + DeepSeek-V2.5Success Rate5.22Unverified
7Spider-Agent + Gemini-Pro-1.5Success Rate2.53Unverified
8Spider-Agent + Llama-3.1-405BSuccess Rate2.21Unverified
#ModelMetricClaimedVerifiedStatus
1RASAT+PICARDinteraction match accuracy45.2Unverified
2RAT-SQL-TC + GAPinteraction match accuracy43.2Unverified
3HIE-SQL + GraPPainteraction match accuracy42.9Unverified
4RAT-SQL + SCoReinteraction match accuracy38.1Unverified
5EditSQL + BERTinteraction match accuracy25.3Unverified
6GAZP + BERTinteraction match accuracy23.5Unverified
7SyntaxSQL-coninteraction match accuracy5.2Unverified
#ModelMetricClaimedVerifiedStatus
1RAT-SQLExact Match (EM)26.77Unverified
2Edit-SQLExact Match (EM)11.73Unverified
#ModelMetricClaimedVerifiedStatus
1T5-LargePCM-F1 (dev)48.2Unverified
#ModelMetricClaimedVerifiedStatus
1XiYan-SQLExecution Accuracy69.86Unverified
#ModelMetricClaimedVerifiedStatus
1Orange-mini0-shot MRR74.17Unverified