SOTAVerified

Text-To-SQL

Text-to-SQL is a task in natural language processing (NLP) where the goal is to automatically generate SQL queries from natural language text. The task involves converting the text input into a structured representation and then using this representation to generate a semantically correct SQL query that can be executed on a database.
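
As a rough illustration of the task described above, the sketch below pairs a natural language question with a SQL query and runs it on an in-memory SQLite database. The `singer` schema, its rows, the question, and both queries are hypothetical and invented for this example. The helper `execution_match` is also an assumption: it shows one simple way to compare a predicted query against a reference query by their results, and is not the official scoring script of any benchmark listed on this page.

```python
import sqlite3

# Hypothetical schema and rows, invented purely for illustration.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE singer (
        id INTEGER PRIMARY KEY,
        name TEXT,
        country TEXT,
        age INTEGER
    );
    INSERT INTO singer (name, country, age) VALUES
        ('Ana', 'Brazil', 31),
        ('Luc', 'France', 45),
        ('Mia', 'France', 27);
""")

# Natural language input, plus the query a Text-to-SQL system might produce
# and a reference ("gold") query written by an annotator.
question = "How many singers are from France?"
predicted_sql = "SELECT COUNT(*) FROM singer WHERE country = 'France'"
gold_sql = "SELECT COUNT(id) FROM singer WHERE country = 'France'"


def run_query(connection, sql):
    """Execute a query and return all rows as a list of tuples."""
    return connection.execute(sql).fetchall()


def execution_match(connection, predicted, gold):
    """Return True if both queries yield the same rows, ignoring row order.

    Simplified assumption: real evaluation scripts handle ordering,
    duplicates, and failing queries in benchmark-specific ways.
    """
    return sorted(run_query(connection, predicted)) == sorted(run_query(connection, gold))


rows = run_query(conn, predicted_sql)
print(question, "->", rows)
print("Execution match:", execution_match(conn, predicted_sql, gold_sql))
```

The two queries differ as strings but return the same result, which is why execution-based comparison and exact string or structure matching can disagree on the same prediction.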


Papers

Showing 51-100 of 424 papers

Title | Status | Hype
SQLFixAgent: Towards Semantic-Accurate Text-to-SQL Parsing via Consistency-Enhanced Multi-Agent Collaboration | Code | 1
MAGIC: Generating Self-Correction Guideline for In-Context Text-to-SQL | Code | 1
QDA-SQL: Questions Enhanced Dialogue Augmentation for Multi-Turn Text-to-SQL | Code | 1
BookSQL: A Large Scale Text-to-SQL Dataset for Accounting Domain | Code | 1
EHR-SeqSQL : A Sequential Text-to-SQL Dataset For Interactively Exploring Electronic Health Records | Code | 1
CoE-SQL: In-Context Learning for Multi-Turn Text-to-SQL with Chain-of-Editions | Code | 1
Dubo-SQL: Diverse Retrieval-Augmented Generation and Fine Tuning for Text-to-SQL | Code | 1
TabSQLify: Enhancing Reasoning Capabilities of LLMs Through Table Decomposition | Code | 1
Retrieval augmented text-to-SQL generation for epidemiological question answering using electronic health records | Code | 1
Understanding the Effects of Noise in Text-to-SQL: An Examination of the BIRD-Bench Benchmark | Code | 1
Decomposition for Enhancing Attention: Improving LLM-based Text-to-SQL through Workflow Paradigm | Code | 1
Analyzing the Effectiveness of Large Language Models on Text-to-SQL Synthesis | Code | 1
DBCopilot: Natural Language Querying over Massive Databases via Schema Routing | Code | 1
CRUSH4SQL: Collective Retrieval Using Schema Hallucination For Text2SQL | Code | 1
ACT-SQL: In-Context Learning for Text-to-SQL with Automatically-Generated Chain-of-Thought | Code | 1
Can LLMs Effectively Leverage Graph Structural Information through Prompts, and Why? | Code | 1
C3: Zero-shot Text-to-SQL with ChatGPT | Code | 1
Improving Generalization in Language Model-Based Text-to-SQL Semantic Parsing: Two Simple Semantic Boundary-Based Techniques | Code | 1
UNITE: A Unified Benchmark for Text-to-SQL Evaluation | Code | 1
Text-to-SQL Error Correction with Language Models of Code | Code | 1
How to Prompt LLMs for Text-to-SQL: A Study in Zero-shot, Single-domain, and Cross-domain Settings | Code | 1
Learning to Simulate Natural Language Feedback for Interactive Semantic Parsing | Code | 1
Interactive Text-to-SQL Generation via Editable Step-by-Step Explanations | Code | 1
Can LLM Already Serve as A Database Interface? A BIg Bench for Large-Scale Database Grounded Text-to-SQLs | Code | 1
A comprehensive evaluation of ChatGPT's zero-shot Text-to-SQL capability | Code | 1
LEVER: Learning to Verify Language-to-Code Generation with Execution | Code | 1
Dr.Spider: A Diagnostic Evaluation Benchmark towards Text-to-SQL Robustness | Code | 1
EHRSQL: A Practical Text-to-SQL Benchmark for Electronic Health Records | Code | 1
Know What I don't Know: Handling Ambiguous and Unanswerable Questions for Text-to-SQL | Code | 1
Augmenting Multi-Turn Text-to-SQL Datasets with Self-Play | Code | 1
SpCQL: A Semantic Parsing Dataset for Converting Natural Language into Cypher | Code | 1
Recent Advances in Text-to-SQL: A Survey of What We Have and What We Expect | Code | 1
Semantic Enhanced Text-to-SQL Parsing via Iteratively Learning Schema Linking Graph | Code | 1
RASAT: Integrating Relational Structures into Pretrained Seq2Seq Model for Text-to-SQL | Code | 1
Measuring and Improving Compositional Generalization in Text-to-SQL via Component Alignment | Code | 1
DrugEHRQA: A Question Answering Dataset on Structured and Unstructured Electronic Health Records For Medicine Related Queries | Code | 1
In-Context Learning for Few-Shot Dialogue State Tracking | Code | 1
Weakly Supervised Text-to-SQL Parsing through Question Decomposition | Code | 1
SADGA: Structure-Aware Dual Graph Aggregation Network for Text-to-SQL | Code | 1
mRAT-SQL+GAP:A Portuguese Text-to-SQL Transformer | Code | 1
SPARQLing Database Queries from Intermediate Question Decompositions | Code | 1
Leveraging Table Content for Zero-shot Text-to-SQL with Meta-Learning | Code | 1
Natural SQL: Making SQL Easier to Infer from Natural Language Specifications | Code | 1
PICARD: Parsing Incrementally for Constrained Auto-Regressive Decoding from Language Models | Code | 1
Text-to-SQL in the Wild: A Naturally-Occurring Dataset Based on Stack Exchange Data | Code | 1
Towards Robustness of Text-to-SQL Models against Synonym Substitution | Code | 1
LGESQL: Line Graph Enhanced Text-to-SQL Model with Mixed Local and Non-Local Relations | Code | 1
Unlocking Compositional Generalization in Pre-trained Models Using Intermediate Representations | Code | 1
Learning to Synthesize Data for Semantic Parsing | Code | 1
An Investigation Between Schema Linking and Text-to-SQL Performance | Code | 1

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | Human Performance | Execution Accuracy (Human) | 92.96 | | Unverified
2 | XiYan-SQL | Execution Accuracy % (Test) | 75.63 | | Unverified
3 | DSAIR + GPT-4o | Execution Accuracy % (Test) | 74.12 | | Unverified
4 | CHASE-SQL + Gemini | Execution Accuracy % (Test) | 74.06 | | Unverified
5 | ExSL + granite-34b-code | Execution Accuracy % (Test) | 73.17 | | Unverified
6 | OpenSearch-SQL+ v2 + GPT-4o | Execution Accuracy % (Test) | 72.28 | | Unverified
7 | Distillery + GPT-4o | Execution Accuracy % (Test) | 71.83 | | Unverified
8 | Insights AI | Execution Accuracy % (Test) | 70.26 | | Unverified
9 | PURPLE + RED + GPT-4o | Execution Accuracy % (Test) | 70.21 | | Unverified
10 | MCTS-SQL | Execution Accuracy % (Test) | 69.4 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | XiYan-SQL | Execution Accuracy (Test) | 89.65 | | Unverified
2 | PET-SQL | Execution Accuracy (Test) | 87.6 | | Unverified
3 | datagpt-sql-7B + InvalidSQL-Feedback | Execution Accuracy (Dev) | 87.2 | | Unverified
4 | DAIL-SQL + GPT-4 + Self-Consistency | Execution Accuracy (Test) | 86.6 | | Unverified
5 | DIN-SQL + GPT-4 | Execution Accuracy (Test) | 85.3 | | Unverified
6 | datagpt-sql-7B | Execution Accuracy (Dev) | 84.8 | | Unverified
7 | MSc-SQL | Execution Accuracy (Test) | 84.7 | | Unverified
8 | MARLO + Claude 2.1 | Execution Accuracy (Test) | 84 | | Unverified
9 | C3 + ChatGPT + Zero-Shot | Execution Accuracy (Test) | 82.3 | | Unverified
10 | code-davinci-002 175B (LEVER) | Execution Accuracy (Dev) | 81.9 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Spider-Agent + o1-preview | Success Rate | 17.03 | | Unverified
2 | Spider-Agent + GPT-4o | Success Rate | 10.13 | | Unverified
3 | Spider-Agent + Claude-3.5-Sonnet | Success Rate | 9.02 | | Unverified
4 | Spider-Agent + GPT-4 | Success Rate | 8.86 | | Unverified
5 | Spider-Agent + Qwen2.5-72B | Success Rate | 6.17 | | Unverified
6 | Spider-Agent + DeepSeek-V2.5 | Success Rate | 5.22 | | Unverified
7 | Spider-Agent + Gemini-Pro-1.5 | Success Rate | 2.53 | | Unverified
8 | Spider-Agent + Llama-3.1-405B | Success Rate | 2.21 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | RASAT+PICARD | interaction match accuracy | 45.2 | | Unverified
2 | RAT-SQL-TC + GAP | interaction match accuracy | 43.2 | | Unverified
3 | HIE-SQL + GraPPa | interaction match accuracy | 42.9 | | Unverified
4 | RAT-SQL + SCoRe | interaction match accuracy | 38.1 | | Unverified
5 | EditSQL + BERT | interaction match accuracy | 25.3 | | Unverified
6 | GAZP + BERT | interaction match accuracy | 23.5 | | Unverified
7 | SyntaxSQL-con | interaction match accuracy | 5.2 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | RAT-SQL | Exact Match (EM) | 26.77 | | Unverified
2 | Edit-SQL | Exact Match (EM) | 11.73 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | T5-Large | PCM-F1 (dev) | 48.2 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | XiYan-SQL | Execution Accuracy | 69.86 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Orange-mini | 0-shot MRR | 74.17 | | Unverified