SOTAVerified

Text-To-SQL

Text-to-SQL is a task in natural language processing (NLP) where the goal is to automatically generate SQL queries from natural language text. The task involves converting the text input into a structured representation and then using this representation to generate a semantically correct SQL query that can be executed on a database.

( Image credit: SyntaxSQLNet )

Papers

Showing 176200 of 424 papers

TitleStatusHype
Retrieval augmented text-to-SQL generation for epidemiological question answering using electronic health recordsCode1
PET-SQL: A Prompt-Enhanced Two-Round Refinement of Text-to-SQL with Cross-consistencyCode2
Schema-Aware Multi-Task Learning for Complex Text-to-SQL0
Benchmarking the Text-to-SQL Capability of Large Language Models: A Comprehensive Evaluation0
DFIN-SQL: Integrating Focused Schema with DIN-SQL for Superior Accuracy in Large-Scale Databases0
CodeS: Towards Building Open-source Language Models for Text-to-SQLCode2
Ar-Spider: Text-to-SQL in Arabic0
R^3: "This is My SQL, Are You With Me?" A Consensus-Based Multi-Agent System for Text-to-SQL Tasks0
Structure Guided Large Language Model for SQL Generation0
Archer: A Human-Labeled Text-to-SQL Dataset with Arithmetic, Commonsense and Hypothetical Reasoning0
Understanding the Effects of Noise in Text-to-SQL: An Examination of the BIRD-Bench BenchmarkCode1
Knowledge-to-SQL: Enhancing SQL Generation with Data Expert LLMCode0
Decomposition for Enhancing Attention: Improving LLM-based Text-to-SQL through Workflow ParadigmCode1
Improving Demonstration Diversity by Human-Free Fusing for Text-to-SQLCode0
MURRE: Multi-Hop Table Retrieval with Removal for Open-Domain Text-to-SQLCode0
When is Tree Search Useful for LLM Planning? It Depends on the DiscriminatorCode2
Improving Generalization in Semantic Parsing by Increasing Natural Language Variation0
Evaluating the Data Model Robustness of Text-to-SQL Systems Based on Real User QueriesCode0
Investigating the Impact of Data Contamination of Large Language Models in Text-to-SQL Translation0
AraSpider: Democratizing Arabic-to-SQLCode0
DTS-SQL: Decomposed Text-to-SQL with Small Large Language Models0
Analyzing the Effectiveness of Large Language Models on Text-to-SQL SynthesisCode1
FinSQL: Model-Agnostic LLMs-based Text-to-SQL Framework for Financial Analysis0
Using LLM to select the right SQL Query from candidates0
Semantic Parsing for Complex Data Retrieval: Targeting Query Plans vs. SQL for No-Code Access to Relational Databases0
Show:102550
← PrevPage 8 of 17Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Human PerformanceExecution Accurarcy (Human)92.96Unverified
2XiYan-SQLExecution Accuracy % (Test)75.63Unverified
3DSAIR + GPT-4oExecution Accuracy % (Test)74.12Unverified
4CHASE-SQL + GeminiExecution Accuracy % (Test)74.06Unverified
5ExSL + granite-34b-codeExecution Accuracy % (Test)73.17Unverified
6OpenSearch-SQL+ v2 + GPT-4oExecution Accuracy % (Test)72.28Unverified
7Distillery + GPT-4oExecution Accuracy % (Test)71.83Unverified
8Insights AIExecution Accuracy % (Test)70.26Unverified
9PURPLE + RED + GPT-4oExecution Accuracy % (Test)70.21Unverified
10MCTS-SQLExecution Accuracy % (Test)69.4Unverified
#ModelMetricClaimedVerifiedStatus
1XiYan-SQLExecution Accuracy (Test)89.65Unverified
2PET-SQLExecution Accuracy (Test)87.6Unverified
3datagpt-sql-7B + InvalidSQL-FeedbackExecution Accuracy (Dev)87.2Unverified
4DAIL-SQL + GPT-4 + Self-ConsistencyExecution Accuracy (Test)86.6Unverified
5DIN-SQL + GPT-4Execution Accuracy (Test)85.3Unverified
6datagpt-sql-7BExecution Accuracy (Dev)84.8Unverified
7MSc-SQLExecution Accuracy (Test)84.7Unverified
8MARLO + Claude 2.1Execution Accuracy (Test)84Unverified
9C3 + ChatGPT + Zero-ShotExecution Accuracy (Test)82.3Unverified
10code-davinci-002 175B (LEVER)Execution Accuracy (Dev)81.9Unverified
#ModelMetricClaimedVerifiedStatus
1Spider-Agent + o1-previewSuccess Rate17.03Unverified
2Spider-Agent + GPT-4oSuccess Rate10.13Unverified
3Spider-Agent + Claude-3.5-SonnectSuccess Rate9.02Unverified
4Spider-Agent + GPT-4Success Rate8.86Unverified
5Spider-Agent + Qwen2.5-72BSuccess Rate6.17Unverified
6Spider-Agent + DeepSeek-V2.5Success Rate5.22Unverified
7Spider-Agent + Gemini-Pro-1.5Success Rate2.53Unverified
8Spider-Agent + Llama-3.1-405BSuccess Rate2.21Unverified
#ModelMetricClaimedVerifiedStatus
1RASAT+PICARDinteraction match accuracy45.2Unverified
2RAT-SQL-TC + GAPinteraction match accuracy43.2Unverified
3HIE-SQL + GraPPainteraction match accuracy42.9Unverified
4RAT-SQL + SCoReinteraction match accuracy38.1Unverified
5EditSQL + BERTinteraction match accuracy25.3Unverified
6GAZP + BERTinteraction match accuracy23.5Unverified
7SyntaxSQL-coninteraction match accuracy5.2Unverified
#ModelMetricClaimedVerifiedStatus
1RAT-SQLExact Match (EM)26.77Unverified
2Edit-SQLExact Match (EM)11.73Unverified
#ModelMetricClaimedVerifiedStatus
1T5-LargePCM-F1 (dev)48.2Unverified
#ModelMetricClaimedVerifiedStatus
1XiYan-SQLExecution Accuracy69.86Unverified
#ModelMetricClaimedVerifiedStatus
1Orange-mini0-shot MRR74.17Unverified