SOTAVerified

Text-To-SQL

Text-to-SQL is a task in natural language processing (NLP) where the goal is to automatically generate SQL queries from natural language text. The task involves converting the text input into a structured representation and then using this representation to generate a semantically correct SQL query that can be executed on a database.

( Image credit: SyntaxSQLNet )

Papers

Showing 251275 of 424 papers

TitleStatusHype
TrustSQL: Benchmarking Text-to-SQL Reliability with Penalty-Based ScoringCode0
Schema-Aware Multi-Task Learning for Complex Text-to-SQL0
Benchmarking the Text-to-SQL Capability of Large Language Models: A Comprehensive Evaluation0
DFIN-SQL: Integrating Focused Schema with DIN-SQL for Superior Accuracy in Large-Scale Databases0
Ar-Spider: Text-to-SQL in Arabic0
R^3: "This is My SQL, Are You With Me?" A Consensus-Based Multi-Agent System for Text-to-SQL Tasks0
Structure Guided Large Language Model for SQL Generation0
Archer: A Human-Labeled Text-to-SQL Dataset with Arithmetic, Commonsense and Hypothetical Reasoning0
Knowledge-to-SQL: Enhancing SQL Generation with Data Expert LLMCode0
MURRE: Multi-Hop Table Retrieval with Removal for Open-Domain Text-to-SQLCode0
Improving Demonstration Diversity by Human-Free Fusing for Text-to-SQLCode0
Improving Generalization in Semantic Parsing by Increasing Natural Language Variation0
Evaluating the Data Model Robustness of Text-to-SQL Systems Based on Real User QueriesCode0
AraSpider: Democratizing Arabic-to-SQLCode0
Investigating the Impact of Data Contamination of Large Language Models in Text-to-SQL Translation0
DTS-SQL: Decomposed Text-to-SQL with Small Large Language Models0
FinSQL: Model-Agnostic LLMs-based Text-to-SQL Framework for Financial Analysis0
Using LLM to select the right SQL Query from candidates0
Semantic Parsing for Complex Data Retrieval: Targeting Query Plans vs. SQL for No-Code Access to Relational Databases0
Data Transformation to Construct a Dataset for Generating Entity-Relationship Model from Natural Language0
dIR -- Discrete Information Retrieval: Conversational Search over Unstructured (and Structured) Data with Large Language Models0
LLM-SQL-Solver: Can LLMs Determine SQL Equivalence?Code0
Decoupling SQL Query Hardness Parsing for Text-to-SQL0
Domain Adaptation of a State of the Art Text-to-SQL Model: Lessons Learned and Challenges Found0
A Benchmark to Understand the Role of Knowledge Graphs on Large Language Model's Accuracy for Question Answering on Enterprise SQL Databases0
Show:102550
← PrevPage 11 of 17Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Human PerformanceExecution Accurarcy (Human)92.96Unverified
2XiYan-SQLExecution Accuracy % (Test)75.63Unverified
3DSAIR + GPT-4oExecution Accuracy % (Test)74.12Unverified
4CHASE-SQL + GeminiExecution Accuracy % (Test)74.06Unverified
5ExSL + granite-34b-codeExecution Accuracy % (Test)73.17Unverified
6OpenSearch-SQL+ v2 + GPT-4oExecution Accuracy % (Test)72.28Unverified
7Distillery + GPT-4oExecution Accuracy % (Test)71.83Unverified
8Insights AIExecution Accuracy % (Test)70.26Unverified
9PURPLE + RED + GPT-4oExecution Accuracy % (Test)70.21Unverified
10MCTS-SQLExecution Accuracy % (Test)69.4Unverified
#ModelMetricClaimedVerifiedStatus
1XiYan-SQLExecution Accuracy (Test)89.65Unverified
2PET-SQLExecution Accuracy (Test)87.6Unverified
3datagpt-sql-7B + InvalidSQL-FeedbackExecution Accuracy (Dev)87.2Unverified
4DAIL-SQL + GPT-4 + Self-ConsistencyExecution Accuracy (Test)86.6Unverified
5DIN-SQL + GPT-4Execution Accuracy (Test)85.3Unverified
6datagpt-sql-7BExecution Accuracy (Dev)84.8Unverified
7MSc-SQLExecution Accuracy (Test)84.7Unverified
8MARLO + Claude 2.1Execution Accuracy (Test)84Unverified
9C3 + ChatGPT + Zero-ShotExecution Accuracy (Test)82.3Unverified
10code-davinci-002 175B (LEVER)Execution Accuracy (Dev)81.9Unverified
#ModelMetricClaimedVerifiedStatus
1Spider-Agent + o1-previewSuccess Rate17.03Unverified
2Spider-Agent + GPT-4oSuccess Rate10.13Unverified
3Spider-Agent + Claude-3.5-SonnectSuccess Rate9.02Unverified
4Spider-Agent + GPT-4Success Rate8.86Unverified
5Spider-Agent + Qwen2.5-72BSuccess Rate6.17Unverified
6Spider-Agent + DeepSeek-V2.5Success Rate5.22Unverified
7Spider-Agent + Gemini-Pro-1.5Success Rate2.53Unverified
8Spider-Agent + Llama-3.1-405BSuccess Rate2.21Unverified
#ModelMetricClaimedVerifiedStatus
1RASAT+PICARDinteraction match accuracy45.2Unverified
2RAT-SQL-TC + GAPinteraction match accuracy43.2Unverified
3HIE-SQL + GraPPainteraction match accuracy42.9Unverified
4RAT-SQL + SCoReinteraction match accuracy38.1Unverified
5EditSQL + BERTinteraction match accuracy25.3Unverified
6GAZP + BERTinteraction match accuracy23.5Unverified
7SyntaxSQL-coninteraction match accuracy5.2Unverified
#ModelMetricClaimedVerifiedStatus
1RAT-SQLExact Match (EM)26.77Unverified
2Edit-SQLExact Match (EM)11.73Unverified
#ModelMetricClaimedVerifiedStatus
1T5-LargePCM-F1 (dev)48.2Unverified
#ModelMetricClaimedVerifiedStatus
1XiYan-SQLExecution Accuracy69.86Unverified
#ModelMetricClaimedVerifiedStatus
1Orange-mini0-shot MRR74.17Unverified