SOTAVerified

Text-To-SQL

Text-to-SQL is a task in natural language processing (NLP) where the goal is to automatically generate SQL queries from natural language text. The task involves converting the text input into a structured representation and then using this representation to generate a semantically correct SQL query that can be executed on a database.
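As a rough illustration of the task, the sketch below pairs a natural language question with a SQL query and runs it against an in-memory SQLite database. The schema, rows, example queries, and the helper name `execution_match` are all hypothetical and invented for this example. Comparing sorted result rows is only one simple way to check whether a generated query returns the same answer as a reference query; official benchmark evaluation scripts may compare results differently.

```python
import sqlite3

# Hypothetical schema and data, invented purely for illustration.
SCHEMA = """
CREATE TABLE singer (id INTEGER PRIMARY KEY, name TEXT, country TEXT, age INTEGER);
"""
ROWS = [(1, "Ana", "France", 31), (2, "Bo", "Japan", 45), (3, "Cy", "France", 27)]


def build_db():
    """Create an in-memory database holding the toy table."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(SCHEMA)
    conn.executemany("INSERT INTO singer VALUES (?, ?, ?, ?)", ROWS)
    return conn


def execution_match(conn, predicted_sql, gold_sql):
    """Return True when both queries yield the same rows, ignoring row order.

    A query that fails to execute counts as a mismatch.
    """
    try:
        pred = conn.execute(predicted_sql).fetchall()
    except sqlite3.Error:
        return False
    gold = conn.execute(gold_sql).fetchall()
    return sorted(pred) == sorted(gold)


# Input: a natural language question. Output: a SQL query for the schema above.
question = "How many singers are from France?"
gold_sql = "SELECT COUNT(*) FROM singer WHERE country = 'France'"

# Two hypothetical system outputs: one equivalent to the reference, one not.
predicted_sql = "SELECT COUNT(id) FROM singer WHERE country = 'France'"
wrong_sql = "SELECT COUNT(*) FROM singer"

conn = build_db()
gold_rows = conn.execute(gold_sql).fetchall()
match_ok = execution_match(conn, predicted_sql, gold_sql)
match_bad = execution_match(conn, wrong_sql, gold_sql)
```

Note that `predicted_sql` differs textually from `gold_sql` yet returns the same result, which is why a check based on executing the queries can accept predictions that a string comparison would reject.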


Papers

Showing 201–250 of 424 papers

| Title | Status | Hype |
| --- | --- | --- |
| VTS-LLM: Domain-Adaptive LLM Agent for Enhancing Awareness in Vessel Traffic Services through Natural Language | Code | 0 |
| When Reasoning Beats Scale: A 1.5B Reasoning Model Outranks 13B LLMs as Discriminator | Code | 0 |
| XRICL: Cross-lingual Retrieval-Augmented In-Context Learning for Cross-lingual Text-to-SQL Semantic Parsing | Code | 0 |
| Zero-shot Text-to-SQL Learning with Auxiliary Task | Code | 0 |
| Learning Metadata-Agnostic Representations for Text-to-SQL In-Context Example Selection | | 0 |
| Learning to Clarify: Multi-turn Conversations with Action-Based Contrastive Self-Training | | 0 |
| Evaluating the Text-to-SQL Capabilities of Large Language Models | | 0 |
| Evaluating LLMs for Text-to-SQL Generation With Complex SQL Workload | | 0 |
| LEDD: Large Language Model-Empowered Data Discovery in Data Lakes | | 0 |
| Leveraging Adjective-Noun Phrasing Knowledge for Comparison Relation Prediction in Text-to-SQL | | 0 |
| Leveraging Explicit Lexico-logical Alignments in Text-to-SQL Parsing | | 0 |
| Towards Generalizable and Robust Text-to-SQL Parsing | | 0 |
| Evaluating Cross-Domain Text-to-SQL Models and Benchmarks | | 0 |
| EPI-SQL: Enhancing Text-to-SQL Translation with Error-Prevention Instructions | | 0 |
| LG AI Research & KAIST at EHRSQL 2024: Self-Training Large Language Models with Pseudo-Labeled Unanswerable Questions for a Reliable Text-to-SQL System on EHRs | | 0 |
| Enhancing Text-to-SQL Capabilities of Large Language Models via Domain Database Knowledge Injection | | 0 |
| Enhancing Few-shot Text-to-SQL Capabilities of Large Language Models: A Study on Prompt Design Strategies | | 0 |
| LLM-Driven Data Generation and a Novel Soft Metric for Evaluating Text-to-SQL in Aviation MRO | | 0 |
| LLM-Powered Agents for Navigating Venice's Historical Cadastre | | 0 |
| Towards Knowledge-Intensive Text-to-SQL Semantic Parsing with Formulaic Knowledge | | 0 |
| Towards Optimizing SQL Generation via LLM Routing | | 0 |
| Lucy: Think and Reason to Solve Text-to-SQL | | 0 |
| End-to-end Text-to-SQL Generation within an Analytics Insight Engine | | 0 |
| End-to-End Cross-Domain Text-to-SQL Semantic Parsing with Auxiliary Task | | 0 |
| A Review of Cross-Domain Text-to-SQL Models | | 0 |
| EllieSQL: Cost-Efficient Text-to-SQL with Complexity-Aware Routing | | 0 |
| Makadi: A Large-Scale Human-Labeled Dataset for Hindi Semantic Parsing | | 0 |
| Making LLMs Work for Enterprise Data Tasks | | 0 |
| MCS-SQL: Leveraging Multiple Prompts and Multiple-Choice Selection For Text-to-SQL Generation | | 0 |
| MCTS-SQL: An Effective Framework for Text-to-SQL with Monte Carlo Tree Search | | 0 |
| Measuring and Improving Compositional Generalization in Text-to-SQL via Component Alignment | | 0 |
| DuSQL: A Large-Scale and Pragmatic Chinese Text-to-SQL Dataset | | 0 |
| Mention Extraction and Linking for SQL Query Generation | | 0 |
| Meta-aware Learning in text-to-SQL Large Language Model | | 0 |
| MIGA: A Unified Multi-task Generation Framework for Conversational Text-to-SQL | | 0 |
| DTS-SQL: Decomposed Text-to-SQL with Small Large Language Models | | 0 |
| DrugEHRQA: A Question Answering Dataset on Structured and Unstructured Electronic Health Records For Medicine Related Queries | | 0 |
| Domain Adaptation of a State of the Art Text-to-SQL Model: Lessons Learned and Challenges Found | | 0 |
| MT-Teql: Evaluating and Augmenting Consistency of Text-to-SQL Models with Metamorphic Testing | | 0 |
| Towards Robustness of Text-to-SQL Models Against Natural and Realistic Adversarial Table Perturbation | | 0 |
| MultiSpider: Towards Benchmarking Multilingual Text-to-SQL Semantic Parsing | | 0 |
| Natural Language Interfaces for Tabular Data Querying and Visualization: A Survey | | 0 |
| DocuT5: Seq2seq SQL Generation with Table Documentation | | 0 |
| N-Best Hypotheses Reranking for Text-To-SQL Systems | | 0 |
| Next-Generation Database Interfaces: A Survey of LLM-based Text-to-SQL | | 0 |
| Towards Robustness of Text-to-SQL Models Against Natural and Realistic Adversarial Table Perturbation | | 0 |
| Divide and Prompt: Chain of Thought Prompting for Text-to-SQL | | 0 |
| Diverse Parallel Data Synthesis for Cross-Database Adaptation of Text-to-SQL Parsers | | 0 |
| On Linearizing Structured Data in Encoder-Decoder Language Models: Insights from Text-to-SQL | | 0 |
| On the Security Vulnerabilities of Text-to-SQL Models | | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | Human Performance | Execution Accuracy (Human) | 92.96 | | Unverified |
| 2 | XiYan-SQL | Execution Accuracy % (Test) | 75.63 | | Unverified |
| 3 | DSAIR + GPT-4o | Execution Accuracy % (Test) | 74.12 | | Unverified |
| 4 | CHASE-SQL + Gemini | Execution Accuracy % (Test) | 74.06 | | Unverified |
| 5 | ExSL + granite-34b-code | Execution Accuracy % (Test) | 73.17 | | Unverified |
| 6 | OpenSearch-SQL+ v2 + GPT-4o | Execution Accuracy % (Test) | 72.28 | | Unverified |
| 7 | Distillery + GPT-4o | Execution Accuracy % (Test) | 71.83 | | Unverified |
| 8 | Insights AI | Execution Accuracy % (Test) | 70.26 | | Unverified |
| 9 | PURPLE + RED + GPT-4o | Execution Accuracy % (Test) | 70.21 | | Unverified |
| 10 | MCTS-SQL | Execution Accuracy % (Test) | 69.4 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | XiYan-SQL | Execution Accuracy (Test) | 89.65 | | Unverified |
| 2 | PET-SQL | Execution Accuracy (Test) | 87.6 | | Unverified |
| 3 | datagpt-sql-7B + InvalidSQL-Feedback | Execution Accuracy (Dev) | 87.2 | | Unverified |
| 4 | DAIL-SQL + GPT-4 + Self-Consistency | Execution Accuracy (Test) | 86.6 | | Unverified |
| 5 | DIN-SQL + GPT-4 | Execution Accuracy (Test) | 85.3 | | Unverified |
| 6 | datagpt-sql-7B | Execution Accuracy (Dev) | 84.8 | | Unverified |
| 7 | MSc-SQL | Execution Accuracy (Test) | 84.7 | | Unverified |
| 8 | MARLO + Claude 2.1 | Execution Accuracy (Test) | 84 | | Unverified |
| 9 | C3 + ChatGPT + Zero-Shot | Execution Accuracy (Test) | 82.3 | | Unverified |
| 10 | code-davinci-002 175B (LEVER) | Execution Accuracy (Dev) | 81.9 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | Spider-Agent + o1-preview | Success Rate | 17.03 | | Unverified |
| 2 | Spider-Agent + GPT-4o | Success Rate | 10.13 | | Unverified |
| 3 | Spider-Agent + Claude-3.5-Sonnet | Success Rate | 9.02 | | Unverified |
| 4 | Spider-Agent + GPT-4 | Success Rate | 8.86 | | Unverified |
| 5 | Spider-Agent + Qwen2.5-72B | Success Rate | 6.17 | | Unverified |
| 6 | Spider-Agent + DeepSeek-V2.5 | Success Rate | 5.22 | | Unverified |
| 7 | Spider-Agent + Gemini-Pro-1.5 | Success Rate | 2.53 | | Unverified |
| 8 | Spider-Agent + Llama-3.1-405B | Success Rate | 2.21 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | RASAT+PICARD | Interaction match accuracy | 45.2 | | Unverified |
| 2 | RAT-SQL-TC + GAP | Interaction match accuracy | 43.2 | | Unverified |
| 3 | HIE-SQL + GraPPa | Interaction match accuracy | 42.9 | | Unverified |
| 4 | RAT-SQL + SCoRe | Interaction match accuracy | 38.1 | | Unverified |
| 5 | EditSQL + BERT | Interaction match accuracy | 25.3 | | Unverified |
| 6 | GAZP + BERT | Interaction match accuracy | 23.5 | | Unverified |
| 7 | SyntaxSQL-con | Interaction match accuracy | 5.2 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | RAT-SQL | Exact Match (EM) | 26.77 | | Unverified |
| 2 | Edit-SQL | Exact Match (EM) | 11.73 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | T5-Large | PCM-F1 (dev) | 48.2 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | XiYan-SQL | Execution Accuracy | 69.86 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | Orange-mini | 0-shot MRR | 74.17 | | Unverified |