SOTAVerified

Semantic Parsing

Semantic Parsing is the task of transducing natural language utterances into formal meaning representations. The target meaning representations can be defined according to a wide variety of formalisms. This include linguistically-motivated semantic representations that are designed to capture the meaning of any sentence such as λ-calculus or the abstract meaning representations. Alternatively, for more task-driven approaches to Semantic Parsing, it is common for meaning representations to represent executable programs such as SQL queries, robotic commands, smart phone instructions, and even general-purpose programming languages like Python and Java.

Source: Tranx: A Transition-based Neural Abstract Syntax Parser for Semantic Parsing and Code Generation

Papers

Showing 125 of 1202 papers

TitleStatusHype
Recognize Anything: A Strong Image Tagging ModelCode4
Beyond Alignment: Blind Video Face Restoration via Parsing-Guided Temporal-Coherent TransformerCode3
N-LTP: An Open-source Neural Language Technology Platform for ChineseCode3
Non-Autoregressive Semantic Parsing for Compositional Task-Oriented DialogCode3
ThingTalk: An Extensible, Executable Representation Language for Task-Oriented DialoguesCode2
Exploring the Limits of Transfer Learning with a Unified Text-to-Text TransformerCode2
OpenICL: An Open-Source Framework for In-context LearningCode2
Spider: A Large-Scale Human-Labeled Dataset for Complex and Cross-Domain Semantic Parsing and Text-to-SQL TaskCode2
Binding Language Models in Symbolic LanguagesCode2
Chain-of-Table: Evolving Tables in the Reasoning Chain for Table UnderstandingCode2
RESDSQL: Decoupling Schema Linking and Skeleton Parsing for Text-to-SQLCode2
A Pilot Study for Chinese SQL Semantic ParsingCode2
ChatKBQA: A Generate-then-Retrieve Framework for Knowledge Base Question Answering with Fine-tuned Large Language ModelsCode2
UnifiedSKG: Unifying and Multi-Tasking Structured Knowledge Grounding with Text-to-Text Language ModelsCode2
Calibrated Interpretation: Confidence Estimation in Semantic ParsingCode1
CABINET: Content Relevance based Noise Reduction for Table Question AnsweringCode1
Can LLM Already Serve as A Database Interface? A BIg Bench for Large-Scale Database Grounded Text-to-SQLsCode1
Break It Down: A Question Understanding BenchmarkCode1
Bidirectional Attentive Memory Networks for Question Answering over Knowledge BasesCode1
Abstract Meaning Representation Guided Graph Encoding and Decoding for Joint Information ExtractionCode1
Bridging Textual and Tabular Data for Cross-Domain Text-to-SQL Semantic ParsingCode1
AutoQA: From Databases To QA Semantic Parsers With Only Synthetic Training DataCode1
A Survey on Non-Autoregressive Generation for Neural Machine Translation and BeyondCode1
BenchCLAMP: A Benchmark for Evaluating Language Models on Syntactic and Semantic ParsingCode1
A Pilot Study of Text-to-SQL Semantic Parsing for VietnameseCode1
Show:102550
← PrevPage 1 of 49Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ARTEMIS-DAAccuracy (Test)80.8Unverified
2SynTQA (Oracle)Test Accuracy77.5Unverified
3TabLaPAccuracy (Test)76.6Unverified
4SynTQA (GPT)Accuracy (Test)74.4Unverified
5Mix SCAccuracy (Test)73.6Unverified
6SynTQA (RF)Accuracy (Test)71.6Unverified
7CABINETAccuracy (Test)69.1Unverified
8NormTab+TabSQLifyAccuracy (Test)68.63Unverified
9Chain-of-TableAccuracy (Test)67.31Unverified
10Tab-PoTAccuracy (Test)66.78Unverified
#ModelMetricClaimedVerifiedStatus
1RESDSQL-3B + NatSQLAccuracy84.1Unverified
2code-davinci-002 175B (LEVER)Accuracy81.9Unverified
3RASAT+PICARDAccuracy75.5Unverified
4Graphix-3B + PICARDAccuracy74Unverified
5T5-3B + PICARDAccuracy71.9Unverified
6SADGA + GAPAccuracy70.1Unverified
7RATSQL + GAPAccuracy69.7Unverified
8RATSQL + Grammar-Augmented Pre-TrainingAccuracy69.6Unverified
9RATSQL + BERTAccuracy65.6Unverified
10Exact Set MatchingAccuracy19.7Unverified
#ModelMetricClaimedVerifiedStatus
1Dynamic Least-to-Most PromptingExact Match95Unverified
2LeARExact Match90.9Unverified
3T5-3B w/ Intermediate RepresentationsExact Match83.8Unverified
4Hierarchical Poset DecodingExact Match69Unverified
5Universal TransformerExact Match18.9Unverified
#ModelMetricClaimedVerifiedStatus
1ReaRevAccuracy76.4Unverified
2NSM+hAccuracy74.3Unverified
3CBR-KBQAAccuracy70Unverified
4STAGG (Yih et al., 2016)Accuracy63.9Unverified
5T5-11B (Raffel et al., 2020)Accuracy56.5Unverified
#ModelMetricClaimedVerifiedStatus
1CABINETDenotation accuracy (test)89.5Unverified
2TAPEX-Large (weak supervision)Denotation accuracy (test)89.5Unverified
3ReasTAP-Large (weak supervision)Denotation accuracy (test)89.2Unverified
4NL2SQL-BERTAccuracy89Unverified
5TAPAS-Large (weak supervision)Denotation accuracy (test)83.6Unverified
#ModelMetricClaimedVerifiedStatus
1PhraseTransformerAccuracy90.4Unverified
2TranxAccuracy86.2Unverified
3ASN (Rabinovich et al., 2017)Accuracy85.3Unverified
4ZH15 (Zhao and Huang, 2015)Accuracy84.2Unverified
#ModelMetricClaimedVerifiedStatus
1coarse2fineAccuracy88.2Unverified
2PhraseTransformerAccuracy87.9Unverified
3TranxAccuracy87.7Unverified
#ModelMetricClaimedVerifiedStatus
1PERIN + RobeCzechF192.36Unverified
2PERINF192.24Unverified
3HUJI-KUF158Unverified
#ModelMetricClaimedVerifiedStatus
1PERINF180.52Unverified
2HUJI-KUF145Unverified
#ModelMetricClaimedVerifiedStatus
1PERINF180.23Unverified
2HUJI-KUF152Unverified
#ModelMetricClaimedVerifiedStatus
1PERINF194.16Unverified
2HUJI-KUF163Unverified
#ModelMetricClaimedVerifiedStatus
1PERINF189.83Unverified
2HUJI-KUF162Unverified
#ModelMetricClaimedVerifiedStatus
1PERINF192.73Unverified
2HUJI-KUF180Unverified
#ModelMetricClaimedVerifiedStatus
1PERINF189.19Unverified
2HUJI-KUF154Unverified
#ModelMetricClaimedVerifiedStatus
1TAPEX-LargeDenotation Accuracy74.5Unverified
2TAPAS-LargeAccuracy67.2Unverified
#ModelMetricClaimedVerifiedStatus
1PERINF176.4Unverified
2HUJI-KUF173Unverified
#ModelMetricClaimedVerifiedStatus
1PERINF181.01Unverified
2HUJI-KUF175Unverified
#ModelMetricClaimedVerifiedStatus
1HSPEM66.18Unverified
#ModelMetricClaimedVerifiedStatus
1ReasonBERTRF1 Score41.3Unverified
#ModelMetricClaimedVerifiedStatus
1MeMCEExact40.3Unverified