SOTAVerified

Semantic Parsing

Semantic Parsing is the task of transducing natural language utterances into formal meaning representations. The target meaning representations can be defined according to a wide variety of formalisms. These include linguistically motivated semantic representations designed to capture the meaning of any sentence, such as λ-calculus or Abstract Meaning Representation (AMR). Alternatively, in more task-driven approaches to Semantic Parsing, meaning representations commonly take the form of executable programs such as SQL queries, robotic commands, smartphone instructions, and even programs in general-purpose programming languages like Python and Java.

Source: Tranx: A Transition-based Neural Abstract Syntax Parser for Semantic Parsing and Code Generation
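To make the definition above concrete, here is a minimal sketch of the task-driven setting, where the meaning representation is an executable SQL query. It is a toy rule-based parser with a single hand-written pattern, not the neural transition-based method from the cited paper. The `city` table, its rows, and the function names `parse` and `execute` are all hypothetical and exist only for this illustration.

```python
import re
import sqlite3

# Hypothetical toy data, invented for this illustration.
ROWS = [
    ("Paris", "France", 2100000),
    ("Lyon", "France", 520000),
    ("Rome", "Italy", 2800000),
]


def parse(utterance):
    """Map an utterance to a parameterized SQL query (the meaning representation).

    Only one hand-written pattern is covered; real semantic parsers learn this mapping.
    """
    match = re.fullmatch(r"how many cities are in (\w+)\??", utterance.strip().lower())
    if match is None:
        raise ValueError(f"no rule covers: {utterance!r}")
    country = match.group(1).capitalize()
    return "SELECT COUNT(*) FROM city WHERE country = ?", (country,)


def execute(sql, params):
    """Run the query against an in-memory database to obtain its denotation."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE city (name TEXT, country TEXT, population INTEGER)")
    conn.executemany("INSERT INTO city VALUES (?, ?, ?)", ROWS)
    result = conn.execute(sql, params).fetchall()
    conn.close()
    return result


sql, params = parse("How many cities are in France?")
answer = execute(sql, params)
print(sql, params, answer)
```

The sketch separates the two steps that the task-driven view implies: `parse` produces the formal representation, and `execute` turns it into an answer. Comparing the query text and comparing the executed answer are two different ways to judge a parser's output.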

Papers

Showing 201-250 of 1202 papers

Title | Status | Hype
Evaluating Structural Generalization in Neural Machine Translation | Code | 0
Efficient Prompting for LLM-based Generative Internet of Things | - | 0
Multimodal Contextualized Semantic Parsing from Speech | Code | 0
Compositional Generalization with Grounded Language Models | Code | 0
SPAGHETTI: Open-Domain Question Answering from Heterogeneous Data Sources with Retrieval and Semantic Parsing | - | 0
CREPE: Coordinate-Aware End-to-End Document Parser | - | 0
Multi-hop Question Answering over Knowledge Graphs using Large Language Models | - | 0
Incorporating Lexical and Syntactic Knowledge for Unsupervised Cross-Lingual Transfer | Code | 0
Neural Semantic Parsing with Extremely Rich Symbolic Meaning Representations | Code | 0
Towards Compositionally Generalizable Semantic Parsing in Large Language Models: A Survey | - | 0
PMB5: Gaining More Insight into Neural Semantic Parsing with Challenging Benchmarks | - | 0
Self-Improvement Programming for Temporal Knowledge Graph Question Answering | - | 0
Ar-Spider: Text-to-SQL in Arabic | - | 0
Training Table Question Answering via SQL Query Decomposition | - | 0
Improving Generalization in Semantic Parsing by Increasing Natural Language Variation | Code | 0
Neural Models for Source Code Synthesis and Completion | - | 0
LB-KBQA: Large-language-model and BERT based Knowledge-Based Question and Answering System | - | 0
Language-Guided World Models: A Model-Based Approach to AI Control | - | 0
How Proficient Are Large Language Models in Formal Languages? An In-Depth Insight for Knowledge Base Question Answering | - | 0
Evaluating Large Language Models in Semantic Parsing for Conversational Question Answering over Knowledge Graphs | Code | 0
Semantic Parsing for Complex Data Retrieval: Targeting Query Plans vs. SQL for No-Code Access to Relational Databases | - | 0
kNN-ICL: Compositional Task-Oriented Parsing Generalization with Nearest Neighbor In-Context Learning | - | 0
Semantic Parsing for Question Answering over Knowledge Graphs | - | 0
Leveraging Code to Improve In-context Learning for Semantic Parsing | Code | 0
Predicting generalization performance with correctness discriminators | - | 0
Multistage Collaborative Knowledge Distillation from a Large Language Model for Semi-Supervised Sequence Generation | Code | 0
Robust NL-to-Cypher Translation for KBQA: Harnessing Large Language Model with Chain of Prompts | - | 0
Natural Language Interfaces for Tabular Data Querying and Visualization: A Survey | - | 0
SLOG: A Structural Generalization Benchmark for Semantic Parsing | Code | 0
An In-Context Schema Understanding Method for Knowledge Base Question Answering | - | 0
Structural generalization in COGS: Supertagging is (almost) all you need | - | 0
Semantic Decomposition of Question and SQL for Text-to-SQL Parsing | Code | 0
A Unified View of Evaluation Metrics for Structured Prediction | Code | 0
MAGNIFICo: Evaluating the In-Context Learning Ability of Large Language Models to Generalize to Novel Interpretations | Code | 0
Semantic Parsing by Large Language Models for Intricate Updating Strategies of Zero-Shot Dialogue State Tracking | Code | 0
Parameterizing Context: Unleashing the Power of Parameter-Efficient Fine-Tuning and In-Context Tuning for Continual Table Semantic Parsing | - | 0
Testing the Limits of Unified Sequence to Sequence LLM Pretraining on Diverse Table Data Tasks | - | 0
PARF: Primitive-Aware Radiance Fusion for Indoor Scene Novel View Synthesis | - | 0
L2CEval: Evaluating Language-to-Code Generation Capabilities of Large Language Models | - | 0
Towards End-User Development for IoT: A Case Study on Semantic Parsing of Cooking Recipes for Programming Kitchen Devices | Code | 0
LOGICSEG: Parsing Visual Semantics with Neural Logic Learning and Reasoning | - | 0
Few-Shot Adaptation for Parsing Contextual Utterances with LLMs | Code | 0
Augmenting text for spoken language understanding with Large Language Models | - | 0
Data Distribution Bottlenecks in Grounding Language Models to Knowledge Bases | Code | 0
Semantic Parsing in Limited Resource Conditions | - | 0
HopPG: Self-Iterative Program Generation for Multi-Hop Question Answering over Heterogeneous Knowledge | - | 0
Adapt and Decompose: Efficient Generalization of Text-to-SQL via Domain Adapted Least-To-Most Prompting | - | 0
ReCoMIF: Reading comprehension based multi-source information fusion network for Chinese spoken language understanding | Code | 0
Structural Transfer Learning in NL-to-Bash Semantic Parsers | - | 0
Holistic Exploration on Universal Decompositional Semantic Parsing: Architecture, Data Augmentation, and LLM Paradigm | Code | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | ARTEMIS-DA | Accuracy (Test) | 80.8 | - | Unverified
2 | SynTQA (Oracle) | Test Accuracy | 77.5 | - | Unverified
3 | TabLaP | Accuracy (Test) | 76.6 | - | Unverified
4 | SynTQA (GPT) | Accuracy (Test) | 74.4 | - | Unverified
5 | Mix SC | Accuracy (Test) | 73.6 | - | Unverified
6 | SynTQA (RF) | Accuracy (Test) | 71.6 | - | Unverified
7 | CABINET | Accuracy (Test) | 69.1 | - | Unverified
8 | NormTab+TabSQLify | Accuracy (Test) | 68.63 | - | Unverified
9 | Chain-of-Table | Accuracy (Test) | 67.31 | - | Unverified
10 | Tab-PoT | Accuracy (Test) | 66.78 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | RESDSQL-3B + NatSQL | Accuracy | 84.1 | - | Unverified
2 | code-davinci-002 175B (LEVER) | Accuracy | 81.9 | - | Unverified
3 | RASAT+PICARD | Accuracy | 75.5 | - | Unverified
4 | Graphix-3B + PICARD | Accuracy | 74 | - | Unverified
5 | T5-3B + PICARD | Accuracy | 71.9 | - | Unverified
6 | SADGA + GAP | Accuracy | 70.1 | - | Unverified
7 | RATSQL + GAP | Accuracy | 69.7 | - | Unverified
8 | RATSQL + Grammar-Augmented Pre-Training | Accuracy | 69.6 | - | Unverified
9 | RATSQL + BERT | Accuracy | 65.6 | - | Unverified
10 | Exact Set Matching | Accuracy | 19.7 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Dynamic Least-to-Most Prompting | Exact Match | 95 | - | Unverified
2 | LeAR | Exact Match | 90.9 | - | Unverified
3 | T5-3B w/ Intermediate Representations | Exact Match | 83.8 | - | Unverified
4 | Hierarchical Poset Decoding | Exact Match | 69 | - | Unverified
5 | Universal Transformer | Exact Match | 18.9 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ReaRev | Accuracy | 76.4 | - | Unverified
2 | NSM+h | Accuracy | 74.3 | - | Unverified
3 | CBR-KBQA | Accuracy | 70 | - | Unverified
4 | STAGG (Yih et al., 2016) | Accuracy | 63.9 | - | Unverified
5 | T5-11B (Raffel et al., 2020) | Accuracy | 56.5 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | CABINET | Denotation accuracy (test) | 89.5 | - | Unverified
2 | TAPEX-Large (weak supervision) | Denotation accuracy (test) | 89.5 | - | Unverified
3 | ReasTAP-Large (weak supervision) | Denotation accuracy (test) | 89.2 | - | Unverified
4 | NL2SQL-BERT | Accuracy | 89 | - | Unverified
5 | TAPAS-Large (weak supervision) | Denotation accuracy (test) | 83.6 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | PhraseTransformer | Accuracy | 90.4 | - | Unverified
2 | Tranx | Accuracy | 86.2 | - | Unverified
3 | ASN (Rabinovich et al., 2017) | Accuracy | 85.3 | - | Unverified
4 | ZH15 (Zhao and Huang, 2015) | Accuracy | 84.2 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | coarse2fine | Accuracy | 88.2 | - | Unverified
2 | PhraseTransformer | Accuracy | 87.9 | - | Unverified
3 | Tranx | Accuracy | 87.7 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | PERIN + RobeCzech | F1 | 92.36 | - | Unverified
2 | PERIN | F1 | 92.24 | - | Unverified
3 | HUJI-KU | F1 | 58 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | PERIN | F1 | 80.52 | - | Unverified
2 | HUJI-KU | F1 | 45 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | PERIN | F1 | 80.23 | - | Unverified
2 | HUJI-KU | F1 | 52 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | PERIN | F1 | 94.16 | - | Unverified
2 | HUJI-KU | F1 | 63 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | PERIN | F1 | 89.83 | - | Unverified
2 | HUJI-KU | F1 | 62 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | PERIN | F1 | 92.73 | - | Unverified
2 | HUJI-KU | F1 | 80 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | PERIN | F1 | 89.19 | - | Unverified
2 | HUJI-KU | F1 | 54 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | TAPEX-Large | Denotation Accuracy | 74.5 | - | Unverified
2 | TAPAS-Large | Accuracy | 67.2 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | PERIN | F1 | 76.4 | - | Unverified
2 | HUJI-KU | F1 | 73 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | PERIN | F1 | 81.01 | - | Unverified
2 | HUJI-KU | F1 | 75 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | HSP | EM | 66.18 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ReasonBERTR | F1 Score | 41.3 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | MeMCE | Exact | 40.3 | - | Unverified