SOTAVerified

Natural Language Understanding

Natural Language Understanding is an important field of Natural Language Processing which contains various tasks such as text classification, natural language inference and story comprehension. Applications enabled by natural language understanding range from question answering to automated reasoning.

Source: Find a Reasonable Ending for Stories: Does Logic Relation Help the Story Cloze Test?

Papers

Showing 801850 of 1978 papers

TitleStatusHype
Simple and Effective Gradient-Based Tuning of Sequence-to-Sequence Models0
Multi-grained Label Refinement Network with Dependency Structures for Joint Intent Detection and Slot FillingCode0
Entity Aware Syntax Tree Based Data Augmentation for Natural Language Understanding0
From Black Boxes to Conversations: Incorporating XAI in a Conversational AgentCode1
Semantically Meaningful Metrics for Norwegian ASR SystemsCode0
FOLIO: Natural Language Reasoning with First-Order LogicCode1
Evaluating N-best Calibration of Natural Language Understanding for Dialogue SystemsCode0
Dialog Acts for Task Driven Embodied Agents0
Distilling Multi-Scale Knowledge for Event Temporal Relation Extraction0
Enhancing Semantic Understanding with Self-supervised Methods for Abstractive Dialogue Summarization0
Unified Knowledge Prompt Pre-training for Customer Service Dialogues0
Exploring and Evaluating Personalized Models for Code Generation0
Building the Intent Landscape of Real-World Conversational Corpora with Extractive Question-Answering Transformers0
Shortcut Learning of Large Language Models in Natural Language Understanding0
IndicSUPERB: A Speech Processing Universal Performance Benchmark for Indian languagesCode1
CLOWER: A Pre-trained Language Model with Contrastive Learning over Word and Character Representations0
Coarse-to-Fine: Hierarchical Multi-task Learning for Natural Language Understanding0
Adapting Task-Oriented Dialogue Models for Email Conversations0
Effective Transfer Learning for Low-Resource Natural Language Understanding0
DeeperDive: The Unreasonable Effectiveness of Weak Supervision in Document Understanding A Case Study in Collaboration with UiPath Inc0
ShortcutLens: A Visual Analytics Approach for Exploring Shortcuts in Natural Language Understanding Dataset0
A Hybrid Model of Classification and Generation for Spatial Relation Extraction0
Efficient Long-Text Understanding with Short-Text ModelsCode1
DoRO: Disambiguation of referred object for embodied agents0
Dynamic Planning in Open-Ended Dialogue using Reinforcement Learning0
MoEC: Mixture of Expert Clusters0
Analyzing Bagging Methods for Language Models0
On the cross-lingual transferability of multilingual prototypical models across NLU tasks0
PiC: A Phrase-in-Context Dataset for Phrase Understanding and Semantic SearchCode0
End-to-End Spoken Language Understanding: Performance analyses of a voice command task in a low resource setting0
Forming Trees with Treeformers0
Learning to translate by learning to communicateCode0
PLM-ICD: Automatic ICD Coding with Pretrained Language ModelsCode1
Chat-to-Design: AI Assisted Personalized Fashion Design0
Lightweight Transformers for Conversational AI0
Efficient Semi-supervised Consistency Training for Natural Language Understanding0
m-Networks: Adapting the Triplet Networks for Acronym DisambiguationCode0
A New Concept of Knowledge based Question Answering (KBQA) System for Multi-hop Reasoning0
A Dog Is Passing Over The Jet? A Text-Generation Dataset for Korean Commonsense Reasoning and Evaluation0
Strategies to Improve Few-shot Learning for Intent Classification and Slot-Filling0
On Curriculum Learning for Commonsense ReasoningCode0
RGL: A Simple yet Effective Relation Graph Augmented Prompt-based Tuning Approach for Few-Shot Learning0
AlexU-AL at SemEval-2022 Task 6: Detecting Sarcasm in Arabic Text Using Deep Learning TechniquesCode0
NER4ID at SemEval-2022 Task 2: Named Entity Recognition for Idiomaticity DetectionCode0
Enhancing Self-Attention with Knowledge-Assisted Attention Maps0
Yes, No or IDK: The Challenge of Unanswerable Yes/No Questions0
ID10M: Idiom Identification in 10 LanguagesCode0
Efficient Learning of Multiple NLP Tasks via Collective Weight Factorization on BERT0
Dyna-bAbI: unlocking bAbI’s potential with dynamic synthetic benchmarking0
Is neural language acquisition similar to natural? A chronological probing studyCode0
Show:102550
← PrevPage 17 of 40Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1HNNAccuracy90Unverified
2UDSSM-II (ensemble)Accuracy78.3Unverified
3BERT-large 340MAccuracy78.3Unverified
4UDSSM-I (ensemble)Accuracy76.7Unverified
5DSSMAccuracy75Unverified
6UDSSM-IIAccuracy75Unverified
7BERT-base 110M + MASAccuracy68.3Unverified
8USSM + Supervised Deepnet + 3 Knowledge BasesAccuracy66.7Unverified
9Word-level CNN+LSTM (full scoring)Accuracy60Unverified
10Subword-level Transformer LMAccuracy58.3Unverified
#ModelMetricClaimedVerifiedStatus
1BERT (pred POS/lemmas)Tags (Full) Acc82.5Unverified
2BERT (none)Tags (Full) Acc82Unverified
3BERT (gold POS/lemmas)Tags (Full) Acc81Unverified
4GloVe (gold POS/lemmas)Tags (Full) Acc79.3Unverified
5RoBERTa + LinearFull F1 (Preps)78.2Unverified
6GloVe (none)Tags (Full) Acc77.5Unverified
7GloVe (pred POS/lemmas)Tags (Full) Acc77.1Unverified
8SVM (feature-rich, gold syntax)Role F1 (Preps)62.2Unverified
9BiLSTM + MLP (gold syntax)Role F1 (Preps)62.2Unverified
10SVM (feature-rich, auto syntax)Role F1 (Preps)58.2Unverified
#ModelMetricClaimedVerifiedStatus
1CaseLaw-BERTCaseHOLD75.6Unverified
2Legal-BERTCaseHOLD75.1Unverified
3DeBERTaCaseHOLD72.1Unverified
4LongformerCaseHOLD72Unverified
5RoBERTaCaseHOLD71.7Unverified
6BERTCaseHOLD70.7Unverified
7BigBirdCaseHOLD70.4Unverified
#ModelMetricClaimedVerifiedStatus
1ConvBERT-DGAverage74.6Unverified
2ConvBERT-DG + Pre + MultiAverage73.8Unverified
3mslmAverage73.49Unverified
4ConvBERT + Pre + MultiAverage68.22Unverified
5BanLanGenAverage39.16Unverified
#ModelMetricClaimedVerifiedStatus
1ConvBERT + Pre + MultiAverage86.89Unverified
2mslmAverage85.83Unverified
3ConvBERT-DG + Pre + MultiAverage85.34Unverified
#ModelMetricClaimedVerifiedStatus
1MT-DNN-SMARTAverage89.9Unverified
2BERT-LARGEAverage82.1Unverified