SOTAVerified

Natural Language Understanding

Natural Language Understanding is a subfield of Natural Language Processing that covers tasks such as text classification, natural language inference, and story comprehension. Applications enabled by natural language understanding range from question answering to automated reasoning.

Source: Find a Reasonable Ending for Stories: Does Logic Relation Help the Story Cloze Test?

Papers

Showing 801–850 of 1978 papers

| Title | Status | Hype |
|---|---|---|
| ABEX: Data Augmentation for Low-Resource NLU via Expanding Abstract Descriptions | Code | 0 |
| Identifying Distributional Perspective Differences from Colingual Groups | Code | 0 |
| Improving Dialectal Slot and Intent Detection with Auxiliary Tasks: A Multi-Dialectal Bavarian Case Study | Code | 0 |
| skLEP: A Slovak General Language Understanding Benchmark | Code | 0 |
| Detecting Emotion Carriers by Combining Acoustic and Lexical Representations | | 0 |
| Designing the Next Generation of Intelligent Personal Robotic Assistants for the Physically Impaired | | 0 |
| Designing Templates for Eliciting Commonsense Knowledge from Pretrained Sequence-to-Sequence Models | | 0 |
| Design considerations for a hierarchical semantic compositional framework for medical natural language understanding | | 0 |
| Batch Calibration: Rethinking Calibration for In-Context Learning and Prompt Engineering | | 0 |
| DERA: Enhancing Large Language Model Completions with Dialog-Enabled Resolving Agents | | 0 |
| BaSCo: An Annotated Basque-Spanish Code-Switching Corpus for Natural Language Understanding | | 0 |
| An Interdisciplinary Review of Commonsense Reasoning and Intent Detection | | 0 |
| Dense Feature Memory Augmented Transformers for COVID-19 Vaccination Search Classification | | 0 |
| Local Structure Matters Most: Perturbation Study in NLU | | 0 |
| Demonstrations of Integrity Attacks in Multi-Agent Systems | | 0 |
| BanglaBERT: Language Model Pretraining and Benchmarks for Low-Resource Language Understanding Evaluation in Bangla | | 0 |
| An Improved Neural Baseline for Temporal Relation Extraction | | 0 |
| Deliberation Model for On-Device Spoken Language Understanding | | 0 |
| Delexicalized Paraphrase Generation | | 0 |
| DEEPYANG at SemEval-2020 Task 4: Using the Hidden Layer State of BERT Model for Differentiating Common Sense | | 0 |
| A New Sentence Ordering Method Using BERT Pretrained Model | | 0 |
| A Comparison of Natural Language Understanding Platforms for Chatbots in Software Engineering | | 0 |
| Deep Natural Language Understanding of News Text | | 0 |
| Bailicai: A Domain-Optimized Retrieval-Augmented Generation Framework for Medical Applications | | 0 |
| Deeply Embedded Knowledge Representation & Reasoning For Natural Language Question Answering: A Practitioner’s Perspective | | 0 |
| Bag of Tricks for Effective Language Model Pretraining and Downstream Adaptation: A Case Study on GLUE | | 0 |
| Deep learning systems as complex networks | | 0 |
| Bag of Experts Architectures for Model Reuse in Conversational Language Understanding | | 0 |
| Learning to Embed Categorical Features without Embedding Tables for Recommendation | | 0 |
| DeeperDive: The Unreasonable Effectiveness of Weak Supervision in Document Understanding A Case Study in Collaboration with UiPath Inc | | 0 |
| Bactrainus: Optimizing Large Language Models for Multi-hop Complex Question Answering Tasks | | 0 |
| A New Concept of Knowledge based Question Answering (KBQA) System for Multi-hop Reasoning | | 0 |
| Deepening Hidden Representations from Pre-trained Language Models | | 0 |
| ESIE-BERT: Enriching Sub-words Information Explicitly with BERT for Joint Intent Classification and SlotFilling | | 0 |
| A Neural Entity Coreference Resolution Review | | 0 |
| A Comparative Study on Collecting High-Quality Implicit Reasonings at a Large-scale | | 0 |
| AAVENUE: Detecting LLM Biases on NLU Tasks in AAVE via a Novel Benchmark | | 0 |
| DBR: Divergence-Based Regularization for Debiasing Natural Language Understanding Models | | 0 |
| A Weakly-Supervised Attention-based Visualization Tool for Assessing Political Affiliation | | 0 |
| An Enactivist account of Mind Reading in Natural Language Understanding | | 0 |
| DAWSON: Data Augmentation using Weak Supervision On Natural Language | | 0 |
| Data Generation Using Large Language Models for Text Classification: An Empirical Case Study | | 0 |
| Advancing the State of the Art in Open Domain Dialog Systems through the Alexa Prize | | 0 |
| Data Augmentation with Paraphrase Generation and Entity Extraction for Multimodal Dialogue System | | 0 |
| Data Augmentation for Training Dialog Models Robust to Speech Recognition Errors | | 0 |
| Data Augmentation and Learned Layer Aggregation for Improved Multilingual Language Understanding in Dialogue | | 0 |
| AutoNLU: Detecting, root-causing, and fixing NLU model errors | | 0 |
| Data Augmentation and Learned Layer Aggregation for Improved Multilingual Language Understanding in Dialogue | | 0 |
| Data Annealing for Informal Language Understanding Tasks | | 0 |
| DACBERT: Leveraging Dependency Agreement for Cost-Efficient Bert Pretraining | | 0 |

Benchmark Results

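| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | HNN | Accuracy | 90 | | Unverified |
| 2 | BERT-large 340M | Accuracy | 78.3 | | Unverified |
| 3 | UDSSM-II (ensemble) | Accuracy | 78.3 | | Unverified |
| 4 | UDSSM-I (ensemble) | Accuracy | 76.7 | | Unverified |
| 5 | DSSM | Accuracy | 75 | | Unverified |
| 6 | UDSSM-II | Accuracy | 75 | | Unverified |
| 7 | BERT-base 110M + MAS | Accuracy | 68.3 | | Unverified |
| 8 | USSM + Supervised Deepnet + 3 Knowledge Bases | Accuracy | 66.7 | | Unverified |
| 9 | Word-level CNN+LSTM (full scoring) | Accuracy | 60 | | Unverified |
| 10 | Subword-level Transformer LM | Accuracy | 58.3 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | BERT (pred POS/lemmas) | Tags (Full) Acc | 82.5 | | Unverified |
| 2 | BERT (none) | Tags (Full) Acc | 82 | | Unverified |
| 3 | BERT (gold POS/lemmas) | Tags (Full) Acc | 81 | | Unverified |
| 4 | GloVe (gold POS/lemmas) | Tags (Full) Acc | 79.3 | | Unverified |
| 5 | RoBERTa + Linear | Full F1 (Preps) | 78.2 | | Unverified |
| 6 | GloVe (none) | Tags (Full) Acc | 77.5 | | Unverified |
| 7 | GloVe (pred POS/lemmas) | Tags (Full) Acc | 77.1 | | Unverified |
| 8 | SVM (feature-rich, gold syntax) | Role F1 (Preps) | 62.2 | | Unverified |
| 9 | BiLSTM + MLP (gold syntax) | Role F1 (Preps) | 62.2 | | Unverified |
| 10 | SVM (feature-rich, auto syntax) | Role F1 (Preps) | 58.2 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | CaseLaw-BERT | CaseHOLD | 75.6 | | Unverified |
| 2 | Legal-BERT | CaseHOLD | 75.1 | | Unverified |
| 3 | DeBERTa | CaseHOLD | 72.1 | | Unverified |
| 4 | Longformer | CaseHOLD | 72 | | Unverified |
| 5 | RoBERTa | CaseHOLD | 71.7 | | Unverified |
| 6 | BERT | CaseHOLD | 70.7 | | Unverified |
| 7 | BigBird | CaseHOLD | 70.4 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | ConvBERT-DG | Average | 74.6 | | Unverified |
| 2 | ConvBERT-DG + Pre + Multi | Average | 73.8 | | Unverified |
| 3 | mslm | Average | 73.49 | | Unverified |
| 4 | ConvBERT + Pre + Multi | Average | 68.22 | | Unverified |
| 5 | BanLanGen | Average | 39.16 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | ConvBERT + Pre + Multi | Average | 86.89 | | Unverified |
| 2 | mslm | Average | 85.83 | | Unverified |
| 3 | ConvBERT-DG + Pre + Multi | Average | 85.34 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | MT-DNN-SMART | Average | 89.9 | | Unverified |
| 2 | BERT-LARGE | Average | 82.1 | | Unverified |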