SOTAVerified

Natural Language Understanding

Natural Language Understanding is an important field of Natural Language Processing which contains various tasks such as text classification, natural language inference and story comprehension. Applications enabled by natural language understanding range from question answering to automated reasoning.

Source: Find a Reasonable Ending for Stories: Does Logic Relation Help the Story Cloze Test?

Papers

Showing 18011850 of 1978 papers

TitleStatusHype
Unraveling the Dominance of Large Language Models Over Transformer Models for Bangla Natural Language Inference: A Comprehensive StudyCode0
MDDial: A Multi-turn Differential Diagnosis Dialogue Dataset with Reliability EvaluationCode0
Rethinking embedding coupling in pre-trained language modelsCode0
bgGLUE: A Bulgarian General Language Understanding Evaluation BenchmarkCode0
TabFact: A Large-scale Dataset for Table-based Fact VerificationCode0
CrossAligner & Co: Zero-Shot Transfer Methods for Task-Oriented Cross-lingual Natural Language UnderstandingCode0
Why Build an Assistant in Minecraft?Code0
Counterfactually-Augmented SNLI Training Data Does Not Yield Better Generalization Than Unaugmented DataCode0
Executing Natural Language-Described Algorithms with Large Language Models: An InvestigationCode0
TableFormer: Robust Transformer Modeling for Table-Text EncodingCode0
Evidential Sparsification of Multimodal Latent Spaces in Conditional Variational AutoencodersCode0
Beyond 512 Tokens: Siamese Multi-depth Transformer-based Hierarchical Encoder for Long-Form Document MatchingCode0
Revisit Few-shot Intent Classification with PLMs: Direct Fine-tuning vs. Continual Pre-trainingCode0
Event Linking: Grounding Event Mentions to WikipediaCode0
Memory TransformerCode0
Mention Memory: incorporating textual knowledge into Transformers through entity mention attentionCode0
MERGE: Fast Private Text GenerationCode0
Evaluating The Effectiveness of Capsule Neural Network in Toxic Comment Classification using Pre-trained BERT EmbeddingsCode0
CWTM: Leveraging Contextualized Word Embeddings from BERT for Neural Topic ModelingCode0
Evaluating N-best Calibration of Natural Language Understanding for Dialogue SystemsCode0
Meta-Learning for Natural Language Understanding under Continual Learning FrameworkCode0
AlexU-AL at SemEval-2022 Task 6: Detecting Sarcasm in Arabic Text Using Deep Learning TechniquesCode0
Evaluating Natural Language Understanding Services for Conversational Question Answering SystemsCode0
tagE: Enabling an Embodied Agent to Understand Human InstructionsCode0
Revisiting Sample Size Determination in Natural Language UnderstandingCode0
TinyBERT: Distilling BERT for Natural Language UnderstandingCode0
MIDAS: Multi-level Intent, Domain, And Slot Knowledge Distillation for Multi-turn NLUCode0
Evaluating Gender Bias in Natural Language InferenceCode0
Mind the GAP: A Balanced Corpus of Gendered Ambiguous PronounsCode0
Evaluating Coreference Resolvers on Community-based Question Answering: From Rule-based to State of the ArtCode0
ERNIE 3.0: Large-scale Knowledge Enhanced Pre-training for Language Understanding and GenerationCode0
EQUATE: A Benchmark Evaluation Framework for Quantitative Reasoning in Natural Language InferenceCode0
WikiReading: A Novel Large-scale Language Understanding Task over WikipediaCode0
Mitigating Biases in Toxic Language Detection through Invariant RationalizationCode0
TAPIR: Learning Adaptive Revision for Incremental Natural Language Understanding with a Two-Pass ModelCode0
EDA: Enriching Emotional Dialogue Acts using an Ensemble of Neural AnnotatorsCode0
WikiCREM: A Large Unsupervised Corpus for Coreference ResolutionCode0
Reweighting Augmented Samples by Minimizing the Maximal Expected LossCode0
m-Networks: Adapting the Triplet Networks for Acronym DisambiguationCode0
RGL: A Simple yet Effective Relation Graph Augmented Prompt-based Tuning Approach for Few-Shot LearningCode0
Accelerated Large Batch Optimization of BERT Pretraining in 54 minutesCode0
Training Efficient CNNS: Tweaking the Nuts and Bolts of Neural Networks for Lighter, Faster and Robust ModelsCode0
A Multi-level Neural Network for Implicit Causality Detection in Web TextsCode0
Counterfactual Detection meets Transfer LearningCode0
Modeling Variations of First-Order Horn Abduction in Answer Set ProgrammingCode0
User-in-the-loop Adaptive Intent Detection for Instructable Digital AssistantCode0
A Probabilistic Generative Grammar for Semantic ParsingCode0
Task Oriented Dialogue as a Catalyst for Self-Supervised Automatic Speech RecognitionCode0
Enhancing Reinforcement Learning with Label-Sensitive Reward for Natural Language UnderstandingCode0
Coreference Reasoning in Machine Reading ComprehensionCode0
Show:102550
← PrevPage 37 of 40Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1HNNAccuracy90Unverified
2BERT-large 340MAccuracy78.3Unverified
3UDSSM-II (ensemble)Accuracy78.3Unverified
4UDSSM-I (ensemble)Accuracy76.7Unverified
5DSSMAccuracy75Unverified
6UDSSM-IIAccuracy75Unverified
7BERT-base 110M + MASAccuracy68.3Unverified
8USSM + Supervised Deepnet + 3 Knowledge BasesAccuracy66.7Unverified
9Word-level CNN+LSTM (full scoring)Accuracy60Unverified
10Subword-level Transformer LMAccuracy58.3Unverified
#ModelMetricClaimedVerifiedStatus
1BERT (pred POS/lemmas)Tags (Full) Acc82.5Unverified
2BERT (none)Tags (Full) Acc82Unverified
3BERT (gold POS/lemmas)Tags (Full) Acc81Unverified
4GloVe (gold POS/lemmas)Tags (Full) Acc79.3Unverified
5RoBERTa + LinearFull F1 (Preps)78.2Unverified
6GloVe (none)Tags (Full) Acc77.5Unverified
7GloVe (pred POS/lemmas)Tags (Full) Acc77.1Unverified
8SVM (feature-rich, gold syntax)Role F1 (Preps)62.2Unverified
9BiLSTM + MLP (gold syntax)Role F1 (Preps)62.2Unverified
10SVM (feature-rich, auto syntax)Role F1 (Preps)58.2Unverified
#ModelMetricClaimedVerifiedStatus
1CaseLaw-BERTCaseHOLD75.6Unverified
2Legal-BERTCaseHOLD75.1Unverified
3DeBERTaCaseHOLD72.1Unverified
4LongformerCaseHOLD72Unverified
5RoBERTaCaseHOLD71.7Unverified
6BERTCaseHOLD70.7Unverified
7BigBirdCaseHOLD70.4Unverified
#ModelMetricClaimedVerifiedStatus
1ConvBERT-DGAverage74.6Unverified
2ConvBERT-DG + Pre + MultiAverage73.8Unverified
3mslmAverage73.49Unverified
4ConvBERT + Pre + MultiAverage68.22Unverified
5BanLanGenAverage39.16Unverified
#ModelMetricClaimedVerifiedStatus
1ConvBERT + Pre + MultiAverage86.89Unverified
2mslmAverage85.83Unverified
3ConvBERT-DG + Pre + MultiAverage85.34Unverified
#ModelMetricClaimedVerifiedStatus
1MT-DNN-SMARTAverage89.9Unverified
2BERT-LARGEAverage82.1Unverified