SOTAVerified

Natural Language Understanding

Natural Language Understanding is an important field of Natural Language Processing that covers tasks such as text classification, natural language inference, and story comprehension. Applications enabled by natural language understanding range from question answering to automated reasoning.

Source: Find a Reasonable Ending for Stories: Does Logic Relation Help the Story Cloze Test?

Papers

Showing 1851-1875 of 1978 papers

| Title | Status | Hype |
|---|---|---|
| A Novel Cartography-Based Curriculum Learning Method Applied on RoNLI: The First Romanian Natural Language Inference Corpus | Code | 0 |
| Robust and Efficient Fine-tuning of LLMs with Bayesian Reparameterization of Low-Rank Adaptation | Code | 0 |
| Mono vs Multilingual Transformer-based Models: a Comparison across Several Language Tasks | Code | 0 |
| BERT Knows Punta Cana is not just beautiful, it's gorgeous: Ranking Scalar Adjectives with Contextualised Representations | Code | 0 |
| MOSS: End-to-End Dialog System Framework with Modular Supervision | Code | 0 |
| BERT and PALs: Projected Attention Layers for Efficient Adaptation in Multi-Task Learning | Code | 0 |
| mPMR: A Multilingual Pre-trained Machine Reader at Scale | Code | 0 |
| MRCEval: A Comprehensive, Challenging and Accessible Machine Reading Comprehension Benchmark | Code | 0 |
| Robust Natural Language Understanding with Residual Attention Debiasing | Code | 0 |
| Enhancing Out-of-Distribution Detection in Natural Language Understanding via Implicit Layer Ensemble | Code | 0 |
| Conversational Disease Diagnosis via External Planner-Controlled Large Language Models | Code | 0 |
| Enhancing Cross-lingual Natural Language Inference by Prompt-learning from Cross-lingual Templates | Code | 0 |
| Bengali Intent Classification with Generative Adversarial BERT | Code | 0 |
| Multi-grained Label Refinement Network with Dependency Structures for Joint Intent Detection and Slot Filling | Code | 0 |
| BasqueGLUE: A Natural Language Understanding Benchmark for Basque | Code | 0 |
| Contrastive and Consistency Learning for Neural Noisy-Channel Model in Spoken Language Understanding | Code | 0 |
| Continuous Prompt Generation from Linear Combination of Discrete Prompt Embeddings | Code | 0 |
| End-to-End Knowledge-Routed Relational Dialogue System for Automatic Diagnosis | Code | 0 |
| ROSA: Random Subspace Adaptation for Efficient Fine-Tuning | Code | 0 |
| TD-Suite: All Batteries Included Framework for Technical Debt Classification | Code | 0 |
| RPN: A Word Vector Level Data Augmentation Algorithm in Deep Learning for Language Understanding | Code | 0 |
| You don't need a personality test to know these models are unreliable: Assessing the Reliability of Large Language Models on Psychometric Instruments | Code | 0 |
| Multi-LogiEval: Towards Evaluating Multi-Step Logical Reasoning Ability of Large Language Models | Code | 0 |
| Continuous Entailment Patterns for Lexical Inference in Context | Code | 0 |
| Transfer Fine-Tuning: A BERT Case Study | Code | 0 |

Benchmark Results

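| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | HNN | Accuracy | 90 | | Unverified |
| 2 | UDSSM-II (ensemble) | Accuracy | 78.3 | | Unverified |
| 3 | BERT-large 340M | Accuracy | 78.3 | | Unverified |
| 4 | UDSSM-I (ensemble) | Accuracy | 76.7 | | Unverified |
| 5 | DSSM | Accuracy | 75 | | Unverified |
| 6 | UDSSM-II | Accuracy | 75 | | Unverified |
| 7 | BERT-base 110M + MAS | Accuracy | 68.3 | | Unverified |
| 8 | USSM + Supervised Deepnet + 3 Knowledge Bases | Accuracy | 66.7 | | Unverified |
| 9 | Word-level CNN+LSTM (full scoring) | Accuracy | 60 | | Unverified |
| 10 | Subword-level Transformer LM | Accuracy | 58.3 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | BERT (pred POS/lemmas) | Tags (Full) Acc | 82.5 | | Unverified |
| 2 | BERT (none) | Tags (Full) Acc | 82 | | Unverified |
| 3 | BERT (gold POS/lemmas) | Tags (Full) Acc | 81 | | Unverified |
| 4 | GloVe (gold POS/lemmas) | Tags (Full) Acc | 79.3 | | Unverified |
| 5 | RoBERTa + Linear | Full F1 (Preps) | 78.2 | | Unverified |
| 6 | GloVe (none) | Tags (Full) Acc | 77.5 | | Unverified |
| 7 | GloVe (pred POS/lemmas) | Tags (Full) Acc | 77.1 | | Unverified |
| 8 | SVM (feature-rich, gold syntax) | Role F1 (Preps) | 62.2 | | Unverified |
| 9 | BiLSTM + MLP (gold syntax) | Role F1 (Preps) | 62.2 | | Unverified |
| 10 | SVM (feature-rich, auto syntax) | Role F1 (Preps) | 58.2 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | CaseLaw-BERT | CaseHOLD | 75.6 | | Unverified |
| 2 | Legal-BERT | CaseHOLD | 75.1 | | Unverified |
| 3 | DeBERTa | CaseHOLD | 72.1 | | Unverified |
| 4 | Longformer | CaseHOLD | 72 | | Unverified |
| 5 | RoBERTa | CaseHOLD | 71.7 | | Unverified |
| 6 | BERT | CaseHOLD | 70.7 | | Unverified |
| 7 | BigBird | CaseHOLD | 70.4 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | ConvBERT-DG | Average | 74.6 | | Unverified |
| 2 | ConvBERT-DG + Pre + Multi | Average | 73.8 | | Unverified |
| 3 | mslm | Average | 73.49 | | Unverified |
| 4 | ConvBERT + Pre + Multi | Average | 68.22 | | Unverified |
| 5 | BanLanGen | Average | 39.16 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | ConvBERT + Pre + Multi | Average | 86.89 | | Unverified |
| 2 | mslm | Average | 85.83 | | Unverified |
| 3 | ConvBERT-DG + Pre + Multi | Average | 85.34 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | MT-DNN-SMART | Average | 89.9 | | Unverified |
| 2 | BERT-LARGE | Average | 82.1 | | Unverified |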