SOTAVerified

Natural Language Understanding

Natural Language Understanding is an important field of Natural Language Processing that includes tasks such as text classification, natural language inference, and story comprehension. Applications enabled by natural language understanding range from question answering to automated reasoning.

Source: Find a Reasonable Ending for Stories: Does Logic Relation Help the Story Cloze Test?

Papers

Showing 1851-1900 of 1978 papers

Title | Status | Hype
A Novel Cartography-Based Curriculum Learning Method Applied on RoNLI: The First Romanian Natural Language Inference Corpus | Code | 0
Robust and Efficient Fine-tuning of LLMs with Bayesian Reparameterization of Low-Rank Adaptation | Code | 0
Mono vs Multilingual Transformer-based Models: a Comparison across Several Language Tasks | Code | 0
BERT Knows Punta Cana is not just beautiful, it's gorgeous: Ranking Scalar Adjectives with Contextualised Representations | Code | 0
MOSS: End-to-End Dialog System Framework with Modular Supervision | Code | 0
BERT and PALs: Projected Attention Layers for Efficient Adaptation in Multi-Task Learning | Code | 0
mPMR: A Multilingual Pre-trained Machine Reader at Scale | Code | 0
MRCEval: A Comprehensive, Challenging and Accessible Machine Reading Comprehension Benchmark | Code | 0
Robust Natural Language Understanding with Residual Attention Debiasing | Code | 0
Enhancing Out-of-Distribution Detection in Natural Language Understanding via Implicit Layer Ensemble | Code | 0
Conversational Disease Diagnosis via External Planner-Controlled Large Language Models | Code | 0
Enhancing Cross-lingual Natural Language Inference by Prompt-learning from Cross-lingual Templates | Code | 0
Bengali Intent Classification with Generative Adversarial BERT | Code | 0
Multi-grained Label Refinement Network with Dependency Structures for Joint Intent Detection and Slot Filling | Code | 0
BasqueGLUE: A Natural Language Understanding Benchmark for Basque | Code | 0
Contrastive and Consistency Learning for Neural Noisy-Channel Model in Spoken Language Understanding | Code | 0
Continuous Prompt Generation from Linear Combination of Discrete Prompt Embeddings | Code | 0
End-to-End Knowledge-Routed Relational Dialogue System for Automatic Diagnosis | Code | 0
ROSA: Random Subspace Adaptation for Efficient Fine-Tuning | Code | 0
TD-Suite: All Batteries Included Framework for Technical Debt Classification | Code | 0
RPN: A Word Vector Level Data Augmentation Algorithm in Deep Learning for Language Understanding | Code | 0
You don't need a personality test to know these models are unreliable: Assessing the Reliability of Large Language Models on Psychometric Instruments | Code | 0
Multi-LogiEval: Towards Evaluating Multi-Step Logical Reasoning Ability of Large Language Models | Code | 0
Continuous Entailment Patterns for Lexical Inference in Context | Code | 0
Transfer Fine-Tuning: A BERT Case Study | Code | 0
Transfer of Structural Knowledge from Synthetic Languages | Code | 0
Continual Dialogue State Tracking via Example-Guided Question Answering | Code | 0
What Ingredients Make for an Effective Crowdsourcing Protocol for Difficult NLU Data Collection Tasks? | Code | 0
Sarcasm Detection in a Disaster Context | Code | 0
Temporal Blind Spots in Large Language Models | Code | 0
ACCEPT: Adaptive Codebook for Composite and Efficient Prompt Tuning | Code | 0
Multi-sense embeddings through a word sense disambiguation process | Code | 0
BAM! Born-Again Multi-Task Networks for Natural Language Understanding | Code | 0
Multi-Task Deep Neural Networks for Natural Language Understanding | Code | 0
Scalable Attentive Sentence-Pair Modeling via Distilled Sentence Embedding | Code | 0
UniT: Multimodal Multitask Learning with a Unified Transformer | Code | 0
Contextual Dialogue Act Classification for Open-Domain Conversational Agents | Code | 0
An Investigation of the (In)effectiveness of Counterfactually Augmented Data | Code | 0
SCENE: Self-Labeled Counterfactuals for Extrapolating to Negative Examples | Code | 0
A Case Study of Cross-Lingual Zero-Shot Generalization for Classical Languages in LLMs | Code | 0
Constructing a Natural Language Inference Dataset using Generative Neural Networks | Code | 0
End-to-End Joint Learning of Natural Language Understanding and Dialogue Manager | Code | 0
ConsistentEE: A Consistent and Hardness-Guided Early Exiting Method for Accelerating Language Models Inference | Code | 0
My Teacher Thinks The World Is Flat! Interpreting Automatic Essay Scoring Mechanism | Code | 0
Named Entity Recognition in the Romanian Legal Domain | Code | 0
Back Transcription as a Method for Evaluating Robustness of Natural Language Understanding Models to Speech Recognition Errors | Code | 0
NarraSum: A Large-Scale Dataset for Abstractive Narrative Summarization | Code | 0
Confusion2vec 2.0: Enriching Ambiguous Spoken Language Representations with Subwords | Code | 0
Zero-Shot End-to-End Spoken Language Understanding via Cross-Modal Selective Self-Training | Code | 0
Encoder-Agnostic Adaptation for Conditional Language Generation | Code | 0
Page 38 of 40

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | HNN | Accuracy | 90 | | Unverified
2 | UDSSM-II (ensemble) | Accuracy | 78.3 | | Unverified
3 | BERT-large 340M | Accuracy | 78.3 | | Unverified
4 | UDSSM-I (ensemble) | Accuracy | 76.7 | | Unverified
5 | DSSM | Accuracy | 75 | | Unverified
6 | UDSSM-II | Accuracy | 75 | | Unverified
7 | BERT-base 110M + MAS | Accuracy | 68.3 | | Unverified
8 | USSM + Supervised Deepnet + 3 Knowledge Bases | Accuracy | 66.7 | | Unverified
9 | Word-level CNN+LSTM (full scoring) | Accuracy | 60 | | Unverified
10 | Subword-level Transformer LM | Accuracy | 58.3 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | BERT (pred POS/lemmas) | Tags (Full) Acc | 82.5 | | Unverified
2 | BERT (none) | Tags (Full) Acc | 82 | | Unverified
3 | BERT (gold POS/lemmas) | Tags (Full) Acc | 81 | | Unverified
4 | GloVe (gold POS/lemmas) | Tags (Full) Acc | 79.3 | | Unverified
5 | RoBERTa + Linear | Full F1 (Preps) | 78.2 | | Unverified
6 | GloVe (none) | Tags (Full) Acc | 77.5 | | Unverified
7 | GloVe (pred POS/lemmas) | Tags (Full) Acc | 77.1 | | Unverified
8 | SVM (feature-rich, gold syntax) | Role F1 (Preps) | 62.2 | | Unverified
9 | BiLSTM + MLP (gold syntax) | Role F1 (Preps) | 62.2 | | Unverified
10 | SVM (feature-rich, auto syntax) | Role F1 (Preps) | 58.2 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | CaseLaw-BERT | CaseHOLD | 75.6 | | Unverified
2 | Legal-BERT | CaseHOLD | 75.1 | | Unverified
3 | DeBERTa | CaseHOLD | 72.1 | | Unverified
4 | Longformer | CaseHOLD | 72 | | Unverified
5 | RoBERTa | CaseHOLD | 71.7 | | Unverified
6 | BERT | CaseHOLD | 70.7 | | Unverified
7 | BigBird | CaseHOLD | 70.4 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ConvBERT-DG | Average | 74.6 | | Unverified
2 | ConvBERT-DG + Pre + Multi | Average | 73.8 | | Unverified
3 | mslm | Average | 73.49 | | Unverified
4 | ConvBERT + Pre + Multi | Average | 68.22 | | Unverified
5 | BanLanGen | Average | 39.16 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ConvBERT + Pre + Multi | Average | 86.89 | | Unverified
2 | mslm | Average | 85.83 | | Unverified
3 | ConvBERT-DG + Pre + Multi | Average | 85.34 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | MT-DNN-SMART | Average | 89.9 | | Unverified
2 | BERT-LARGE | Average | 82.1 | | Unverified