SOTAVerified

Natural Language Understanding

Natural Language Understanding is an important field of Natural Language Processing which contains various tasks such as text classification, natural language inference and story comprehension. Applications enabled by natural language understanding range from question answering to automated reasoning.

Source: Find a Reasonable Ending for Stories: Does Logic Relation Help the Story Cloze Test?

Papers

Showing 751800 of 1978 papers

TitleStatusHype
ReFusion: Improving Natural Language Understanding with Computation-Efficient Retrieval Representation FusionCode0
Task Oriented Dialogue as a Catalyst for Self-Supervised Automatic Speech RecognitionCode0
PerSHOP -- A Persian dataset for shopping dialogue systems modeling0
Supervised Knowledge Makes Large Language Models Better In-context LearnersCode0
KnowledgeNavigator: Leveraging Large Language Models for Enhanced Reasoning over Knowledge Graph0
PersianLLaMA: Towards Building First Persian Large Language Model0
Imitation of Life: A Search Engine for Biologically Inspired DesignCode0
DSFormer: Effective Compression of Text-Transformers by Dense-Sparse Weight Factorization0
TESS: A Multi-intent Parser for Conversational Multi-Agent Systems with Decentralized Natural Language Understanding Models0
ConsistentEE: A Consistent and Hardness-Guided Early Exiting Method for Accelerating Language Models InferenceCode0
Bengali Intent Classification with Generative Adversarial BERTCode0
Continuous Prompt Generation from Linear Combination of Discrete Prompt EmbeddingsCode0
LLaMAntino: LLaMA 2 Models for Effective Text Generation in Italian Language0
RDR: the Recap, Deliberate, and Respond Method for Enhanced Language Understanding0
Labels Need Prompts Too: Mask Matching for Natural Language Understanding Tasks0
PROPRES: Investigating the Projectivity of Presupposition with Various Triggers and EnvironmentsCode0
Rethinking the Instruction Quality: LIFT is What You Need0
Building Domain-Specific LLMs Faithful To The Islamic Worldview: Mirage or Technical Possibility?0
IEKG: A Commonsense Knowledge Graph for Idiomatic ExpressionsCode0
Retrieval-based Video Language Model for Efficient Long Video Question Answering0
A Study on the Calibration of In-context LearningCode0
Improving Bias Mitigation through Bias Experts in Natural Language UnderstandingCode0
Visually Grounded Language Learning: a review of language games, datasets, tasks, and models0
A Survey on Large Language Model (LLM) Security and Privacy: The Good, the Bad, and the Ugly0
NLEBench+NorGLM: A Comprehensive Empirical Analysis and Benchmark Dataset for Generative Language Models in NorwegianCode0
From Beginner to Expert: Modeling Medical Knowledge into General LLMs0
Self Generated Wargame AI: Double Layer Agent Task Planning Based on Large Language Model0
Summarization-based Data Augmentation for Document ClassificationCode0
Exploring the Robustness of Model-Graded Evaluations and Automated Interpretability0
Explore the Potential of LLMs in Misinformation Detection: An Empirical Study0
MultiLoRA: Democratizing LoRA for Better Multi-Task Learning0
SQATIN: Supervised Instruction Tuning Meets Question Answering for Improved Dialogue NLUCode0
You don't need a personality test to know these models are unreliable: Assessing the Reliability of Large Language Models on Psychometric InstrumentsCode0
Effective Large Language Model Adaptation for Improved Grounding and Citation Generation0
On the Calibration of Multilingual Question Answering LLMs0
Fusion-Eval: Integrating Assistant Evaluators with LLMs0
Explanation-aware Soft Ensemble Empowers Large Language Model In-context Learning0
Learning Knowledge-Enhanced Contextual Language Representations for Domain Natural Language Understanding0
DACBERT: Leveraging Dependency Agreement for Cost-Efficient Bert Pretraining0
RankAug: Augmented data ranking for text classification0
Relation Extraction Model Based on Semantic Enhancement Mechanism0
A Systematic Review of Deep Graph Neural Networks: Challenges, Classification, Architectures, Applications & Potential Utility in Bioinformatics0
MARRS: Multimodal Reference Resolution System0
IBADR: an Iterative Bias-Aware Dataset Refinement Framework for Debiasing NLU models0
Dense Retrieval as Indirect Supervision for Large-space Decision MakingCode0
TLM: Token-Level Masking for TransformersCode0
Large-scale Foundation Models and Generative AI for BigData Neuroscience0
Evaluation of large language models using an Indian language LGBTI+ lexicon0
Meaning and understanding in large language models0
Back Transcription as a Method for Evaluating Robustness of Natural Language Understanding Models to Speech Recognition ErrorsCode0
Show:102550
← PrevPage 16 of 40Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1HNNAccuracy90Unverified
2BERT-large 340MAccuracy78.3Unverified
3UDSSM-II (ensemble)Accuracy78.3Unverified
4UDSSM-I (ensemble)Accuracy76.7Unverified
5DSSMAccuracy75Unverified
6UDSSM-IIAccuracy75Unverified
7BERT-base 110M + MASAccuracy68.3Unverified
8USSM + Supervised Deepnet + 3 Knowledge BasesAccuracy66.7Unverified
9Word-level CNN+LSTM (full scoring)Accuracy60Unverified
10Subword-level Transformer LMAccuracy58.3Unverified
#ModelMetricClaimedVerifiedStatus
1BERT (pred POS/lemmas)Tags (Full) Acc82.5Unverified
2BERT (none)Tags (Full) Acc82Unverified
3BERT (gold POS/lemmas)Tags (Full) Acc81Unverified
4GloVe (gold POS/lemmas)Tags (Full) Acc79.3Unverified
5RoBERTa + LinearFull F1 (Preps)78.2Unverified
6GloVe (none)Tags (Full) Acc77.5Unverified
7GloVe (pred POS/lemmas)Tags (Full) Acc77.1Unverified
8SVM (feature-rich, gold syntax)Role F1 (Preps)62.2Unverified
9BiLSTM + MLP (gold syntax)Role F1 (Preps)62.2Unverified
10SVM (feature-rich, auto syntax)Role F1 (Preps)58.2Unverified
#ModelMetricClaimedVerifiedStatus
1CaseLaw-BERTCaseHOLD75.6Unverified
2Legal-BERTCaseHOLD75.1Unverified
3DeBERTaCaseHOLD72.1Unverified
4LongformerCaseHOLD72Unverified
5RoBERTaCaseHOLD71.7Unverified
6BERTCaseHOLD70.7Unverified
7BigBirdCaseHOLD70.4Unverified
#ModelMetricClaimedVerifiedStatus
1ConvBERT-DGAverage74.6Unverified
2ConvBERT-DG + Pre + MultiAverage73.8Unverified
3mslmAverage73.49Unverified
4ConvBERT + Pre + MultiAverage68.22Unverified
5BanLanGenAverage39.16Unverified
#ModelMetricClaimedVerifiedStatus
1ConvBERT + Pre + MultiAverage86.89Unverified
2mslmAverage85.83Unverified
3ConvBERT-DG + Pre + MultiAverage85.34Unverified
#ModelMetricClaimedVerifiedStatus
1MT-DNN-SMARTAverage89.9Unverified
2BERT-LARGEAverage82.1Unverified