SOTAVerified

Natural Language Understanding

Natural language understanding (NLU) is a subfield of natural language processing that covers tasks such as text classification, natural language inference, and story comprehension. Its applications range from question answering to automated reasoning.

Source: Find a Reasonable Ending for Stories: Does Logic Relation Help the Story Cloze Test?

Papers

Showing 401-450 of 1978 papers

| Title | Status | Hype |
| --- | --- | --- |
| Beyond Direct Diagnosis: LLM-based Multi-Specialist Agent Consultation for Automatic Diagnosis | | 0 |
| MaLLaM -- Malaysia Large Language Model | | 0 |
| Large Language Model Adaptation for Financial Sentiment Analysis | | 0 |
| TrICy: Trigger-guided Data-to-text Generation with Intent aware Attention-Copy | | 0 |
| Looking Right is Sometimes Right: Investigating the Capabilities of Decoder-only LLMs for Sequence Labeling | | 0 |
| Semantic Sensitivities and Inconsistent Predictions: Measuring the Fragility of NLI Models | Code | 0 |
| TURNA: A Turkish Encoder-Decoder Language Model for Enhanced Understanding and Generation | Code | 2 |
| How Can Large Language Models Understand Spatial-Temporal Data? | | 0 |
| Enhancing Image Retrieval: A Comprehensive Study on Photo Search using the CLIP Mode | | 0 |
| Synergizing Machine Learning & Symbolic Methods: A Survey on Hybrid Approaches to Natural Language Processing | | 0 |
| Temporal Blind Spots in Large Language Models | Code | 0 |
| Learning Shortcuts: On the Misleading Promise of NLU in Language Models | | 0 |
| Machine Translation with Large Language Models: Prompt Engineering for Persian, English, and Russian Directions | | 0 |
| MARIO: MAth Reasoning with code Interpreter Output -- A Reproducible Pipeline | Code | 3 |
| Transcending Controlled Environments: Assessing the Transferability of ASR-Robust NLU Models to Real-World Applications | | 0 |
| Natural Language Processing for Dialects of a Language: A Survey | | 0 |
| We Need to Talk About Classification Evaluation Metrics in NLP | | 0 |
| Enhancing Essay Scoring with Adversarial Weights Perturbation and Metric-specific AttentionPooling | | 0 |
| Towards ASR Robust Spoken Language Understanding Through In-Context Learning With Word Confusion Networks | | 0 |
| ReFusion: Improving Natural Language Understanding with Computation-Efficient Retrieval Representation Fusion | Code | 0 |
| Task Oriented Dialogue as a Catalyst for Self-Supervised Automatic Speech Recognition | Code | 0 |
| Resource-Efficient Transformer Pruning for Finetuning of Large Models | Code | 1 |
| PerSHOP -- A Persian dataset for shopping dialogue systems modeling | | 0 |
| Unified-IO 2: Scaling Autoregressive Multimodal Models with Vision, Language, Audio, and Action | Code | 2 |
| KnowledgeNavigator: Leveraging Large Language Models for Enhanced Reasoning over Knowledge Graph | | 0 |
| Supervised Knowledge Makes Large Language Models Better In-context Learners | Code | 0 |
| PersianLLaMA: Towards Building First Persian Large Language Model | | 0 |
| Structured Probabilistic Coding | Code | 1 |
| DSFormer: Effective Compression of Text-Transformers by Dense-Sparse Weight Factorization | | 0 |
| Imitation of Life: A Search Engine for Biologically Inspired Design | Code | 0 |
| TESS: A Multi-intent Parser for Conversational Multi-Agent Systems with Decentralized Natural Language Understanding Models | | 0 |
| ConsistentEE: A Consistent and Hardness-Guided Early Exiting Method for Accelerating Language Models Inference | Code | 0 |
| Bengali Intent Classification with Generative Adversarial BERT | Code | 0 |
| Continuous Prompt Generation from Linear Combination of Discrete Prompt Embeddings | Code | 0 |
| LLaMAntino: LLaMA 2 Models for Effective Text Generation in Italian Language | | 0 |
| RDR: the Recap, Deliberate, and Respond Method for Enhanced Language Understanding | | 0 |
| Labels Need Prompts Too: Mask Matching for Natural Language Understanding Tasks | | 0 |
| PROPRES: Investigating the Projectivity of Presupposition with Various Triggers and Environments | Code | 0 |
| BiPFT: Binary Pre-trained Foundation Transformer with Low-rank Estimation of Binarization Residual Polynomials | Code | 1 |
| Rethinking the Instruction Quality: LIFT is What You Need | | 0 |
| IEKG: A Commonsense Knowledge Graph for Idiomatic Expressions | Code | 0 |
| Building Domain-Specific LLMs Faithful To The Islamic Worldview: Mirage or Technical Possibility? | | 0 |
| Retrieval-based Video Language Model for Efficient Long Video Question Answering | | 0 |
| A Study on the Calibration of In-context Learning | Code | 0 |
| Efficient Large Language Models: A Survey | Code | 3 |
| Improving Bias Mitigation through Bias Experts in Natural Language Understanding | Code | 0 |
| Visually Grounded Language Learning: a review of language games, datasets, tasks, and models | | 0 |
| A Survey on Large Language Model (LLM) Security and Privacy: The Good, the Bad, and the Ugly | | 0 |
| NLEBench+NorGLM: A Comprehensive Empirical Analysis and Benchmark Dataset for Generative Language Models in Norwegian | Code | 0 |
| From Beginner to Expert: Modeling Medical Knowledge into General LLMs | | 0 |

Benchmark Results

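| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | HNN | Accuracy | 90 | | Unverified |
| 2 | UDSSM-II (ensemble) | Accuracy | 78.3 | | Unverified |
| 3 | BERT-large 340M | Accuracy | 78.3 | | Unverified |
| 4 | UDSSM-I (ensemble) | Accuracy | 76.7 | | Unverified |
| 5 | DSSM | Accuracy | 75 | | Unverified |
| 6 | UDSSM-II | Accuracy | 75 | | Unverified |
| 7 | BERT-base 110M + MAS | Accuracy | 68.3 | | Unverified |
| 8 | USSM + Supervised Deepnet + 3 Knowledge Bases | Accuracy | 66.7 | | Unverified |
| 9 | Word-level CNN+LSTM (full scoring) | Accuracy | 60 | | Unverified |
| 10 | Subword-level Transformer LM | Accuracy | 58.3 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | BERT (pred POS/lemmas) | Tags (Full) Acc | 82.5 | | Unverified |
| 2 | BERT (none) | Tags (Full) Acc | 82 | | Unverified |
| 3 | BERT (gold POS/lemmas) | Tags (Full) Acc | 81 | | Unverified |
| 4 | GloVe (gold POS/lemmas) | Tags (Full) Acc | 79.3 | | Unverified |
| 5 | RoBERTa + Linear | Full F1 (Preps) | 78.2 | | Unverified |
| 6 | GloVe (none) | Tags (Full) Acc | 77.5 | | Unverified |
| 7 | GloVe (pred POS/lemmas) | Tags (Full) Acc | 77.1 | | Unverified |
| 8 | SVM (feature-rich, gold syntax) | Role F1 (Preps) | 62.2 | | Unverified |
| 9 | BiLSTM + MLP (gold syntax) | Role F1 (Preps) | 62.2 | | Unverified |
| 10 | SVM (feature-rich, auto syntax) | Role F1 (Preps) | 58.2 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | CaseLaw-BERT | CaseHOLD | 75.6 | | Unverified |
| 2 | Legal-BERT | CaseHOLD | 75.1 | | Unverified |
| 3 | DeBERTa | CaseHOLD | 72.1 | | Unverified |
| 4 | Longformer | CaseHOLD | 72 | | Unverified |
| 5 | RoBERTa | CaseHOLD | 71.7 | | Unverified |
| 6 | BERT | CaseHOLD | 70.7 | | Unverified |
| 7 | BigBird | CaseHOLD | 70.4 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | ConvBERT-DG | Average | 74.6 | | Unverified |
| 2 | ConvBERT-DG + Pre + Multi | Average | 73.8 | | Unverified |
| 3 | mslm | Average | 73.49 | | Unverified |
| 4 | ConvBERT + Pre + Multi | Average | 68.22 | | Unverified |
| 5 | BanLanGen | Average | 39.16 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | ConvBERT + Pre + Multi | Average | 86.89 | | Unverified |
| 2 | mslm | Average | 85.83 | | Unverified |
| 3 | ConvBERT-DG + Pre + Multi | Average | 85.34 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | MT-DNN-SMART | Average | 89.9 | | Unverified |
| 2 | BERT-LARGE | Average | 82.1 | | Unverified |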