SOTAVerified

Natural Language Understanding

Natural Language Understanding is an important field of Natural Language Processing which contains various tasks such as text classification, natural language inference and story comprehension. Applications enabled by natural language understanding range from question answering to automated reasoning.

Source: Find a Reasonable Ending for Stories: Does Logic Relation Help the Story Cloze Test?

Papers

Showing 401425 of 1978 papers

TitleStatusHype
Beyond Direct Diagnosis: LLM-based Multi-Specialist Agent Consultation for Automatic Diagnosis0
Large Language Model Adaptation for Financial Sentiment Analysis0
MaLLaM -- Malaysia Large Language Model0
TrICy: Trigger-guided Data-to-text Generation with Intent aware Attention-Copy0
Looking Right is Sometimes Right: Investigating the Capabilities of Decoder-only LLMs for Sequence Labeling0
Semantic Sensitivities and Inconsistent Predictions: Measuring the Fragility of NLI ModelsCode0
How Can Large Language Models Understand Spatial-Temporal Data?0
TURNA: A Turkish Encoder-Decoder Language Model for Enhanced Understanding and GenerationCode2
Enhancing Image Retrieval : A Comprehensive Study on Photo Search using the CLIP Mode0
Temporal Blind Spots in Large Language ModelsCode0
Synergizing Machine Learning & Symbolic Methods: A Survey on Hybrid Approaches to Natural Language Processing0
Learning Shortcuts: On the Misleading Promise of NLU in Language Models0
Machine Translation with Large Language Models: Prompt Engineering for Persian, English, and Russian Directions0
MARIO: MAth Reasoning with code Interpreter Output -- A Reproducible PipelineCode3
Transcending Controlled Environments Assessing the Transferability of ASRRobust NLU Models to Real-World Applications0
Natural Language Processing for Dialects of a Language: A Survey0
We Need to Talk About Classification Evaluation Metrics in NLP0
Enhancing Essay Scoring with Adversarial Weights Perturbation and Metric-specific AttentionPooling0
Towards ASR Robust Spoken Language Understanding Through In-Context Learning With Word Confusion Networks0
ReFusion: Improving Natural Language Understanding with Computation-Efficient Retrieval Representation FusionCode0
Task Oriented Dialogue as a Catalyst for Self-Supervised Automatic Speech RecognitionCode0
Resource-Efficient Transformer Pruning for Finetuning of Large ModelsCode1
PerSHOP -- A Persian dataset for shopping dialogue systems modeling0
Unified-IO 2: Scaling Autoregressive Multimodal Models with Vision, Language, Audio, and ActionCode2
KnowledgeNavigator: Leveraging Large Language Models for Enhanced Reasoning over Knowledge Graph0
Show:102550
← PrevPage 17 of 80Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1HNNAccuracy90Unverified
2BERT-large 340MAccuracy78.3Unverified
3UDSSM-II (ensemble)Accuracy78.3Unverified
4UDSSM-I (ensemble)Accuracy76.7Unverified
5DSSMAccuracy75Unverified
6UDSSM-IIAccuracy75Unverified
7BERT-base 110M + MASAccuracy68.3Unverified
8USSM + Supervised Deepnet + 3 Knowledge BasesAccuracy66.7Unverified
9Word-level CNN+LSTM (full scoring)Accuracy60Unverified
10Subword-level Transformer LMAccuracy58.3Unverified
#ModelMetricClaimedVerifiedStatus
1BERT (pred POS/lemmas)Tags (Full) Acc82.5Unverified
2BERT (none)Tags (Full) Acc82Unverified
3BERT (gold POS/lemmas)Tags (Full) Acc81Unverified
4GloVe (gold POS/lemmas)Tags (Full) Acc79.3Unverified
5RoBERTa + LinearFull F1 (Preps)78.2Unverified
6GloVe (none)Tags (Full) Acc77.5Unverified
7GloVe (pred POS/lemmas)Tags (Full) Acc77.1Unverified
8SVM (feature-rich, gold syntax)Role F1 (Preps)62.2Unverified
9BiLSTM + MLP (gold syntax)Role F1 (Preps)62.2Unverified
10SVM (feature-rich, auto syntax)Role F1 (Preps)58.2Unverified
#ModelMetricClaimedVerifiedStatus
1CaseLaw-BERTCaseHOLD75.6Unverified
2Legal-BERTCaseHOLD75.1Unverified
3DeBERTaCaseHOLD72.1Unverified
4LongformerCaseHOLD72Unverified
5RoBERTaCaseHOLD71.7Unverified
6BERTCaseHOLD70.7Unverified
7BigBirdCaseHOLD70.4Unverified
#ModelMetricClaimedVerifiedStatus
1ConvBERT-DGAverage74.6Unverified
2ConvBERT-DG + Pre + MultiAverage73.8Unverified
3mslmAverage73.49Unverified
4ConvBERT + Pre + MultiAverage68.22Unverified
5BanLanGenAverage39.16Unverified
#ModelMetricClaimedVerifiedStatus
1ConvBERT + Pre + MultiAverage86.89Unverified
2mslmAverage85.83Unverified
3ConvBERT-DG + Pre + MultiAverage85.34Unverified
#ModelMetricClaimedVerifiedStatus
1MT-DNN-SMARTAverage89.9Unverified
2BERT-LARGEAverage82.1Unverified