SOTAVerified

Natural Language Understanding

Natural Language Understanding is an important field of Natural Language Processing which contains various tasks such as text classification, natural language inference and story comprehension. Applications enabled by natural language understanding range from question answering to automated reasoning.

Source: Find a Reasonable Ending for Stories: Does Logic Relation Help the Story Cloze Test?

Papers

Showing 16011650 of 1978 papers

TitleStatusHype
Fast and Scalable Expansion of Natural Language Understanding Functionality for Intelligent Agents0
FedEAT: A Robustness Optimization Framework for Federated LLMs0
Federated Learning for Emoji Prediction in a Mobile Keyboard0
Federated Self-Learning with Weak Supervision for Speech Recognition0
FeelsGoodMan: Inferring Semantics of Twitch Neologisms0
FeelsGoodMan: Inferring Semantics of Twitch Neologisms0
FewNLU: Benchmarking State-of-the-Art Methods for Few-Shot Natural Language Understanding0
Few-shot Intent Classification and Slot Filling with Retrieved Examples0
Few-shot Multimodal Multitask Multilingual Learning0
Few-Shot NLU with Vector Projection Distance and Abstract Triangular CRF0
Find a Reasonable Ending for Stories: Does Logic Relation Help the Story Cloze Test?0
Fine-Tuned Transformers Show Clusters of Similar Representations Across Layers0
Fine-Tuned Transformers Show Clusters of Similar Representations Across Layers0
Fine-tuning BERT for Low-Resource Natural Language Understanding via Active Learning0
Fine-Tuning BERT for Schema-Guided Zero-Shot Dialogue State Tracking0
Fine-tuning Transformer-based Encoder for Turkish Language Understanding Tasks0
First Train to Generate, then Generate to Train: UnitedSynT5 for Few-Shot NLI0
FitChat: Conversational Artificial Intelligence Interventions for Encouraging Physical Activity in Older Adults0
FlipDA: Effective and Robust Data Augmentation for Few-Shot Learning0
FltLM: An Intergrated Long-Context Large Language Model for Effective Context Filtering and Understanding0
Fooling Vision and Language Models Despite Localization and Attention Mechanism0
Forming Trees with Treeformers0
FrameNet-like Annotation of Olfactory Information in Texts0
From Audio to Semantics: Approaches to end-to-end spoken language understanding0
From Beginner to Expert: Modeling Medical Knowledge into General LLMs0
From Characters to Words: Hierarchical Pre-trained Language Model for Open-vocabulary Language Understanding0
From Spatial Relations to Spatial Configurations0
From text to talk: Harnessing conversational corpora for humane and diversity-aware language technology0
From Universal Language Model to Downstream Task: Improving RoBERTa-Based Vietnamese Hate Speech Detection0
From Virtual to Real: A Framework for Verbal Interaction with Robots0
Fully Unsupervised Crosslingual Semantic Textual Similarity Metric Based on BERT for Identifying Parallel Data0
Fuse and Adapt: Investigating the Use of Pre-Trained Self-Supervising Learning Models in Limited Data NLU problems0
Fusion-Eval: Integrating Assistant Evaluators with LLMs0
Generalized Multiple Intent Conditioned Slot Filling0
Generating Synthetic Data for Task-Oriented Semantic Parsing with Hierarchical Representations0
Generation-Distillation for Efficient Natural Language Understanding in Low-Data Settings0
Generative Adversarial Networks for Annotated Data Augmentation in Data Sparse NLU0
GeoGLUE: A GeoGraphic Language Understanding Evaluation Benchmark0
GeoReasoner: Reasoning On Geospatially Grounded Context For Natural Language Understanding0
Get Your Model Puzzled: Introducing Crossword-Solving as a New NLP Benchmark0
GF + MMT = GLF -- From Language to Semantics through LF0
GLM: General Language Model Pretraining with Autoregressive Blank Infilling0
Goal-Oriented Chatbot Dialog Management Bootstrapping with Transfer Learning0
GOBO: Quantizing Attention-Based NLP Models for Low Latency and Energy Efficient Inference0
GPT-4 as an Agronomist Assistant? Answering Agriculture Exams Using Large Language Models0
GPTAraEval: A Comprehensive Evaluation of ChatGPT on Arabic NLP0
GPT Semantic Cache: Reducing LLM Costs and Latency via Semantic Embedding Caching0
GradTS: A Gradient-Based Automatic Auxiliary Task Selection Method Based on Transformer Networks0
Graph-Based Semi-Supervised Learning for Natural Language Understanding0
Graph Enhanced Cross-Domain Text-to-SQL Generation0
Show:102550
← PrevPage 33 of 40Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1HNNAccuracy90Unverified
2UDSSM-II (ensemble)Accuracy78.3Unverified
3BERT-large 340MAccuracy78.3Unverified
4UDSSM-I (ensemble)Accuracy76.7Unverified
5DSSMAccuracy75Unverified
6UDSSM-IIAccuracy75Unverified
7BERT-base 110M + MASAccuracy68.3Unverified
8USSM + Supervised Deepnet + 3 Knowledge BasesAccuracy66.7Unverified
9Word-level CNN+LSTM (full scoring)Accuracy60Unverified
10Subword-level Transformer LMAccuracy58.3Unverified
#ModelMetricClaimedVerifiedStatus
1BERT (pred POS/lemmas)Tags (Full) Acc82.5Unverified
2BERT (none)Tags (Full) Acc82Unverified
3BERT (gold POS/lemmas)Tags (Full) Acc81Unverified
4GloVe (gold POS/lemmas)Tags (Full) Acc79.3Unverified
5RoBERTa + LinearFull F1 (Preps)78.2Unverified
6GloVe (none)Tags (Full) Acc77.5Unverified
7GloVe (pred POS/lemmas)Tags (Full) Acc77.1Unverified
8SVM (feature-rich, gold syntax)Role F1 (Preps)62.2Unverified
9BiLSTM + MLP (gold syntax)Role F1 (Preps)62.2Unverified
10SVM (feature-rich, auto syntax)Role F1 (Preps)58.2Unverified
#ModelMetricClaimedVerifiedStatus
1CaseLaw-BERTCaseHOLD75.6Unverified
2Legal-BERTCaseHOLD75.1Unverified
3DeBERTaCaseHOLD72.1Unverified
4LongformerCaseHOLD72Unverified
5RoBERTaCaseHOLD71.7Unverified
6BERTCaseHOLD70.7Unverified
7BigBirdCaseHOLD70.4Unverified
#ModelMetricClaimedVerifiedStatus
1ConvBERT-DGAverage74.6Unverified
2ConvBERT-DG + Pre + MultiAverage73.8Unverified
3mslmAverage73.49Unverified
4ConvBERT + Pre + MultiAverage68.22Unverified
5BanLanGenAverage39.16Unverified
#ModelMetricClaimedVerifiedStatus
1ConvBERT + Pre + MultiAverage86.89Unverified
2mslmAverage85.83Unverified
3ConvBERT-DG + Pre + MultiAverage85.34Unverified
#ModelMetricClaimedVerifiedStatus
1MT-DNN-SMARTAverage89.9Unverified
2BERT-LARGEAverage82.1Unverified