SOTAVerified

Language Modeling

Papers

Showing 1050110550 of 14182 papers

TitleStatusHype
DEEPAGÉ: Answering Questions in Portuguese about the Brazilian EnvironmentCode0
Automatic Learning of Subword Dependent Model Scales0
NormFormer: Improved Transformer Pretraining with Extra Normalization0
Training Deep Neural Networks with Adaptive Momentum Inspired by the Quadratic OptimizationCode1
Reminding the Incremental Language Model via Data-Free Self-Distillation0
GNN-LM: Language Modeling based on Global Contexts via GNNCode1
A Novel Metric for Evaluating Semantics PreservationCode0
Echo-Attention: Attend Once and Get N Attentions for Free0
DEMix Layers: Disentangling Domains for Modular Language Modeling0
BitFit: Simple Parameter-efficient Fine-tuning for Transformer-based Masked Language-models0
Models In a Spelling Bee: Language Models Implicitly Learn the Character Composition of Tokens0
N-Shot Learning for Augmenting Task-Oriented Dialogue State Tracking0
xGQA: Cross-Lingual Visual Question Answering0
On the Complementarity of Data Selection and Fine Tuning for Domain Adaptation0
Prix-LM: Pretraining for Multilingual Knowledge Base ConstructionCode0
Sharpness-Aware Minimization Improves Language Model Generalization0
Multilingual unsupervised sequence segmentation transfers to extremely low-resource languagesCode0
Improving Transformers with Probabilistic Attention KeysCode1
An Empirical Survey of the Effectiveness of Debiasing Techniques for Pre-trained Language ModelsCode1
HRKD: Hierarchical Relational Knowledge Distillation for Cross-domain Language Model CompressionCode0
ASR4REAL: An extended benchmark for speech models0
A Good Prompt Is Worth Millions of Parameters: Low-resource Prompt-based Learning for Vision-Language ModelsCode1
Invariant Language ModelingCode1
Hydra: A System for Large Multi-Model Deep LearningCode1
Leveraging Knowledge in Multilingual Commonsense Reasoning0
A Multilingual Bag-of-Entities Model for Zero-Shot Cross-Lingual Text Classification0
DS-TOD: Efficient Domain Specialization for Task Oriented DialogCode0
Generated Knowledge Prompting for Commonsense ReasoningCode1
Coherence boosting: When your pretrained language model is not paying enough attentionCode1
Control Prefixes for Parameter-Efficient Text GenerationCode1
Kronecker Decomposition for GPT Compression0
mLUKE: The Power of Entity Representations in Multilingual Pretrained Language ModelsCode1
The World of an Octopus: How Reporting Bias Influences a Language Model's Perception of ColorCode0
Meta-learning via Language Model In-context TuningCode1
Tracing Origins: Coreference-aware Machine Reading ComprehensionCode1
Sparks: Inspiration for Science Writing using Language Models0
MIMICause: Representation and automatic extraction of causal relation types from clinical notes0
Spoken ObjectNet: A Bias-Controlled Spoken Caption DatasetCode0
Symbolic Knowledge Distillation: from General Language Models to Commonsense ModelsCode1
P-Tuning v2: Prompt Tuning Can Be Comparable to Fine-tuning Universally Across Scales and TasksCode2
UniPELT: A Unified Framework for Parameter-Efficient Language Model TuningCode1
Composable Sparse Fine-Tuning for Cross-Lingual TransferCode1
bert2BERT: Towards Reusable Pretrained Language Models0
Dict-BERT: Enhancing Language Model Pre-training with DictionaryCode0
On Language Model Integration for RNN Transducer based Speech Recognition0
Maximizing Efficiency of Language Model Pre-training for Learning Representation0
Deep Learning for Bias Detection: From Inception to Deployment0
Multi-Modal Pre-Training for Automated Speech Recognition0
Småprat: DialoGPT for Natural Language Generation of Swedish Dialogue by Transfer Learning0
Time Masking for Temporal Language ModelsCode1
Show:102550
← PrevPage 211 of 284Next →

No leaderboard results yet.