SOTAVerified

Language Modeling

Papers

Showing 24012450 of 14182 papers

TitleStatusHype
AD-DROP: Attribution-Driven Dropout for Robust Language Model Fine-TuningCode1
A context-aware knowledge transferring strategy for CTC-based ASRCode1
MAP: Multimodal Uncertainty-Aware Vision-Language Pre-training ModelCode1
Understanding the Failure of Batch Normalization for Transformers in NLPCode1
Mixture of Attention Heads: Selecting Attention Heads Per TokenCode1
A Kernel-Based View of Language Model Fine-TuningCode1
Cross-Align: Modeling Deep Cross-lingual Interactions for Word AlignmentCode1
Controllable Dialogue Simulation with In-Context LearningCode1
Learning Fine-Grained Visual Understanding for Video Question Answering via Decoupling Spatial-Temporal ModelingCode1
InfoCSE: Information-aggregated Contrastive Learning of Sentence EmbeddingsCode1
Guess the Instruction! Flipped Learning Makes Language Models Stronger Zero-Shot LearnersCode1
Bayesian Prompt Learning for Image-Language Model GeneralizationCode1
CCC-wav2vec 2.0: Clustering aided Cross Contrastive Self-supervised learning of speech representationsCode1
Less is More: Task-aware Layer-wise Distillation for Language Model CompressionCode1
Knowledge Unlearning for Mitigating Privacy Risks in Language ModelsCode1
Towards Improving Faithfulness in Abstractive SummarizationCode1
The Surprising Computational Power of Nondeterministic Stack RNNsCode1
SpeechCLIP: Integrating Speech with Pre-Trained Vision and Language ModelCode1
ContraCLM: Contrastive Learning For Causal Language ModelCode1
Zemi: Learning Zero-Shot Semi-Parametric Language Models from Multiple TasksCode1
Event Causality Identification via Derivative Prompt Joint LearningCode1
BECEL: Benchmark for Consistency Evaluation of Language ModelsCode1
DocQueryNet: Value Retrieval with Arbitrary Queries for Form-like DocumentsCode1
polyBERT: A chemical language model to enable fully machine-driven ultrafast polymer informaticsCode1
A general-purpose material property data extraction pipeline from large polymer corpora using Natural Language ProcessingCode1
Automatic Label Sequence Generation for Prompting Sequence-to-sequence ModelsCode1
Probabilistic Generative Transformer Language models for Generative Design of MoleculesCode1
The Whole Truth and Nothing But the Truth: Faithful and Controllable Dialogue Response Generation with Dataflow Transduction and Constrained DecodingCode1
Cold-Start Data Selection for Few-shot Language Model Fine-tuning: A Prompt-Based Uncertainty Propagation ApproachCode1
TwHIN-BERT: A Socially-Enriched Pre-trained Language Model for Multilingual Tweet Representations at TwitterCode1
Non-autoregressive Error Correction for CTC-based ASR with Phone-conditioned Masked LMCode1
ASR2K: Speech Recognition for Around 2000 Languages without AudioCode1
TransPolymer: a Transformer-based language model for polymer property predictionsCode1
FOLIO: Natural Language Reasoning with First-Order LogicCode1
LexMAE: Lexicon-Bottlenecked Pretraining for Large-Scale RetrievalCode1
Learning from Unlabeled 3D Environments for Vision-and-Language NavigationCode1
Interpreting Song Lyrics with an Audio-Informed Pre-trained Language ModelCode1
Prompting as Probing: Using Language Models for Knowledge Base ConstructionCode1
Using Large Language Models to Simulate Multiple Humans and Replicate Human Subject StudiesCode1
CoditT5: Pretraining for Source Code and Natural Language EditingCode1
Controlling Perceived Emotion in Symbolic Music Generation with Monte Carlo Tree SearchCode1
GRIT-VLP: Grouped Mini-batch Sampling for Efficient Vision and Language Pre-trainingCode1
Composable Text Controls in Latent Space with ODEsCode1
Aggretriever: A Simple Approach to Aggregate Textual Representations for Robust Dense Passage RetrievalCode1
Contextual Information and Commonsense Based Prompt for Emotion Recognition in ConversationCode1
Training Effective Neural Sentence Encoders from Automatically Mined ParaphrasesCode1
Improving Mandarin Speech Recogntion with Block-augmented TransformerCode1
Zero-Shot Video Captioning with Evolving Pseudo-TokensCode1
Unsupervised pre-training of graph transformers on patient population graphsCode1
Label2Label: A Language Modeling Framework for Multi-Attribute LearningCode1
Show:102550
← PrevPage 49 of 284Next →

No leaderboard results yet.