SOTAVerified

Masked Language Modeling

Papers

Showing 125 of 475 papers

TitleStatusHype
GigaAM: Efficient Self-Supervised Learner for Speech RecognitionCode4
Simple and Effective Masked Diffusion Language ModelsCode4
Towards No.1 in CLUE Semantic Matching Challenge: Pre-trained Language Model Erlangshen with Propensity-Corrected LossCode4
GLIPv2: Unifying Localization and Vision-Language UnderstandingCode4
Long-VITA: Scaling Large Multi-modal Models to 1 Million Tokens with Leading Short-Context AccurayCode3
Cramming: Training a Language Model on a Single GPU in One DayCode3
W2v-BERT: Combining Contrastive Learning and Masked Language Modeling for Self-Supervised Speech Pre-TrainingCode3
ERNIE-Gram: Pre-Training with Explicitly N-Gram Masked Language Modeling for Natural Language UnderstandingCode3
BMFM-RNA: An Open Framework for Building and Evaluating Transcriptomic Foundation ModelsCode2
GPT or BERT: why not both?Code2
MosaicBERT: A Bidirectional Encoder Optimized for Fast PretrainingCode2
Cross-Modal Implicit Relation Reasoning and Aligning for Text-to-Image Person RetrievalCode2
Retrieval Oriented Masking Pre-training Language Model for Dense Passage RetrievalCode2
Deep Bidirectional Language-Knowledge Graph PretrainingCode2
RetroMAE: Pre-Training Retrieval-oriented Language Models Via Masked Auto-EncoderCode2
LinkBERT: Pretraining Language Models with Document LinksCode2
MPNet: Masked and Permuted Pre-training for Language UnderstandingCode2
Self-Supervised Log ParsingCode2
Diffusion Sequence Models for Enhanced Protein Representation and GenerationCode1
CreoPep: A Universal Deep Learning Framework for Target-Specific Peptide Design and OptimizationCode1
Long-context Protein Language Modeling Using Bidirectional Mamba with Shared Projection LayersCode1
DomURLs_BERT: Pre-trained BERT-based Model for Malicious Domains and URLs Detection and ClassificationCode1
AutoScale: Scale-Aware Data Mixing for Pre-Training LLMsCode1
Retrieval-style In-Context Learning for Few-shot Hierarchical Text ClassificationCode1
VCR: A Task for Pixel-Level Complex Reasoning in Vision Language Models via Restoring Occluded TextCode1
Show:102550
← PrevPage 1 of 19Next →

No leaderboard results yet.