SOTAVerified

Language Modeling

Papers

Showing 20212030 of 14182 papers

TitleStatusHype
GateLoop: Fully Data-Controlled Linear Recurrence for Sequence ModelingCode1
CAB: Comprehensive Attention Benchmarking on Long Sequence ModelingCode1
IndoBERTweet: A Pretrained Language Model for Indonesian Twitter with Effective Domain-Specific Vocabulary InitializationCode1
Inference-Time Policy Adapters (IPA): Tailoring Extreme-Scale LMs without Fine-tuningCode1
Cached Transformers: Improving Transformers with Differentiable Memory CacheCode1
Atla Selene Mini: A General Purpose Evaluation ModelCode1
Incorporating Clinical Guidelines through Adapting Multi-modal Large Language Model for Prostate Cancer PI-RADS ScoringCode1
CPT: A Pre-Trained Unbalanced Transformer for Both Chinese Language Understanding and GenerationCode1
CPT: Efficient Deep Neural Network Training via Cyclic PrecisionCode1
Incorporating External POS Tagger for Punctuation RestorationCode1
Show:102550
← PrevPage 203 of 1419Next →

No leaderboard results yet.