SOTAVerified

Language Modeling

Papers

Showing 10261050 of 14182 papers

TitleStatusHype
P-Tuning v2: Prompt Tuning Can Be Comparable to Fine-tuning Universally Across Scales and TasksCode2
Deduplicating Training Data Makes Language Models BetterCode2
FastMoE: A Fast Mixture-of-Expert Training SystemCode2
GPT Understands, TooCode2
When Attention Meets Fast Recurrence: Training Language Models with Reduced ComputeCode2
Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNetCode2
The Pile: An 800GB Dataset of Diverse Text for Language ModelingCode2
Automatically Identifying Words That Can Serve as Labels for Few-Shot Text ClassificationCode2
AdaBelief Optimizer: Adapting Stepsizes by the Belief in Observed GradientsCode2
Mirostat: A Neural Text Decoding Algorithm that Directly Controls PerplexityCode2
Simplifying Paragraph-level Question Generation via Transformer Language ModelsCode2
MPNet: Masked and Permuted Pre-training for Language UnderstandingCode2
BAE: BERT-based Adversarial Examples for Text ClassificationCode2
Self-Supervised Log ParsingCode2
CLUECorpus2020: A Large-scale Chinese Corpus for Pre-training Language ModelCode2
Megatron-LM: Training Multi-Billion Parameter Language Models Using Model ParallelismCode2
MASS: Masked Sequence to Sequence Pre-training for Language GenerationCode2
Knowledge Representation Learning: A Quantitative ReviewCode2
Training RNNs as Fast as CNNsCode2
Outrageously Large Neural Networks: The Sparsely-Gated Mixture-of-Experts LayerCode2
End-To-End Memory NetworksCode2
InstructFLIP: Exploring Unified Vision-Language Model for Face Anti-spoofingCode1
Describe Anything Model for Visual Question Answering on Text-rich ImagesCode1
Evaluating Morphological Alignment of Tokenizers in 70 LanguagesCode1
Differential MambaCode1
Show:102550
← PrevPage 42 of 568Next →

No leaderboard results yet.