SOTAVerified

Language Modeling

Papers

Showing 401425 of 14182 papers

TitleStatusHype
Reasoning with Language Model Prompting: A SurveyCode3
Discovering Language Model Behaviors with Model-Written EvaluationsCode3
Prompting Is Programming: A Query Language for Large Language ModelsCode3
Human-level play in the game of Diplomacy by combining language models with strategic reasoningCode3
What Language Model to Train if You Have One Million GPU Hours?Code3
Diffusion-LM Improves Controllable Text GenerationCode3
A Systematic Evaluation of Large Language Models of CodeCode3
Using DeepSpeed and Megatron to Train Megatron-Turing NLG 530B, A Large-Scale Generative Language ModelCode3
Datasheet for the PileCode3
8-bit Optimizers via Block-wise QuantizationCode3
Finetuned Language Models Are Zero-Shot LearnersCode3
W2v-BERT: Combining Contrastive Learning and Masked Language Modeling for Self-Supervised Speech Pre-TrainingCode3
Evaluating Large Language Models Trained on CodeCode3
Multi-objective Asynchronous Successive HalvingCode3
GLM: General Language Model Pretraining with Autoregressive Blank InfillingCode3
Prefix-Tuning: Optimizing Continuous Prompts for GenerationCode3
PGL at TextGraphs 2020 Shared Task: Explanation Regeneration using Language and Graph Learning MethodsCode3
ERNIE-Gram: Pre-Training with Explicitly N-Gram Masked Language Modeling for Natural Language UnderstandingCode3
Language Models are Few-Shot LearnersCode3
Conformer: Convolution-augmented Transformer for Speech RecognitionCode3
Revisiting Pre-Trained Models for Chinese Natural Language ProcessingCode3
Longformer: The Long-Document TransformerCode3
Semi-Supervised Speech Recognition via Local Prior MatchingCode3
Universal Language Model Fine-tuning for Text ClassificationCode3
Order Matters: Sequence to sequence for setsCode3
Show:102550
← PrevPage 17 of 568Next →

No leaderboard results yet.