| Discovering Language Model Behaviors with Model-Written Evaluations | Dec 19, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| Reasoning with Language Model Prompting: A Survey | Dec 19, 2022 | Arithmetic ReasoningCommon Sense Reasoning | CodeCode Available | 3 |
| Prompting Is Programming: A Query Language for Large Language Models | Dec 12, 2022 | Code GenerationLanguage Modeling | CodeCode Available | 3 |
| Human-level play in the game of Diplomacy by combining language models with strategic reasoning | Nov 22, 2022 | AI AgentLanguage Modeling | CodeCode Available | 3 |
| What Language Model to Train if You Have One Million GPU Hours? | Oct 27, 2022 | GPULanguage Modeling | CodeCode Available | 3 |
| Diffusion-LM Improves Controllable Text Generation | May 27, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| A Systematic Evaluation of Large Language Models of Code | Feb 26, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| Using DeepSpeed and Megatron to Train Megatron-Turing NLG 530B, A Large-Scale Generative Language Model | Jan 28, 2022 | Few-Shot LearningLanguage Modeling | CodeCode Available | 3 |
| Datasheet for the Pile | Jan 13, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| 8-bit Optimizers via Block-wise Quantization | Oct 6, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 3 |