| UDApter -- Efficient Domain Adaptation Using Adapters | Feb 7, 2023 | Domain AdaptationLanguage Modeling | CodeCode Available | 1 |
| Representation Deficiency in Masked Language Modeling | Feb 4, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| GLADIS: A General and Large Acronym Disambiguation Benchmark | Feb 3, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Bioformer: an efficient transformer language model for biomedical text mining | Feb 3, 2023 | ArticlesDocument Classification | CodeCode Available | 1 |
| Large Language Models Can Be Easily Distracted by Irrelevant Context | Jan 31, 2023 | Arithmetic ReasoningLanguage Modeling | CodeCode Available | 1 |
| Learning to Speak from Text: Zero-Shot Multilingual Text-to-Speech with Unsupervised Text Pretraining | Jan 30, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Call for Papers -- The BabyLM Challenge: Sample-efficient pretraining on a developmentally plausible corpus | Jan 27, 2023 | Language AcquisitionLanguage Modeling | CodeCode Available | 1 |
| SWARM Parallelism: Training Large Models Can Be Surprisingly Communication-Efficient | Jan 27, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Prompt-Based Editing for Text Style Transfer | Jan 27, 2023 | ClassificationLanguage Modeling | CodeCode Available | 1 |
| Large Language Models Are Latent Variable Models: Explaining and Finding Good Demonstrations for In-Context Learning | Jan 27, 2023 | Few-Shot LearningGSM8K | CodeCode Available | 1 |