| Federated Learning for ASR based on Wav2vec 2.0 | Feb 20, 2023 | Federated LearningLanguage Modeling | CodeCode Available | 1 |
| SwitchPrompt: Learning Domain-Specific Gated Soft Prompts for Classification in Low-Resource Domains | Feb 14, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Guiding Pretraining in Reinforcement Learning with Large Language Models | Feb 13, 2023 | Common Sense ReasoningLanguage Modeling | CodeCode Available | 1 |
| The Wisdom of Hindsight Makes Language Models Better Instruction Followers | Feb 10, 2023 | Decision MakingLanguage Modeling | CodeCode Available | 1 |
| In-Context Learning with Many Demonstration Examples | Feb 9, 2023 | 16k8k | CodeCode Available | 1 |
| UDApter -- Efficient Domain Adaptation Using Adapters | Feb 7, 2023 | Domain AdaptationLanguage Modeling | CodeCode Available | 1 |
| Representation Deficiency in Masked Language Modeling | Feb 4, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| GLADIS: A General and Large Acronym Disambiguation Benchmark | Feb 3, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Bioformer: an efficient transformer language model for biomedical text mining | Feb 3, 2023 | ArticlesDocument Classification | CodeCode Available | 1 |
| Large Language Models Can Be Easily Distracted by Irrelevant Context | Jan 31, 2023 | Arithmetic ReasoningLanguage Modeling | CodeCode Available | 1 |
| Learning to Speak from Text: Zero-Shot Multilingual Text-to-Speech with Unsupervised Text Pretraining | Jan 30, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Call for Papers -- The BabyLM Challenge: Sample-efficient pretraining on a developmentally plausible corpus | Jan 27, 2023 | Language AcquisitionLanguage Modeling | CodeCode Available | 1 |
| SWARM Parallelism: Training Large Models Can Be Surprisingly Communication-Efficient | Jan 27, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Large Language Models Are Latent Variable Models: Explaining and Finding Good Demonstrations for In-Context Learning | Jan 27, 2023 | Few-Shot LearningGSM8K | CodeCode Available | 1 |
| Prompt-Based Editing for Text Style Transfer | Jan 27, 2023 | ClassificationLanguage Modeling | CodeCode Available | 1 |
| Domain-Agnostic Molecular Generation with Chemical Feedback | Jan 26, 2023 | Drug DesignLanguage Modeling | CodeCode Available | 1 |
| GPU-based Private Information Retrieval for On-Device Machine Learning Inference | Jan 26, 2023 | CPUGPU | CodeCode Available | 1 |
| ViDeBERTa: A powerful pre-trained language model for Vietnamese | Jan 25, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| ExaRanker: Explanation-Augmented Neural Ranker | Jan 25, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Lexi: Self-Supervised Learning of the UI Language | Jan 23, 2023 | Image RetrievalLanguage Modeling | CodeCode Available | 1 |
| DiffSDS: A language diffusion model for protein backbone inpainting under geometric conditions and constraints | Jan 22, 2023 | DecoderDenoising | CodeCode Available | 1 |
| An Empirical Study of Metrics to Measure Representational Harms in Pre-Trained Language Models | Jan 22, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Debiasing the Cloze Task in Sequential Recommendation with Bidirectional Transformers | Jan 22, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Batch Prompting: Efficient Inference with Large Language Model APIs | Jan 19, 2023 | Arithmetic ReasoningIn-Context Learning | CodeCode Available | 1 |
| CLIP the Gap: A Single Domain Generalization Approach for Object Detection | Jan 13, 2023 | Domain Generalizationimage-classification | CodeCode Available | 1 |