| Latent State Models of Training Dynamics | Aug 18, 2023 | Image Classification | Code Available | 0 |
| RAVEN: In-Context Learning with Retrieval-Augmented Encoder-Decoder Language Models | Aug 15, 2023 | Decoder, In-Context Learning | Code Available | 0 |
| Pairing interacting protein sequences using masked language modeling | Aug 14, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| Stochastic positional embeddings improve masked image modeling | Jul 31, 2023 | Language Modelling, Masked Language Modeling | Code Available | 1 |
| GeneMask: Fast Pretraining of Gene Sequences to Enable Few-Shot Learning | Jul 29, 2023 | Few-Shot Learning, Language Modeling | Code Available | 0 |
| Gender-tuning: Empowering Fine-tuning for Debiasing Pre-trained Language Models | Jul 20, 2023 | Language Modeling, Language Modelling | Unverified | 0 |
| PASTA: Pretrained Action-State Transformer Agents | Jul 20, 2023 | Language Modeling, Language Modelling | Unverified | 0 |
| Improving the Reusability of Pre-trained Language Models in Real-world Applications | Jul 19, 2023 | Language Modeling, Language Modelling | Unverified | 0 |
| Improving BERT with Hybrid Pooling Network and Drop Mask | Jul 14, 2023 | Language Modeling, Language Modelling | Unverified | 0 |
| Masked Latent Semantic Modeling: an Efficient Pre-training Alternative to Masked Language Modeling | Jul 7, 2023 | Language Modeling, Language Modelling | Code Available | 0 |
| Biomedical Language Models are Robust to Sub-optimal Tokenization | Jun 30, 2023 | Entity Linking, Language Modeling | Code Available | 0 |
| S2SNet: A Pretrained Neural Network for Superconductivity Discovery | Jun 28, 2023 | Electrical Engineering, Language Modeling | Code Available | 0 |
| Solving Dialogue Grounding Embodied Task in a Simulated Environment using Further Masked Language Modeling | Jun 21, 2023 | Language Modeling, Language Modelling | Unverified | 0 |
| Investigating Masking-based Data Generation in Language Models | Jun 16, 2023 | Data Augmentation, Language Modeling | Unverified | 0 |
| Personalized Image Enhancement Featuring Masked Style Modeling | Jun 15, 2023 | Image Enhancement, Language Modeling | Code Available | 0 |
| Pushing the Limits of Unsupervised Unit Discovery for SSL Speech Representation | Jun 15, 2023 | Automatic Speech Recognition, Clustering | Code Available | 1 |
| Generate to Understand for Representation | Jun 14, 2023 | Contrastive Learning, GPU | Code Available | 1 |
| Recipes for Sequential Pre-training of Multilingual Encoder and Seq2Seq Models | Jun 14, 2023 | Decoder, Language Modeling | Unverified | 0 |
| Global and Local Semantic Completion Learning for Vision-Language Pre-training | Jun 12, 2023 | cross-modal alignment, Image-text Retrieval | Code Available | 1 |
| Absformer: Transformer-based Model for Unsupervised Multi-Document Abstractive Summarization | Jun 7, 2023 | Abstractive Text Summarization, Decoder | Unverified | 0 |
| Dial-MAE: ConTextual Masked Auto-Encoder for Retrieval-based Dialogue Systems | Jun 7, 2023 | Conversational Response Selection, Decoder | Code Available | 0 |
| Leveraging Explicit Procedural Instructions for Data-Efficient Action Prediction | Jun 6, 2023 | Language Modeling, Language Modelling | Unverified | 0 |
| On the Difference of BERT-style and CLIP-style Text Encoders | Jun 6, 2023 | Image Generation, Language Modeling | Code Available | 1 |
| Fair multilingual vandalism detection system for Wikipedia | Jun 2, 2023 | Feature Engineering, Language Modeling | Code Available | 0 |
| Understanding Augmentation-based Self-Supervised Representation Learning via RKHS Approximation and Regression | Jun 1, 2023 | Contrastive Learning, Data Augmentation | Unverified | 0 |