| CLIP-Embed-KD: Computationally Efficient Knowledge Distillation Using Embeddings as Teachers | Apr 9, 2024 | Knowledge Distillation, Zero-shot Generalization | Code Available | 1 | 5 |
| Label Agnostic Pre-training for Zero-shot Text Classification | May 25, 2023 | Classification, text-classification | Code Available | 1 | 5 |
| MetaMorph: Learning Universal Controllers with Transformers | Mar 22, 2022 | Zero-shot Generalization | Code Available | 1 | 5 |
| DrVD-Bench: Do Vision-Language Models Reason Like Human Doctors in Medical Image Diagnosis? | May 30, 2025 | Diagnostic, Medical Image Analysis | Code Available | 1 | 5 |
| Benchmarking Vision, Language, & Action Models in Procedurally Generated, Open Ended Action Environments | May 8, 2025 | Benchmarking, Prompt Engineering | Code Available | 1 | 5 |
| Data-Efficient Contrastive Language-Image Pretraining: Prioritizing Data Quality over Quantity | Mar 18, 2024 | Zero-shot Generalization | Code Available | 1 | 5 |
| Gradient Ascent Post-training Enhances Language Model Generalization | Jun 12, 2023 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Generalization without systematicity: On the compositional skills of sequence-to-sequence recurrent networks | Oct 31, 2017 | Machine Translation, Translation | Code Available | 1 | 5 |
| Generalization to New Actions in Reinforcement Learning | Nov 3, 2020 | reinforcement-learning, Reinforcement Learning | Code Available | 1 | 5 |
| μLO: Compute-Efficient Meta-Generalization of Learned Optimizers | May 31, 2024 | GPU, Zero-shot Generalization | Code Available | 1 | 5 |