| Benchmarking Vision, Language, & Action Models in Procedurally Generated, Open Ended Action Environments | May 8, 2025 | BenchmarkingPrompt Engineering | CodeCode Available | 1 |
| Nature-Inspired Population-Based Evolution of Large Language Models | Mar 3, 2025 | GPUZero-shot Generalization | CodeCode Available | 1 |
| Data-Efficient Contrastive Language-Image Pretraining: Prioritizing Data Quality over Quantity | Mar 18, 2024 | Zero-shot Generalization | CodeCode Available | 1 |
| DrVD-Bench: Do Vision-Language Models Reason Like Human Doctors in Medical Image Diagnosis? | May 30, 2025 | DiagnosticMedical Image Analysis | CodeCode Available | 1 |
| Encoding formulas as deep networks: Reinforcement learning for zero-shot execution of LTL formulas | Jun 1, 2020 | MinecraftMulti-Task Learning | CodeCode Available | 1 |
| Kick Back & Relax: Learning to Reconstruct the World by Watching SlowTV | Jul 20, 2023 | Depth EstimationDiversity | CodeCode Available | 1 |
| Just Shift It: Test-Time Prototype Shifting for Zero-Shot Generalization with Vision-Language Models | Mar 19, 2024 | image-classificationImage Classification | CodeCode Available | 1 |
| IRanker: Towards Ranking Foundation Model | Jun 25, 2025 | GSM8Kmodel | CodeCode Available | 1 |
| Instruction-Tuning Llama-3-8B Excels in City-Scale Mobility Prediction | Oct 31, 2024 | Disaster ResponseLanguage Modeling | CodeCode Available | 1 |
| Knowledge Transfer and Domain Adaptation for Fine-Grained Remote Sensing Image Segmentation | Dec 9, 2024 | Domain AdaptationImage Segmentation | CodeCode Available | 1 |