| Improving Zero-shot Generalization and Robustness of Multi-modal Models | Dec 4, 2022 | image-classificationImage Classification | CodeCode Available | 1 | 5 |
| Improving Zero-Shot Generalization for CLIP with Synthesized Prompts | Jul 14, 2023 | Generalized Zero-Shot LearningTransfer Learning | CodeCode Available | 1 | 5 |
| PicoPose: Progressive Pixel-to-Pixel Correspondence Learning for Novel Object Pose Estimation | Apr 3, 2025 | ObjectPose Estimation | CodeCode Available | 1 | 5 |
| A Two-stage Reinforcement Learning-based Approach for Multi-entity Task Allocation | Jun 29, 2024 | Combinatorial Optimizationreinforcement-learning | CodeCode Available | 1 | 5 |
| Encoding formulas as deep networks: Reinforcement learning for zero-shot execution of LTL formulas | Jun 1, 2020 | MinecraftMulti-Task Learning | CodeCode Available | 1 | 5 |
| Benchmarking Vision, Language, & Action Models in Procedurally Generated, Open Ended Action Environments | May 8, 2025 | BenchmarkingPrompt Engineering | CodeCode Available | 1 | 5 |
| PartDistillation: Learning Parts From Instance Segmentation | Jan 1, 2023 | Instance SegmentationObject | CodeCode Available | 1 | 5 |
| Gradient Ascent Post-training Enhances Language Model Generalization | Jun 12, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| MatSAM: Efficient Extraction of Microstructures of Materials via Visual Large Model | Jan 11, 2024 | Image SegmentationPrompt Engineering | CodeCode Available | 1 | 5 |
| Improving Diffusion Models for Scene Text Editing with Dual Encoders | Apr 12, 2023 | Scene Text EditingStyle Transfer | CodeCode Available | 1 | 5 |