| Capability-Aware Shared Hypernetworks for Flexible Heterogeneous Multi-Robot Coordination | Jan 10, 2025 | DiversityImitation Learning | CodeCode Available | 0 |
| Improving Zero-Shot Object-Level Change Detection by Incorporating Visual Correspondence | Jan 9, 2025 | Change DetectionZero-shot Generalization | CodeCode Available | 1 |
| Robotic Programmer: Video Instructed Policy Code Generation for Robotic Manipulation | Jan 8, 2025 | Code GenerationLanguage Modeling | —Unverified | 0 |
| MADation: Face Morphing Attack Detection with Foundation Models | Jan 7, 2025 | Face Morphing Attack DetectionFace Recognition | CodeCode Available | 0 |
| Depth Any Camera: Zero-Shot Metric Depth Estimation from Any Camera | Jan 5, 2025 | Data AugmentationDepth Estimation | CodeCode Available | 3 |
| Spot Risks Before Speaking! Unraveling Safety Attention Heads in Large Vision-Language Models | Jan 3, 2025 | Zero-shot Generalization | CodeCode Available | 0 |
| OW-OVD: Unified Open World and Open Vocabulary Object Detection | Jan 1, 2025 | AttributeIncremental Learning | CodeCode Available | 1 |
| On the Zero-shot Adversarial Robustness of Vision-Language Models: A Truly Zero-shot and Training-free Approach | Jan 1, 2025 | Adversarial RobustnessZero-shot Generalization | —Unverified | 0 |
| On the Out-Of-Distribution Generalization of Large Multimodal Models | Jan 1, 2025 | In-Context LearningOut-of-Distribution Generalization | —Unverified | 0 |
| FRESA: Feedforward Reconstruction of Personalized Skinned Avatars from Few Images | Jan 1, 2025 | 3D CanonicalizationZero-shot Generalization | CodeCode Available | 1 |