| Crane: Context-Guided Prompt Learning and Attention Refinement for Zero-Shot Anomaly Detections | Apr 15, 2025 | Anomaly Detection, Anomaly Localization | Code Available | 1 |
| Zero-Shot Whole-Body Humanoid Control via Behavioral Foundation Models | Apr 15, 2025 | Humanoid Control, Reinforcement Learning (RL) | Code Available | 4 |
| Detect Anything 3D in the Wild | Apr 10, 2025 | 3D Object Detection, Autonomous Driving | Code Available | 3 |
| SAM2MOT: A Novel Paradigm of Multi-Object Tracking by Segmentation | Apr 6, 2025 | Multi-Object Tracking, Object | Code Available | 2 |
| PicoPose: Progressive Pixel-to-Pixel Correspondence Learning for Novel Object Pose Estimation | Apr 3, 2025 | Object, Pose Estimation | Code Available | 1 |
| Delineate Anything: Resolution-Agnostic Field Boundary Delineation on Satellite Imagery | Apr 3, 2025 | Field Boundary Delineation, Instance Segmentation | Code Available | 2 |
| Evolutionary Prompt Optimization Discovers Emergent Multimodal Reasoning Strategies in Vision-Language Models | Mar 30, 2025 | Image Segmentation, Language Modeling | Unverified | 0 |
| Zero-shot Domain Generalization of Foundational Models for 3D Medical Image Segmentation: An Experimental Study | Mar 28, 2025 | Domain Generalization, Image Segmentation | Unverified | 0 |
| Q-Insight: Understanding Image Quality via Visual Reinforcement Learning | Mar 28, 2025 | Descriptive, Image Quality Assessment | Code Available | 2 |
| Thinking agents for zero-shot generalization to qualitatively novel tasks | Mar 25, 2025 | Zero-shot Generalization | Unverified | 0 |