| Visual Prompting for One-shot Controllable Video Editing without Inversion | Apr 19, 2025 | Video EditingVisual Prompting | —Unverified | 0 |
| Understanding Attention Mechanism in Video Diffusion Models | Apr 16, 2025 | Video Editing | —Unverified | 0 |
| DC-SAM: In-Context Segment Anything in Images and Videos via Dual Consistency | Apr 16, 2025 | Few-Shot LearningInteractive Segmentation | CodeCode Available | 1 |
| Analysis of Attention in Video Diffusion Transformers | Apr 14, 2025 | Video Editing | —Unverified | 0 |
| CamMimic: Zero-Shot Image To Camera Motion Personalized Video Generation Using Diffusion Models | Apr 13, 2025 | Video EditingVideo Generation | —Unverified | 0 |
| FlashDepth: Real-time Streaming Video Depth Estimation at 2K Resolution | Apr 9, 2025 | 2kDecision Making | CodeCode Available | 3 |
| VideoSPatS: Video SPatiotemporal Splines for Disentangled Occlusion, Appearance and Motion Modeling and Editing | Apr 8, 2025 | DisentanglementMotion Disentanglement | —Unverified | 0 |
| How I Warped Your Noise: a Temporally-Correlated Noise Prior for Diffusion Models | Apr 3, 2025 | Video EditingVideo Generation | —Unverified | 0 |
| SketchVideo: Sketch-based Video Generation and Editing | Mar 30, 2025 | Video EditingVideo Generation | —Unverified | 0 |
| ReferDINO-Plus: 2nd Solution for 4th PVUW MeViS Challenge at CVPR 2025 | Mar 30, 2025 | ObjectReferring Video Object Segmentation | CodeCode Available | 0 |