| Videoshop: Localized Semantic Video Editing with Noise-Extrapolated Diffusion Inversion | Mar 21, 2024 | Video Editing | —Unverified | 0 | 0 |
| VideoSPatS: Video SPatiotemporal Splines for Disentangled Occlusion, Appearance and Motion Modeling and Editing | Apr 8, 2025 | Disentanglement, Motion Disentanglement | —Unverified | 0 | 0 |
| VideoSwap: Customized Video Subject Swapping with Interactive Semantic Point Correspondence | Dec 4, 2023 | Video Editing | —Unverified | 0 | 0 |
| VIDiff: Translating Videos via Multi-Modal Instructions with Diffusion Models | Nov 30, 2023 | Semantic Segmentation, Video Editing | —Unverified | 0 | 0 |
| VidStyleODE: Disentangled Video Editing via StyleGAN and NeuralODEs | Apr 12, 2023 | Image Animation, Video Editing | —Unverified | 0 | 0 |
| Visual Prompting for One-shot Controllable Video Editing without Inversion | Apr 19, 2025 | Video Editing, Visual Prompting | —Unverified | 0 | 0 |
| VIVID-10M: A Dataset and Baseline for Versatile and Interactive Video Local Editing | Nov 22, 2024 | Video Editing | —Unverified | 0 | 0 |
| V-LASIK: Consistent Glasses-Removal from Videos Using Synthetic Data | Jun 20, 2024 | Attribute, Video Editing | —Unverified | 0 | 0 |
| VLOGGER: Multimodal Diffusion for Embodied Avatar Synthesis | Mar 13, 2024 | Face Detection, Video Editing | —Unverified | 0 | 0 |
| WorldDreamer: Towards General World Models for Video Generation via Predicting Masked Tokens | Jan 18, 2024 | Video Editing, Video Generation | —Unverified | 0 | 0 |