| Video Inpainting of Complex Scenes | Mar 18, 2015 | Image InpaintingVideo Editing | —Unverified | 0 |
| VideoMap: Supporting Video Editing Exploration, Brainstorming, and Prototyping in the Latent Space | Nov 22, 2022 | Asset ManagementManagement | —Unverified | 0 |
| Videoshop: Localized Semantic Video Editing with Noise-Extrapolated Diffusion Inversion | Mar 21, 2024 | Video Editing | —Unverified | 0 |
| VideoSPatS: Video SPatiotemporal Splines for Disentangled Occlusion, Appearance and Motion Modeling and Editing | Apr 8, 2025 | DisentanglementMotion Disentanglement | —Unverified | 0 |
| VideoSwap: Customized Video Subject Swapping with Interactive Semantic Point Correspondence | Dec 4, 2023 | Video Editing | —Unverified | 0 |
| VIDiff: Translating Videos via Multi-Modal Instructions with Diffusion Models | Nov 30, 2023 | Semantic SegmentationVideo Editing | —Unverified | 0 |
| VidStyleODE: Disentangled Video Editing via StyleGAN and NeuralODEs | Apr 12, 2023 | Image AnimationVideo Editing | —Unverified | 0 |
| Visual Prompting for One-shot Controllable Video Editing without Inversion | Apr 19, 2025 | Video EditingVisual Prompting | —Unverified | 0 |
| VIVID-10M: A Dataset and Baseline for Versatile and Interactive Video Local Editing | Nov 22, 2024 | Video Editing | —Unverified | 0 |
| V-LASIK: Consistent Glasses-Removal from Videos Using Synthetic Data | Jun 20, 2024 | AttributeVideo Editing | —Unverified | 0 |
| VLOGGER: Multimodal Diffusion for Embodied Avatar Synthesis | Mar 13, 2024 | Face DetectionVideo Editing | —Unverified | 0 |
| WorldDreamer: Towards General World Models for Video Generation via Predicting Masked Tokens | Jan 18, 2024 | Video EditingVideo Generation | —Unverified | 0 |
| Zero-Shot Audio-Visual Editing via Cross-Modal Delta Denoising | Mar 26, 2025 | DenoisingVideo Editing | —Unverified | 0 |
| Zero-Shot Video Editing through Adaptive Sliding Score Distillation | Jun 7, 2024 | DenoisingText-to-Video Generation | —Unverified | 0 |
| Zero-Shot Video Question Answering with Procedural Programs | Dec 1, 2023 | Code GenerationLanguage Modeling | —Unverified | 0 |
| Super Encoding Network: Recursive Association of Multi-Modal Encoders for Video Understanding | Jun 9, 2025 | Contrastive LearningVideo Editing | —Unverified | 0 |
| Survey of different Large Language Model Architectures: Trends, Benchmarks, and Challenges | Dec 4, 2024 | Code GenerationImage Comprehension | —Unverified | 0 |
| Sync from the Sea: Retrieving Alignable Videos from Large-Scale Datasets | Sep 2, 2024 | Video AlignmentVideo Editing | —Unverified | 0 |
| Task-agnostic Temporally Consistent Facial Video Editing | Jul 3, 2020 | 3D ReconstructionVideo Editing | —Unverified | 0 |
| TDVE-Assessor: Benchmarking and Evaluating the Quality of Text-Driven Video Editing with LMMs | May 26, 2025 | BenchmarkingLarge Language Model | —Unverified | 0 |
| Temporally Consistent Semantic Video Editing | Jun 21, 2022 | Image GenerationVideo Editing | —Unverified | 0 |
| Text-based Talking Video Editing with Cascaded Conditional Diffusion | Jul 20, 2024 | Video Editing | —Unverified | 0 |
| FacialFilmroll: High-resolution multi-shot video editing | Oct 5, 2021 | Face ModelVideo Editing | —Unverified | 0 |
| Text-to-Edit: Controllable End-to-End Video Ad Creation via Multimodal LLMs | Jan 10, 2025 | Video Editing | —Unverified | 0 |
| Text-Video Multi-Grained Integration for Video Moment Montage | Dec 12, 2024 | SentenceVideo Editing | —Unverified | 0 |