| VideoSPatS: Video SPatiotemporal Splines for Disentangled Occlusion, Appearance and Motion Modeling and Editing | Apr 8, 2025 | Disentanglement, Motion Disentanglement | —Unverified | 0 | 0 |
| VideoSwap: Customized Video Subject Swapping with Interactive Semantic Point Correspondence | Dec 4, 2023 | Video Editing | —Unverified | 0 | 0 |
| VIDiff: Translating Videos via Multi-Modal Instructions with Diffusion Models | Nov 30, 2023 | Semantic Segmentation, Video Editing | —Unverified | 0 | 0 |
| VidStyleODE: Disentangled Video Editing via StyleGAN and NeuralODEs | Apr 12, 2023 | Image Animation, Video Editing | —Unverified | 0 | 0 |
| Visual Prompting for One-shot Controllable Video Editing without Inversion | Apr 19, 2025 | Video Editing, Visual Prompting | —Unverified | 0 | 0 |
| VIVID-10M: A Dataset and Baseline for Versatile and Interactive Video Local Editing | Nov 22, 2024 | Video Editing | —Unverified | 0 | 0 |
| V-LASIK: Consistent Glasses-Removal from Videos Using Synthetic Data | Jun 20, 2024 | Attribute, Video Editing | —Unverified | 0 | 0 |
| VLOGGER: Multimodal Diffusion for Embodied Avatar Synthesis | Mar 13, 2024 | Face Detection, Video Editing | —Unverified | 0 | 0 |
| WorldDreamer: Towards General World Models for Video Generation via Predicting Masked Tokens | Jan 18, 2024 | Video Editing, Video Generation | —Unverified | 0 | 0 |
| Zero-Shot Audio-Visual Editing via Cross-Modal Delta Denoising | Mar 26, 2025 | Denoising, Video Editing | —Unverified | 0 | 0 |
| Zero-Shot Video Editing through Adaptive Sliding Score Distillation | Jun 7, 2024 | Denoising, Text-to-Video Generation | —Unverified | 0 | 0 |
| Zero-Shot Video Question Answering with Procedural Programs | Dec 1, 2023 | Code Generation, Language Modeling | —Unverified | 0 | 0 |
| ZeST-NeRF: Using temporal aggregation for Zero-Shot Temporal NeRFs | Nov 30, 2023 | Image Generation, NeRF | —Unverified | 0 | 0 |
| 2nd Place Solution for MOSE Track in CVPR 2024 PVUW workshop: Complex Video Object Segmentation | Jun 12, 2024 | Instance Segmentation, Semantic Segmentation | —Unverified | 0 | 0 |
| 2nd Place Solution for PVUW Challenge 2024: Video Panoptic Segmentation | Jun 1, 2024 | Autonomous Driving, Panoptic Segmentation | —Unverified | 0 | 0 |
| Action Reimagined: Text-to-Pose Video Editing for Dynamic Human Actions | Mar 11, 2024 | counterfactual, Video Editing | —Unverified | 0 | 0 |
| A Deep Multiscale Framework for Video Watermarking | Mar 28, 2023 | Video Editing | —Unverified | 0 | 0 |
| Adversarial Attacks on Video Object Segmentation with Hard Region Discovery | Sep 25, 2023 | Autonomous Driving, Object | —Unverified | 0 | 0 |
| AI based approach to Trailer Generation for Online Educational Courses | Jan 10, 2023 | Video Editing | —Unverified | 0 | 0 |
| Alias-Free Latent Diffusion Models: Improving Fractional Shift Equivariance of Diffusion Latent Space | Jan 1, 2025 | Image-to-Image Translation, Video Editing | —Unverified | 0 | 0 |
| Align-A-Video: Deterministic Reward Tuning of Image Diffusion Models for Consistent Video Editing | Jan 1, 2025 | Denoising, Video Editing | —Unverified | 0 | 0 |
| Analysis of Attention in Video Diffusion Transformers | Apr 14, 2025 | Video Editing | —Unverified | 0 | 0 |
| AnimateZoo: Zero-shot Video Generation of Cross-Species Animation via Subject Alignment | Apr 7, 2024 | Video Editing, Video Generation | —Unverified | 0 | 0 |
| Anything in Any Scene: Photorealistic Video Object Insertion | Jan 30, 2024 | Data Augmentation, Object | —Unverified | 0 | 0 |
| A Reinforcement Learning-Based Automatic Video Editing Method Using Pre-trained Vision-Language Model | Nov 7, 2024 | Language Modeling, Language Modelling | —Unverified | 0 | 0 |
| A Rusty Link in the AI Supply Chain: Detecting Evil Configurations in Model Repositories | May 2, 2025 | Code Generation, Text Generation | —Unverified | 0 | 0 |
| Audio-driven High-resolution Seamless Talking Head Video Editing via StyleGAN | Jul 8, 2024 | Disentanglement, Video Editing | —Unverified | 0 | 0 |
| Audio Match Cutting: Finding and Creating Matching Audio Transitions in Movies and Videos | Aug 20, 2024 | Video Editing | —Unverified | 0 | 0 |
| AutoMatch: A Large-scale Audio Beat Matching Benchmark for Boosting Deep Learning Assistant Video Editing | Mar 3, 2023 | Video Editing | —Unverified | 0 | 0 |
| Automatically Extract the Semi-transparent Motion-blurred Hand from a Single Image | Jun 27, 2019 | Decoder, Video Editing | —Unverified | 0 | 0 |
| Automatic Curation of Golf Highlights using Multimodal Excitement Features | Jul 22, 2017 | Action Recognition, Retrieval | —Unverified | 0 | 0 |
| Automatic Non-Linear Video Editing Transfer | May 14, 2021 | Video Editing | —Unverified | 0 | 0 |
| Blended Latent Diffusion under Attention Control for Real-World Video Editing | Sep 5, 2024 | Image Generation, Text to Image Generation | —Unverified | 0 | 0 |
| B-Script: Transcript-based B-roll Video Editing with Recommendations | Feb 28, 2019 | Video Editing | —Unverified | 0 | 0 |
| Calipso: Physics-based Image and Video Editing through CAD Model Proxies | Aug 12, 2017 | Video Editing | —Unverified | 0 | 0 |
| CAMEL: CAusal Motion Enhancement Tailored for Lifting Text-driven Video Editing | Jan 1, 2024 | Video Editing | —Unverified | 0 | 0 |
| CamMimic: Zero-Shot Image To Camera Motion Personalized Video Generation Using Diffusion Models | Apr 13, 2025 | Video Editing, Video Generation | —Unverified | 0 | 0 |
| Make VLM Recognize Visual Hallucination on Cartoon Character Image with Pose Information | Mar 22, 2024 | 3D Reconstruction, Hallucination | —Unverified | 0 | 0 |
| Clarification of Video Retrieval Query Results by the Automated Insertion of Supporting Shots | Feb 19, 2021 | Retrieval, Video Editing | —Unverified | 0 | 0 |
| Consistent and Controllable Image Animation with Motion Diffusion Models | Jan 1, 2025 | Image Animation, Video Editing | —Unverified | 0 | 0 |
| Consistent Depth of Moving Objects in Video | Aug 2, 2021 | Depth Estimation, Depth Prediction | —Unverified | 0 | 0 |
| Controllable Weather Synthesis and Removal with Video Diffusion Models | May 1, 2025 | Video Editing | —Unverified | 0 | 0 |
| Counteracting temporal attacks in Video Copy Detection | Jan 19, 2025 | Copy Detection, Video Editing | —Unverified | 0 | 0 |
| CSS-Segment: 2nd Place Report of LSVOS Challenge VOS Track | Aug 24, 2024 | Autonomous Driving, Object | —Unverified | 0 | 0 |
| Ctrl-Adapter: An Efficient and Versatile Framework for Adapting Diverse Controls to Any Diffusion Model | Apr 15, 2024 | GPU, Image Generation | —Unverified | 0 | 0 |
| Cut-and-Paste: Subject-Driven Video Editing with Attention Control | Nov 20, 2023 | Object, Video Editing | —Unverified | 0 | 0 |
| Dance In the Wild: Monocular Human Animation with Neural Dynamic Appearance Synthesis | Nov 10, 2021 | Human Animation, motion retargeting | —Unverified | 0 | 0 |
| DAPE: Dual-Stage Parameter-Efficient Fine-Tuning for Consistent Video Editing with Diffusion Models | May 11, 2025 | parameter-efficient fine-tuning, Video Alignment | —Unverified | 0 | 0 |
| DeCo: Decoupled Human-Centered Diffusion Video Editing with Motion Consistency | Aug 14, 2024 | text-guided-image-editing, Video Editing | —Unverified | 0 | 0 |
| Designing a 3D-Aware StyleNeRF Encoder for Face Editing | Feb 19, 2023 | Attribute, Face Model | —Unverified | 0 | 0 |