| DFVEdit: Conditional Delta Flow Vector for Zero-shot Video Editing | Jun 26, 2025 | Video EditingVideo Generation | —Unverified | 0 |
| Let Your Video Listen to Your Music! | Jun 23, 2025 | GPUMusic Generation | —Unverified | 0 |
| Causally Steered Diffusion for Automated Video Counterfactual Generation | Jun 17, 2025 | counterfactualVideo Editing | CodeCode Available | 0 |
| LoRA-Edit: Controllable First-Frame-Guided Video Editing via Mask-Aware LoRA Fine-Tuning | Jun 11, 2025 | Video Editing | —Unverified | 0 |
| RoboSwap: A GAN-driven Video Diffusion Framework For Unsupervised Robot Arm Swapping | Jun 10, 2025 | Video Editing | —Unverified | 0 |
| Super Encoding Network: Recursive Association of Multi-Modal Encoders for Video Understanding | Jun 9, 2025 | Contrastive LearningVideo Editing | —Unverified | 0 |
| FADE: Frequency-Aware Diffusion Model Factorization for Video Editing | Jun 6, 2025 | Video Editing | CodeCode Available | 1 |
| FlowDirector: Training-Free Flow Steering for Precise Text-to-Video Editing | Jun 5, 2025 | Text-to-Video EditingVideo Editing | —Unverified | 0 |
| FullDiT2: Efficient In-Context Conditioning for Video Diffusion Transformers | Jun 4, 2025 | Video EditingVideo Generation | —Unverified | 0 |
| MiniMax-Remover: Taming Bad Noise Helps Video Object Removal | May 30, 2025 | Video EditingVideo Generation | —Unverified | 0 |
| Video Editing for Audio-Visual Dubbing | May 29, 2025 | Video Editing | CodeCode Available | 0 |
| Zero-to-Hero: Zero-Shot Initialization Empowering Reference-Based Video Appearance Editing | May 29, 2025 | Optical Flow EstimationVideo Editing | CodeCode Available | 1 |
| TDVE-Assessor: Benchmarking and Evaluating the Quality of Text-Driven Video Editing with LMMs | May 26, 2025 | BenchmarkingLarge Language Model | —Unverified | 0 |
| SRDiffusion: Accelerate Video Diffusion Inference via Sketching-Rendering Cooperation | May 25, 2025 | Video EditingVideo Generation | —Unverified | 0 |
| REGen: Multimodal Retrieval-Embedded Generation for Long-to-Short Video Editing | May 24, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| From Shots to Stories: LLM-Assisted Video Editing with Unified Language Representations | May 18, 2025 | Video EditingVideo Understanding | —Unverified | 0 |
| DAPE: Dual-Stage Parameter-Efficient Fine-Tuning for Consistent Video Editing with Diffusion Models | May 11, 2025 | parameter-efficient fine-tuningVideo Alignment | —Unverified | 0 |
| Video Forgery Detection for Surveillance Cameras: A Review | May 4, 2025 | Frame Duplication DetectionMisinformation | —Unverified | 0 |
| A Rusty Link in the AI Supply Chain: Detecting Evil Configurations in Model Repositories | May 2, 2025 | Code GenerationText Generation | —Unverified | 0 |
| Controllable Weather Synthesis and Removal with Video Diffusion Models | May 1, 2025 | Video Editing | —Unverified | 0 |
| Visual Prompting for One-shot Controllable Video Editing without Inversion | Apr 19, 2025 | Video EditingVisual Prompting | —Unverified | 0 |
| Understanding Attention Mechanism in Video Diffusion Models | Apr 16, 2025 | Video Editing | —Unverified | 0 |
| DC-SAM: In-Context Segment Anything in Images and Videos via Dual Consistency | Apr 16, 2025 | Few-Shot LearningInteractive Segmentation | CodeCode Available | 1 |
| Analysis of Attention in Video Diffusion Transformers | Apr 14, 2025 | Video Editing | —Unverified | 0 |
| CamMimic: Zero-Shot Image To Camera Motion Personalized Video Generation Using Diffusion Models | Apr 13, 2025 | Video EditingVideo Generation | —Unverified | 0 |
| FlashDepth: Real-time Streaming Video Depth Estimation at 2K Resolution | Apr 9, 2025 | 2kDecision Making | CodeCode Available | 3 |
| VideoSPatS: Video SPatiotemporal Splines for Disentangled Occlusion, Appearance and Motion Modeling and Editing | Apr 8, 2025 | DisentanglementMotion Disentanglement | —Unverified | 0 |
| How I Warped Your Noise: a Temporally-Correlated Noise Prior for Diffusion Models | Apr 3, 2025 | Video EditingVideo Generation | —Unverified | 0 |
| SketchVideo: Sketch-based Video Generation and Editing | Mar 30, 2025 | Video EditingVideo Generation | —Unverified | 0 |
| ReferDINO-Plus: 2nd Solution for 4th PVUW MeViS Challenge at CVPR 2025 | Mar 30, 2025 | ObjectReferring Video Object Segmentation | CodeCode Available | 0 |
| FreeInv: Free Lunch for Improving DDIM Inversion | Mar 29, 2025 | Video Editing | —Unverified | 0 |
| Wan: Open and Advanced Large-Scale Video Generative Models | Mar 26, 2025 | Video EditingVideo Generation | CodeCode Available | 11 |
| InsViE-1M: Effective Instruction-based Video Editing with Elaborate Dataset Construction | Mar 26, 2025 | Instruction FollowingVideo Editing | CodeCode Available | 1 |
| Zero-Shot Audio-Visual Editing via Cross-Modal Delta Denoising | Mar 26, 2025 | DenoisingVideo Editing | —Unverified | 0 |
| Resource-Efficient Motion Control for Video Generation via Dynamic Mask Guidance | Mar 24, 2025 | Text-to-Video GenerationVideo Editing | —Unverified | 0 |
| Shot Sequence Ordering for Video Editing: Benchmarks, Metrics, and Cinematology-Inspired Computing Methods | Mar 23, 2025 | Video Editing | CodeCode Available | 0 |
| InstructVEdit: A Holistic Approach for Instructional Video Editing | Mar 22, 2025 | Video Editing | —Unverified | 0 |
| HyperNVD: Accelerating Neural Video Decomposition via Hypernetworks | Mar 21, 2025 | Meta-LearningVideo Editing | —Unverified | 0 |
| VEGGIE: Instructional Editing and Reasoning Video Concepts with Grounded Generation | Mar 18, 2025 | Reasoning SegmentationVideo Editing | —Unverified | 0 |
| GIFT: Generated Indoor video frames for Texture-less point tracking | Mar 17, 2025 | Motion EstimationPoint Tracking | —Unverified | 0 |
| FiVE: A Fine-grained Video Editing Benchmark for Evaluating Emerging Diffusion and Rectified Flow Models | Mar 17, 2025 | SensitivityVideo Editing | —Unverified | 0 |
| RASA: Replace Anyone, Say Anything -- A Training-Free Framework for Audio-Driven and Universal Portrait Video Editing | Mar 14, 2025 | Video Editing | —Unverified | 0 |
| V2Edit: Versatile Video Diffusion Editor for Videos and 3D Scenes | Mar 13, 2025 | 3D scene EditingDenoising | —Unverified | 0 |
| Alias-Free Latent Diffusion Models:Improving Fractional Shift Equivariance of Diffusion Latent Space | Mar 12, 2025 | Image-to-Image TranslationVideo Editing | CodeCode Available | 2 |
| VACE: All-in-One Video Creation and Editing | Mar 10, 2025 | AllHuman-Domain Subject-to-Video | CodeCode Available | 7 |
| Get In Video: Add Anything You Want to the Video | Mar 8, 2025 | object-detectionObject Detection | —Unverified | 0 |
| VideoPainter: Any-length Video Inpainting and Editing with Plug-and-Play Context Control | Mar 7, 2025 | Image InpaintingOptical Flow Estimation | CodeCode Available | 4 |
| VideoHandles: Editing 3D Object Compositions in Videos Using Video Generative Priors | Mar 3, 2025 | 3D ReconstructionObject | —Unverified | 0 |
| VideoGrain: Modulating Space-Time Attention for Multi-grained Video Editing | Feb 24, 2025 | Video EditingVideo Generation | —Unverified | 0 |
| Señorita-2M: A High-Quality Instruction-based Dataset for General Video Editing by Video Specialists | Feb 10, 2025 | Video EditingVideo Generation | —Unverified | 0 |