| Wan: Open and Advanced Large-Scale Video Generative Models | Mar 26, 2025 | Video Editing, Video Generation | Code Available | 11 |
| VACE: All-in-One Video Creation and Editing | Mar 10, 2025 | All, Human-Domain Subject-to-Video | Code Available | 7 |
| Segment Anything for Videos: A Systematic Survey | Jul 31, 2024 | Image Segmentation, Robot Manipulation Generalization | Code Available | 5 |
| Mora: Enabling Generalist Video Generation via A Multi-Agent Framework | Mar 20, 2024 | Image to Video Generation, Text-to-Video Generation | Code Available | 5 |
| VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild | Nov 27, 2022 | Video Editing, Video Generation | Code Available | 5 |
| VideoPainter: Any-length Video Inpainting and Editing with Plug-and-Play Context Control | Mar 7, 2025 | Image Inpainting, Optical Flow Estimation | Code Available | 4 |
| Video Seal: Open and Efficient Video Watermarking | Dec 12, 2024 | Video Compression, Video Editing | Code Available | 4 |
| Taming Rectified Flow for Inversion and Editing | Nov 7, 2024 | Image Generation, Text-to-Image Generation | Code Available | 4 |
| AnyV2V: A Tuning-Free Framework For Any Video-to-Video Editing Tasks | Mar 21, 2024 | Image to Video Generation, Style Transfer | Code Available | 4 |
| A Survey on Video Diffusion Models | Oct 16, 2023 | Image Generation, Survey | Code Available | 4 |
| Dynamic 3D Gaussians: Tracking by Persistent Dynamic View Synthesis | Aug 18, 2023 | Dynamic Reconstruction, Novel View Synthesis | Code Available | 4 |
| Text2Video-Zero: Text-to-Image Diffusion Models are Zero-Shot Video Generators | Mar 23, 2023 | Image Generation, Text-to-Video Generation | Code Available | 4 |
| FlashDepth: Real-time Streaming Video Depth Estimation at 2K Resolution | Apr 9, 2025 | 2k, Decision Making | Code Available | 3 |
| JoyGen: Audio-Driven 3D Depth-Aware Talking-Face Video Editing | Jan 3, 2025 | 3D Reconstruction, Face Generation | Code Available | 3 |
| DiTCtrl: Exploring Attention Control in Multi-Modal Diffusion Transformer for Tuning-Free Multi-Prompt Longer Video Generation | Dec 24, 2024 | Video Editing, Video Generation | Code Available | 3 |
| AutoVFX: Physically Realistic Video Editing from Natural Language Instructions | Nov 4, 2024 | Code Generation, Video Editing | Code Available | 3 |
| Movie Gen: A Cast of Media Foundation Models | Oct 17, 2024 | Audio Generation, Video Editing | Code Available | 3 |
| Diffusion Model-Based Video Editing: A Survey | Jun 26, 2024 | model, Survey | Code Available | 3 |
| A Survey of Multimodal-Guided Image Editing with Text-to-Image Diffusion Models | Jun 20, 2024 | Video Editing | Code Available | 3 |
| MotionFollower: Editing Video Motion via Lightweight Score-Guided Diffusion | May 30, 2024 | Denoising, GPU | Code Available | 3 |
| Lumiere: A Space-Time Diffusion Model for Video Generation | Jan 23, 2024 | Super-Resolution, Text-to-Video Generation | Code Available | 3 |
| StableVideo: Text-driven Consistency-aware Diffusion Video Editing | Aug 18, 2023 | Video Editing | Code Available | 3 |
| TokenFlow: Consistent Diffusion Features for Consistent Video Editing | Jul 19, 2023 | Video Editing | Code Available | 3 |
| FateZero: Fusing Attentions for Zero-shot Text-based Video Editing | Mar 16, 2023 | Attribute, Text-to-Video Editing | Code Available | 3 |
| Alias-Free Latent Diffusion Models: Improving Fractional Shift Equivariance of Diffusion Latent Space | Mar 12, 2025 | Image-to-Image Translation, Video Editing | Code Available | 2 |