| EVA: Zero-shot Accurate Attributes and Multi-Object Video Editing | Mar 24, 2024 | AttributeVideo Editing | —Unverified | 0 |
| Make VLM Recognize Visual Hallucination on Cartoon Character Image with Pose Information | Mar 22, 2024 | 3D ReconstructionHallucination | —Unverified | 0 |
| AnyV2V: A Tuning-Free Framework For Any Video-to-Video Editing Tasks | Mar 21, 2024 | Image to Video GenerationStyle Transfer | CodeCode Available | 4 |
| Videoshop: Localized Semantic Video Editing with Noise-Extrapolated Diffusion Inversion | Mar 21, 2024 | Video Editing | —Unverified | 0 |
| Mora: Enabling Generalist Video Generation via A Multi-Agent Framework | Mar 20, 2024 | Image to Video GenerationText-to-Video Generation | CodeCode Available | 5 |
| DreamMotion: Space-Time Self-Similar Score Distillation for Zero-Shot Video Editing | Mar 18, 2024 | Video Editing | —Unverified | 0 |
| EffiVED:Efficient Video Editing via Text-instruction Diffusion Models | Mar 18, 2024 | Video Editing | CodeCode Available | 1 |
| AICL: Action In-Context Learning for Video Diffusion Model | Mar 18, 2024 | Action GenerationIn-Context Learning | CodeCode Available | 1 |
| Video Editing via Factorized Diffusion Distillation | Mar 14, 2024 | Video EditingVideo Generation | —Unverified | 0 |
| VLOGGER: Multimodal Diffusion for Embodied Avatar Synthesis | Mar 13, 2024 | Face DetectionVideo Editing | —Unverified | 0 |
| Action Reimagined: Text-to-Pose Video Editing for Dynamic Human Actions | Mar 11, 2024 | counterfactualVideo Editing | —Unverified | 0 |
| Reframe Anything: LLM Agent for Open World Video Reframing | Mar 10, 2024 | object-detectionObject Detection | —Unverified | 0 |
| FastVideoEdit: Leveraging Consistency Models for Efficient Text-to-Video Editing | Mar 10, 2024 | Image GenerationText-to-Video Editing | —Unverified | 0 |
| Contextualized Diffusion Models for Text-Guided Image and Video Generation | Feb 26, 2024 | Image GenerationText to Image Generation | CodeCode Available | 2 |
| Place Anything into Any Video | Feb 22, 2024 | 3D GenerationObject | —Unverified | 0 |
| UniEdit: A Unified Tuning-Free Framework for Video Motion and Appearance Editing | Feb 20, 2024 | Video Editing | —Unverified | 0 |
| Human Video Translation via Query Warping | Feb 19, 2024 | DenoisingTranslation | —Unverified | 0 |
| LAVE: LLM-Powered Agent Assistance and Language Augmentation for Video Editing | Feb 15, 2024 | Video Editing | —Unverified | 0 |
| Video Editing for Video Retrieval | Feb 4, 2024 | RetrievalText Retrieval | —Unverified | 0 |
| Anything in Any Scene: Photorealistic Video Object Insertion | Jan 30, 2024 | Data AugmentationObject | —Unverified | 0 |
| Generative Video Diffusion for Unseen Cross-Domain Video Moment Retrieval | Jan 24, 2024 | Moment RetrievalRetrieval | —Unverified | 0 |
| Lumiere: A Space-Time Diffusion Model for Video Generation | Jan 23, 2024 | Super-ResolutionText-to-Video Generation | CodeCode Available | 3 |
| WorldDreamer: Towards General World Models for Video Generation via Predicting Masked Tokens | Jan 18, 2024 | Video EditingVideo Generation | —Unverified | 0 |
| Object-Centric Diffusion for Efficient Video Editing | Jan 11, 2024 | Knowledge DistillationObject | —Unverified | 0 |
| FED-NeRF: Achieve High 3D Consistency and Temporal Coherence for Face Video Editing on Dynamic NeRF | Jan 5, 2024 | NeRFVideo Editing | CodeCode Available | 1 |