| Rethinking Video Tokenization: A Conditioned Diffusion-based Approach | Mar 5, 2025 | DecoderVideo Compression | CodeCode Available | 1 |
| Detecting AI-Generated Video via Frame Consistency | Feb 3, 2024 | Video Generation | CodeCode Available | 1 |
| DDMI: Domain-Agnostic Latent Diffusion Models for Synthesizing High-Quality Implicit Neural Representations | Jan 23, 2024 | 3D Shape GenerationImage Generation | CodeCode Available | 1 |
| DDLP: Unsupervised Object-Centric Video Prediction with Deep Dynamic Latent Particles | Jun 9, 2023 | ObjectPosition | CodeCode Available | 1 |
| D'ARTAGNAN: Counterfactual Video Generation | Jun 3, 2022 | Anatomycounterfactual | CodeCode Available | 1 |
| Constrained Synthesis with Projected Diffusion Models | Feb 5, 2024 | Motion SynthesisVideo Generation | CodeCode Available | 1 |
| Predicting Video with VQVAE | Mar 2, 2021 | Video GenerationVideo Prediction | CodeCode Available | 1 |
| Prompt-A-Video: Prompt Your Video Diffusion Model via Preference-Aligned LLM | Dec 19, 2024 | Video Generation | CodeCode Available | 1 |
| Reuse and Diffuse: Iterative Denoising for Text-to-Video Generation | Sep 7, 2023 | Action RecognitionDecoder | CodeCode Available | 1 |
| Pix2Gif: Motion-Guided Diffusion for GIF Generation | Mar 7, 2024 | Video Generation | CodeCode Available | 1 |
| CVPR 2023 Text Guided Video Editing Competition | Oct 24, 2023 | Video EditingVideo Generation | CodeCode Available | 1 |
| Playable Environments: Video Manipulation in Space and Time | Mar 3, 2022 | Video Generation | CodeCode Available | 1 |
| Playable Video Generation | Jan 28, 2021 | DecoderVideo Generation | CodeCode Available | 1 |
| Ctrl-V: Higher Fidelity Video Generation with Bounding-Box Controlled Object Motion | Jun 9, 2024 | Autonomous DrivingObject | CodeCode Available | 1 |
| AMG: Avatar Motion Guided Video Generation | Sep 2, 2024 | Video Generation | CodeCode Available | 1 |
| Patch-based Object-centric Transformers for Efficient Video Generation | Jun 8, 2022 | ObjectVideo Editing | CodeCode Available | 1 |
| Ophora: A Large-Scale Data-Driven Text-Guided Ophthalmic Surgical Video Generation Model | May 12, 2025 | Video Generation | CodeCode Available | 1 |
| PEEKABOO: Interactive Video Generation via Masked-Diffusion | Dec 12, 2023 | Text-to-Video GenerationVideo Generation | CodeCode Available | 1 |
| PoM: Efficient Image and Video Generation with the Polynomial Mixer | Nov 19, 2024 | Video Generation | CodeCode Available | 1 |
| ^RFLAV: Rolling Flow matching for infinite Audio Video generation | Mar 11, 2025 | Video Generation | CodeCode Available | 1 |
| OpenCarbonEval: A Unified Carbon Emission Estimation Framework in Large-Scale AI Models | May 21, 2024 | Video Generation | CodeCode Available | 1 |
| AMD-Hummingbird: Towards an Efficient Text-to-Video Model | Mar 24, 2025 | Computational EfficiencyVideo Generation | CodeCode Available | 1 |
| OnlyFlow: Optical Flow based Motion Conditioning for Video Diffusion Models | Nov 15, 2024 | Optical Flow EstimationText-to-Video Generation | CodeCode Available | 1 |
| Audio-driven Talking Face Video Generation with Learning-based Personalized Head Pose | Feb 24, 2020 | 3D Face AnimationVideo Generation | CodeCode Available | 1 |
| ACDiT: Interpolating Autoregressive Conditional Modeling and Diffusion Transformer | Dec 10, 2024 | DenoisingImage Generation | CodeCode Available | 1 |
| OneTo3D: One Image to Re-editable Dynamic 3D Model and Video Generation | May 10, 2024 | 3D ReconstructionImage to 3D | CodeCode Available | 1 |
| Align your Latents: High-Resolution Video Synthesis with Latent Diffusion Models | Apr 18, 2023 | Image GenerationSuper-Resolution | CodeCode Available | 1 |
| NÜWA: Visual Synthesis Pre-training for Neural visUal World creAtion | Nov 24, 2021 | DecoderImage Generation | CodeCode Available | 1 |
| Audeo: Audio Generation for a Silent Performance Video | Jun 23, 2020 | Audio GenerationAudio Synthesis | CodeCode Available | 1 |
| Object-Centric Image to Video Generation with Language Guidance | Feb 17, 2025 | Image to Video GenerationObject | CodeCode Available | 1 |
| Non-linear Motion Estimation for Video Frame Interpolation using Space-time Convolutions | Jan 27, 2022 | Motion EstimationVideo Frame Interpolation | CodeCode Available | 1 |
| EfficientMT: Efficient Temporal Adaptation for Motion Transfer in Text-to-Video Diffusion Models | Mar 25, 2025 | Video Generation | CodeCode Available | 1 |
| Multi-StyleGAN: Towards Image-Based Simulation of Time-Lapse Live-Cell Microscopy | Jun 15, 2021 | DescriptiveGenerative Adversarial Network | CodeCode Available | 1 |
| Temporal Shift GAN for Large Scale Video Generation | Apr 4, 2020 | Video Generation | CodeCode Available | 1 |
| Content-Rich AIGC Video Quality Assessment via Intricate Text Alignment and Motion-Aware Consistency | Feb 6, 2025 | Video GenerationVideo Quality Assessment | CodeCode Available | 1 |
| MotionCrafter: One-Shot Motion Customization of Diffusion Models | Dec 8, 2023 | DisentanglementMotion Disentanglement | CodeCode Available | 1 |
| MotionCraft: Physics-based Zero-Shot Video Generation | May 22, 2024 | Image GenerationMissing Elements | CodeCode Available | 1 |
| MVOC: a training-free multiple video object composition method with diffusion models | Jun 22, 2024 | Image to Video GenerationObject | CodeCode Available | 1 |
| MOSO: Decomposing MOtion, Scene and Object for Video Prediction | Mar 7, 2023 | ObjectUnconditional Video Generation | CodeCode Available | 1 |
| A Light and Tuning-free Method for Simulating Camera Motion in Video Generation | Mar 9, 2025 | DenoisingDepth Estimation | CodeCode Available | 1 |
| ConditionVideo: Training-Free Condition-Guided Text-to-Video Generation | Oct 11, 2023 | Image GenerationText to Image Generation | CodeCode Available | 1 |
| Moonshot: Towards Controllable Video Generation and Editing with Multimodal Conditions | Jan 3, 2024 | Image AnimationVideo Editing | CodeCode Available | 1 |
| MoStGAN-V: Video Generation with Temporal Motion Styles | Apr 5, 2023 | Video Generation | CodeCode Available | 1 |
| Efficient Diffusion Models: A Comprehensive Survey from Principles to Practices | Oct 15, 2024 | Image Generationmultimodal generation | CodeCode Available | 1 |
| Model Reveals What to Cache: Profiling-Based Feature Reuse for Video Diffusion Models | Apr 4, 2025 | DenoisingVideo Generation | CodeCode Available | 1 |
| Conditional diffusion model with spatial attention and latent embedding for medical image segmentation | Feb 10, 2025 | HippocampusImage Segmentation | CodeCode Available | 1 |
| MimicMotion: High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance | Jun 28, 2024 | Image GenerationVideo Generation | CodeCode Available | 1 |
| AIGVE-Tool: AI-Generated Video Evaluation Toolkit with Multifaceted Benchmark | Mar 18, 2025 | Video Generation | CodeCode Available | 1 |
| Minute-Long Videos with Dual Parallelisms | May 27, 2025 | DenoisingGPU | CodeCode Available | 1 |
| MMGT: Motion Mask Guided Two-Stage Network for Co-Speech Gesture Video Generation | May 29, 2025 | Motion GenerationVideo Generation | CodeCode Available | 1 |