| Detecting AI-Generated Video via Frame Consistency | Feb 3, 2024 | Video Generation | CodeCode Available | 1 | 5 |
| DDMI: Domain-Agnostic Latent Diffusion Models for Synthesizing High-Quality Implicit Neural Representations | Jan 23, 2024 | 3D Shape GenerationImage Generation | CodeCode Available | 1 | 5 |
| MoStGAN-V: Video Generation with Temporal Motion Styles | Apr 5, 2023 | Video Generation | CodeCode Available | 1 | 5 |
| DDLP: Unsupervised Object-Centric Video Prediction with Deep Dynamic Latent Particles | Jun 9, 2023 | ObjectPosition | CodeCode Available | 1 | 5 |
| MOSO: Decomposing MOtion, Scene and Object for Video Prediction | Mar 7, 2023 | ObjectUnconditional Video Generation | CodeCode Available | 1 | 5 |
| D'ARTAGNAN: Counterfactual Video Generation | Jun 3, 2022 | Anatomycounterfactual | CodeCode Available | 1 | 5 |
| Moonshot: Towards Controllable Video Generation and Editing with Multimodal Conditions | Jan 3, 2024 | Image AnimationVideo Editing | CodeCode Available | 1 | 5 |
| Model Reveals What to Cache: Profiling-Based Feature Reuse for Video Diffusion Models | Apr 4, 2025 | DenoisingVideo Generation | CodeCode Available | 1 | 5 |
| Reuse and Diffuse: Iterative Denoising for Text-to-Video Generation | Sep 7, 2023 | Action RecognitionDecoder | CodeCode Available | 1 | 5 |
| CVPR 2023 Text Guided Video Editing Competition | Oct 24, 2023 | Video EditingVideo Generation | CodeCode Available | 1 | 5 |
| MM-LDM: Multi-Modal Latent Diffusion Model for Sounding Video Generation | Oct 2, 2024 | Video Generation | CodeCode Available | 1 | 5 |
| QuantCache: Adaptive Importance-Guided Quantization with Hierarchical Latent and Layer Caching for Video Generation | Mar 9, 2025 | QuantizationVideo Generation | CodeCode Available | 1 | 5 |
| MMGT: Motion Mask Guided Two-Stage Network for Co-Speech Gesture Video Generation | May 29, 2025 | Motion GenerationVideo Generation | CodeCode Available | 1 | 5 |
| Minute-Long Videos with Dual Parallelisms | May 27, 2025 | DenoisingGPU | CodeCode Available | 1 | 5 |
| Ctrl-V: Higher Fidelity Video Generation with Bounding-Box Controlled Object Motion | Jun 9, 2024 | Autonomous DrivingObject | CodeCode Available | 1 | 5 |
| MiLA: Multi-view Intensive-fidelity Long-term Video Generation World Model for Autonomous Driving | Mar 20, 2025 | Autonomous DrivingDenoising | CodeCode Available | 1 | 5 |
| MAVIN: Multi-Action Video Generation with Diffusion Models via Transition Video Infilling | May 28, 2024 | Video Generation | CodeCode Available | 1 | 5 |
| AMG: Avatar Motion Guided Video Generation | Sep 2, 2024 | Video Generation | CodeCode Available | 1 | 5 |
| MEAD: A Large-scale Audio-visual Dataset for Emotional Talking-face Generation | Aug 1, 2020 | Face GenerationTalking Face Generation | CodeCode Available | 1 | 5 |
| MimicMotion: High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance | Jun 28, 2024 | Image GenerationVideo Generation | CodeCode Available | 1 | 5 |
| MoCoGAN: Decomposing Motion and Content for Video Generation | Jul 17, 2017 | Generative Adversarial NetworkVideo Generation | CodeCode Available | 1 | 5 |
| Real-time One-Step Diffusion-based Expressive Portrait Videos Generation | Dec 18, 2024 | Video Generation | CodeCode Available | 1 | 5 |
| ^RFLAV: Rolling Flow matching for infinite Audio Video generation | Mar 11, 2025 | Video Generation | CodeCode Available | 1 | 5 |
| AMD-Hummingbird: Towards an Efficient Text-to-Video Model | Mar 24, 2025 | Computational EfficiencyVideo Generation | CodeCode Available | 1 | 5 |
| Audio-driven Talking Face Video Generation with Learning-based Personalized Head Pose | Feb 24, 2020 | 3D Face AnimationVideo Generation | CodeCode Available | 1 | 5 |
| MagicStick: Controllable Video Editing via Control Handle Transformations | Dec 5, 2023 | Video EditingVideo Generation | CodeCode Available | 1 | 5 |
| ACDiT: Interpolating Autoregressive Conditional Modeling and Diffusion Transformer | Dec 10, 2024 | DenoisingImage Generation | CodeCode Available | 1 | 5 |
| Make-A-Video: Text-to-Video Generation without Text-Video Data | Sep 29, 2022 | DecoderImage Generation | CodeCode Available | 1 | 5 |
| Align your Latents: High-Resolution Video Synthesis with Latent Diffusion Models | Apr 18, 2023 | Image GenerationSuper-Resolution | CodeCode Available | 1 | 5 |
| Audeo: Audio Generation for a Silent Performance Video | Jun 23, 2020 | Audio GenerationAudio Synthesis | CodeCode Available | 1 | 5 |
| Prompt-A-Video: Prompt Your Video Diffusion Model via Preference-Aligned LLM | Dec 19, 2024 | Video Generation | CodeCode Available | 1 | 5 |
| Make It Move: Controllable Image-to-Video Generation with Text Descriptions | Dec 6, 2021 | DiversityImage to Video Generation | CodeCode Available | 1 | 5 |
| Constrained Synthesis with Projected Diffusion Models | Feb 5, 2024 | Motion SynthesisVideo Generation | CodeCode Available | 1 | 5 |
| PoM: Efficient Image and Video Generation with the Polynomial Mixer | Nov 19, 2024 | Video Generation | CodeCode Available | 1 | 5 |
| Predicting Video with VQVAE | Mar 2, 2021 | Video GenerationVideo Prediction | CodeCode Available | 1 | 5 |
| Content-Rich AIGC Video Quality Assessment via Intricate Text Alignment and Motion-Aware Consistency | Feb 6, 2025 | Video GenerationVideo Quality Assessment | CodeCode Available | 1 | 5 |
| Learn the Force We Can: Enabling Sparse Motion Control in Multi-Object Video Generation | Jun 6, 2023 | ObjectVideo Generation | CodeCode Available | 1 | 5 |
| LOVE: Benchmarking and Evaluating Text-to-Video Generation and Video-to-Text Interpretation | May 17, 2025 | BenchmarkingQuestion Answering | CodeCode Available | 1 | 5 |
| Playable Video Generation | Jan 28, 2021 | DecoderVideo Generation | CodeCode Available | 1 | 5 |
| Latent Video Transformer | Jun 18, 2020 | Video GenerationVideo Prediction | CodeCode Available | 1 | 5 |
| A Light and Tuning-free Method for Simulating Camera Motion in Video Generation | Mar 9, 2025 | DenoisingDepth Estimation | CodeCode Available | 1 | 5 |
| ConditionVideo: Training-Free Condition-Guided Text-to-Video Generation | Oct 11, 2023 | Image GenerationText to Image Generation | CodeCode Available | 1 | 5 |
| LAVIE: High-Quality Video Generation with Cascaded Latent Diffusion Models | Sep 26, 2023 | Super-ResolutionText-to-Video Generation | CodeCode Available | 1 | 5 |
| Latent Image Animator: Learning to animate image via latent space navigation | Sep 29, 2021 | Image AnimationVideo Generation | CodeCode Available | 1 | 5 |
| Latent Neural Differential Equations for Video Generation | Nov 7, 2020 | Unconditional Video GenerationVideo Generation | CodeCode Available | 1 | 5 |
| Conditional diffusion model with spatial attention and latent embedding for medical image segmentation | Feb 10, 2025 | HippocampusImage Segmentation | CodeCode Available | 1 | 5 |
| Mask-conditioned latent diffusion for generating gastrointestinal polyp images | Apr 11, 2023 | Image GenerationImage Segmentation | CodeCode Available | 1 | 5 |
| InTraGen: Trajectory-controlled Video Generation for Object Interactions | Nov 25, 2024 | ObjectVideo Generation | CodeCode Available | 1 | 5 |
| AIGVE-Tool: AI-Generated Video Evaluation Toolkit with Multifaceted Benchmark | Mar 18, 2025 | Video Generation | CodeCode Available | 1 | 5 |
| Compositional Video Synthesis with Action Graphs | Jun 27, 2020 | SchedulingVideo Generation | CodeCode Available | 1 | 5 |