| MVOC: a training-free multiple video object composition method with diffusion models | Jun 22, 2024 | Image to Video Generation, Object | Code Available | 1 |
| CineTechBench: A Benchmark for Cinematographic Technique Understanding and Generation | May 21, 2025 | Video Generation | Code Available | 1 |
| Multi-StyleGAN: Towards Image-Based Simulation of Time-Lapse Live-Cell Microscopy | Jun 15, 2021 | Descriptive, Generative Adversarial Network | Code Available | 1 |
| DyDiT++: Dynamic Diffusion Transformers for Efficient Visual Generation | Apr 9, 2025 | Image Generation, Text to Image Generation | Code Available | 1 |
| Content-Rich AIGC Video Quality Assessment via Intricate Text Alignment and Motion-Aware Consistency | Feb 6, 2025 | Video Generation, Video Quality Assessment | Code Available | 1 |
| DwNet: Dense warp-based network for pose-guided human video generation | Oct 21, 2019 | Video Generation | Code Available | 1 |
| A Good Image Generator Is What You Need for High-Resolution Video Synthesis | Apr 30, 2021 | Video Generation | Code Available | 1 |
| Temporal Shift GAN for Large Scale Video Generation | Apr 4, 2020 | Video Generation | Code Available | 1 |
| DVD-Quant: Data-free Video Diffusion Transformers Quantization | May 24, 2025 | Data Free Quantization, Quantization | Code Available | 1 |
| MotionCraft: Physics-based Zero-Shot Video Generation | May 22, 2024 | Image Generation, Missing Elements | Code Available | 1 |
| DualDiff+: Dual-Branch Diffusion for High-Fidelity Video Generation with Reward Guidance | Mar 5, 2025 | 3D Object Detection, BEV Segmentation | Code Available | 1 |
| DTVNet: Dynamic Time-lapse Video Generation via Single Still Image | Aug 11, 2020 | Decoder, Optical Flow Estimation | Code Available | 1 |
| MotionCrafter: One-Shot Motion Customization of Diffusion Models | Dec 8, 2023 | Disentanglement, Motion Disentanglement | Code Available | 1 |
| Sketching the Future (STF): Applying Conditional Control Techniques to Text-to-Video Models | May 10, 2023 | Text-to-Video Generation, Video Generation | Code Available | 1 |
| DropletVideo: A Dataset and Approach to Explore Integral Spatio-Temporal Consistent Video Generation | Mar 8, 2025 | Video Generation | Code Available | 1 |
| Moonshot: Towards Controllable Video Generation and Editing with Multimodal Conditions | Jan 3, 2024 | Image Animation, Video Editing | Code Available | 1 |
| MoCoGAN: Decomposing Motion and Content for Video Generation | Jul 17, 2017 | Generative Adversarial Network, Video Generation | Code Available | 1 |
| C-Drag: Chain-of-Thought Driven Motion Controller for Video Generation | Feb 27, 2025 | Object, Video Generation | Code Available | 1 |
| Model Reveals What to Cache: Profiling-Based Feature Reuse for Video Diffusion Models | Apr 4, 2025 | Denoising, Video Generation | Code Available | 1 |
| MOSO: Decomposing MOtion, Scene and Object for Video Prediction | Mar 7, 2023 | Object, Unconditional Video Generation | Code Available | 1 |
| CCVS: Context-aware Controllable Video Synthesis | Jul 16, 2021 | Decoder, Optical Flow Estimation | Code Available | 1 |
| DriveCamSim: Generalizable Camera Simulation via Explicit Camera Modeling for Autonomous Driving | May 26, 2025 | Autonomous Driving, Video Generation | Code Available | 1 |
| Audeo: Audio Generation for a Silent Performance Video | Jun 23, 2020 | Audio Generation, Audio Synthesis | Code Available | 1 |
| Minute-Long Videos with Dual Parallelisms | May 27, 2025 | Denoising, GPU | Code Available | 1 |
| MMGT: Motion Mask Guided Two-Stage Network for Co-Speech Gesture Video Generation | May 29, 2025 | Motion Generation, Video Generation | Code Available | 1 |
| FEAT: Full-Dimensional Efficient Attention Transformer for Medical Video Generation | Jun 5, 2025 | Denoising, Video Generation | Code Available | 1 |
| MEAD: A Large-scale Audio-visual Dataset for Emotional Talking-face Generation | Aug 1, 2020 | Face Generation, Talking Face Generation | Code Available | 1 |
| FETV: A Benchmark for Fine-Grained Evaluation of Open-Domain Text-to-Video Generation | Nov 3, 2023 | Text-to-Video Generation, Video Generation | Code Available | 1 |
| MiLA: Multi-view Intensive-fidelity Long-term Video Generation World Model for Autonomous Driving | Mar 20, 2025 | Autonomous Driving, Denoising | Code Available | 1 |
| StoryBench: A Multifaceted Benchmark for Continuous Story Visualization | Aug 22, 2023 | Story Continuation, Story Generation | Code Available | 1 |
| MAVIN: Multi-Action Video Generation with Diffusion Models via Transition Video Infilling | May 28, 2024 | Video Generation | Code Available | 1 |
| MimicMotion: High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance | Jun 28, 2024 | Image Generation, Video Generation | Code Available | 1 |
| MM-LDM: Multi-Modal Latent Diffusion Model for Sounding Video Generation | Oct 2, 2024 | Video Generation | Code Available | 1 |
| Fine-grained Audible Video Description | Mar 27, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| MoStGAN-V: Video Generation with Temporal Motion Styles | Apr 5, 2023 | Video Generation | Code Available | 1 |
| Non-linear Motion Estimation for Video Frame Interpolation using Space-time Convolutions | Jan 27, 2022 | Motion Estimation, Video Frame Interpolation | Code Available | 1 |
| Prompt-A-Video: Prompt Your Video Diffusion Model via Preference-Aligned LLM | Dec 19, 2024 | Video Generation | Code Available | 1 |
| StyleFaceV: Face Video Generation via Decomposing and Recomposing Pretrained StyleGAN3 | Aug 16, 2022 | Image Generation, Video Generation | Code Available | 1 |
| CascadeV: An Implementation of Wurstchen Architecture for Video Generation | Jan 28, 2025 | 2k, Video Generation | Code Available | 1 |
| FitVid: Overfitting in Pixel-Level Video Prediction | Jun 24, 2021 | Image Augmentation, Prediction | Code Available | 1 |
| MagicStick: Controllable Video Editing via Control Handle Transformations | Dec 5, 2023 | Video Editing, Video Generation | Code Available | 1 |
| Make-A-Video: Text-to-Video Generation without Text-Video Data | Sep 29, 2022 | Decoder, Image Generation | Code Available | 1 |
| DragVideo: Interactive Drag-style Video Editing | Dec 3, 2023 | Video Editing, Video Generation | Code Available | 1 |
| LOVE: Benchmarking and Evaluating Text-to-Video Generation and Video-to-Text Interpretation | May 17, 2025 | Benchmarking, Question Answering | Code Available | 1 |
| Make It Move: Controllable Image-to-Video Generation with Text Descriptions | Dec 6, 2021 | Diversity, Image to Video Generation | Code Available | 1 |
| LAVIE: High-Quality Video Generation with Cascaded Latent Diffusion Models | Sep 26, 2023 | Super-Resolution, Text-to-Video Generation | Code Available | 1 |
| DLFR-VAE: Dynamic Latent Frame Rate VAE for Video Generation | Feb 17, 2025 | Video Generation | Code Available | 1 |
| CamContextI2V: Context-aware Controllable Video Generation | Apr 8, 2025 | Diversity, Scene Understanding | Code Available | 1 |
| Latent Video Transformer | Jun 18, 2020 | Video Generation, Video Prediction | Code Available | 1 |
| Diverse Video Generation using a Gaussian Process Trigger | Jul 9, 2021 | Diversity, Video Generation | Code Available | 1 |