| MotionCraft: Physics-based Zero-Shot Video Generation | May 22, 2024 | Image Generation, Missing Elements | Code Available | 1 |
| MoStGAN-V: Video Generation with Temporal Motion Styles | Apr 5, 2023 | Video Generation | Code Available | 1 |
| MOSO: Decomposing MOtion, Scene and Object for Video Prediction | Mar 7, 2023 | Object, Unconditional Video Generation | Code Available | 1 |
| MotionCrafter: One-Shot Motion Customization of Diffusion Models | Dec 8, 2023 | Disentanglement, Motion Disentanglement | Code Available | 1 |
| ^RFLAV: Rolling Flow matching for infinite Audio Video generation | Mar 11, 2025 | Video Generation | Code Available | 1 |
| MM-LDM: Multi-Modal Latent Diffusion Model for Sounding Video Generation | Oct 2, 2024 | Video Generation | Code Available | 1 |
| CascadeV: An Implementation of Wurstchen Architecture for Video Generation | Jan 28, 2025 | 2k, Video Generation | Code Available | 1 |
| MMGT: Motion Mask Guided Two-Stage Network for Co-Speech Gesture Video Generation | May 29, 2025 | Motion Generation, Video Generation | Code Available | 1 |
| MimicMotion: High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance | Jun 28, 2024 | Image Generation, Video Generation | Code Available | 1 |
| MiLA: Multi-view Intensive-fidelity Long-term Video Generation World Model for Autonomous Driving | Mar 20, 2025 | Autonomous Driving, Denoising | Code Available | 1 |
| Minute-Long Videos with Dual Parallelisms | May 27, 2025 | Denoising, GPU | Code Available | 1 |
| MoCoGAN: Decomposing Motion and Content for Video Generation | Jul 17, 2017 | Generative Adversarial Network, Video Generation | Code Available | 1 |
| Mask-conditioned latent diffusion for generating gastrointestinal polyp images | Apr 11, 2023 | Image Generation, Image Segmentation | Code Available | 1 |
| DragVideo: Interactive Drag-style Video Editing | Dec 3, 2023 | Video Editing, Video Generation | Code Available | 1 |
| Make It Move: Controllable Image-to-Video Generation with Text Descriptions | Dec 6, 2021 | Diversity, Image to Video Generation | Code Available | 1 |
| MAVIN: Multi-Action Video Generation with Diffusion Models via Transition Video Infilling | May 28, 2024 | Video Generation | Code Available | 1 |
| Make-A-Video: Text-to-Video Generation without Text-Video Data | Sep 29, 2022 | Decoder, Image Generation | Code Available | 1 |
| MEAD: A Large-scale Audio-visual Dataset for Emotional Talking-face Generation | Aug 1, 2020 | Face Generation, Talking Face Generation | Code Available | 1 |
| Model Reveals What to Cache: Profiling-Based Feature Reuse for Video Diffusion Models | Apr 4, 2025 | Denoising, Video Generation | Code Available | 1 |
| Free-Editor: Zero-shot Text-driven 3D Scene Editing | Dec 21, 2023 | 3D scene Editing, Style Transfer | Code Available | 1 |
| DLFR-VAE: Dynamic Latent Frame Rate VAE for Video Generation | Feb 17, 2025 | Video Generation | Code Available | 1 |
| CamContextI2V: Context-aware Controllable Video Generation | Apr 8, 2025 | Diversity, Scene Understanding | Code Available | 1 |
| MagicStick: Controllable Video Editing via Control Handle Transformations | Dec 5, 2023 | Video Editing, Video Generation | Code Available | 1 |
| Diverse Video Generation using a Gaussian Process Trigger | Jul 9, 2021 | Diversity, Video Generation | Code Available | 1 |
| Diverse Video Generation from a Single Video | May 11, 2022 | Video Generation | Code Available | 1 |
| Diverse Generation from a Single Video Made Possible | Sep 17, 2021 | Video Generation, Video Inpainting | Code Available | 1 |
| Diverse and Aligned Audio-to-Video Generation via Text-to-Video Model Adaptation | Sep 28, 2023 | Text-to-Video Generation, Video Generation | Code Available | 1 |
| BusterX: MLLM-Powered AI-Generated Video Forgery Detection and Explanation | May 19, 2025 | Binary Classification, DeepFake Detection | Code Available | 1 |
| 3D-Aware Video Generation | Jun 29, 2022 | Image Generation, Video Generation | Code Available | 1 |
| LOVE: Benchmarking and Evaluating Text-to-Video Generation and Video-to-Text Interpretation | May 17, 2025 | Benchmarking, Question Answering | Code Available | 1 |
| Latent Video Transformer | Jun 18, 2020 | Video Generation, Video Prediction | Code Available | 1 |
| Latent Neural Differential Equations for Video Generation | Nov 7, 2020 | Unconditional Video Generation, Video Generation | Code Available | 1 |
| LAVIE: High-Quality Video Generation with Cascaded Latent Diffusion Models | Sep 26, 2023 | Super-Resolution, Text-to-Video Generation | Code Available | 1 |
| DirecT2V: Large Language Models are Frame-Level Directors for Zero-Shot Text-to-Video Generation | May 23, 2023 | Text-to-Video Generation, Video Generation | Code Available | 1 |
| Latent Image Animator: Learning to animate image via latent space navigation | Sep 29, 2021 | Image Animation, Video Generation | Code Available | 1 |
| Diffusion Transformers for Tabular Data Time Series Generation | Apr 10, 2025 | Tabular Data Generation, Time Series | Code Available | 1 |
| InTraGen: Trajectory-controlled Video Generation for Object Interactions | Nov 25, 2024 | Object, Video Generation | Code Available | 1 |
| Diffusion Probabilistic Modeling for Video Generation | Mar 16, 2022 | Denoising, Image Generation | Code Available | 1 |
| Diffusion Models for Video Prediction and Infilling | Jun 15, 2022 | Prediction, Video Generation | Code Available | 1 |
| INR-V: A Continuous Representation Space for Video-based Generative Tasks | Oct 29, 2022 | Video Generation, Video Inpainting | Code Available | 1 |
| Inference-Time Text-to-Video Alignment with Diffusion Latent Beam Search | Jan 31, 2025 | Denoising, Video Alignment | Code Available | 1 |
| Improved Training Technique for Latent Consistency Models | Feb 3, 2025 | Video Generation | Code Available | 1 |
| Detecting AI-Generated Video via Frame Consistency | Feb 3, 2024 | Video Generation | Code Available | 1 |
| ViDiT-Q: Efficient and Accurate Quantization of Diffusion Transformers for Image and Video Generation | Jun 4, 2024 | Quantization, Video Generation | Code Available | 1 |
| Infrared Small Target Detection in Satellite Videos: A New Dataset and A Novel Recurrent Feature Refinement Framework | Sep 19, 2024 | Motion Compensation, Video Generation | Code Available | 1 |
| Learn the Force We Can: Enabling Sparse Motion Control in Multi-Object Video Generation | Jun 6, 2023 | Object, Video Generation | Code Available | 1 |
| BIVDiff: A Training-Free Framework for General-Purpose Video Synthesis via Bridging Image and Video Diffusion Models | Dec 5, 2023 | Image Generation, Model Selection | Code Available | 1 |
| Bidirectionally Deformable Motion Modulation For Video-based Human Pose Transfer | Jul 15, 2023 | motion prediction, Pose Transfer | Code Available | 1 |
| Hierarchical Patch VAE-GAN: Generating Diverse Videos from a Single Sample | Jun 22, 2020 | Diversity, Video Generation | Code Available | 1 |
| GODIVA: Generating Open-DomaIn Videos from nAtural Descriptions | Apr 30, 2021 | Text-to-Video Generation, Video Generation | Code Available | 1 |