| MEt3R: Measuring Multi-View Consistency in Generated Images | Jan 10, 2025 | Image GenerationVideo Generation | —Unverified | 0 |
| MG-Gen: Single Image to Motion Graphics Generation with Layer Decomposition | Apr 3, 2025 | Code GenerationImage to Video Generation | —Unverified | 0 |
| MicroCinema: A Divide-and-Conquer Approach for Text-to-Video Generation | Nov 30, 2023 | Image GenerationText to Image Generation | —Unverified | 0 |
| Mimir: Improving Video Diffusion Models for Precise Text Understanding | Dec 4, 2024 | DecoderReading Comprehension | —Unverified | 0 |
| Mind the Time: Temporally-Controlled Multi-Event Video Generation | Dec 6, 2024 | Video Generation | —Unverified | 0 |
| MinD: Unified Visual Imagination and Control via Hierarchical World Models | Jun 23, 2025 | Video GenerationVideo Prediction | —Unverified | 0 |
| MiniMax-Remover: Taming Bad Noise Helps Video Object Removal | May 30, 2025 | Video EditingVideo Generation | —Unverified | 0 |
| MJ-VIDEO: Fine-Grained Benchmarking and Rewarding Video Preferences in Video Generation | Feb 3, 2025 | BenchmarkingFairness | —Unverified | 0 |
| MobileVidFactory: Automatic Diffusion-Based Social Media Video Generation for Mobile Devices from Text | Jul 31, 2023 | Video Generation | —Unverified | 0 |
| MoCha: Towards Movie-Grade Talking Character Synthesis | Mar 30, 2025 | Video Generation | —Unverified | 0 |
| Modular-Cam: Modular Dynamic Camera-view Video Generation with LLM | Apr 16, 2025 | Large Language ModelText-to-Video Generation | —Unverified | 0 |
| Mojito: Motion Trajectory and Intensity Control for Video Generation | Dec 12, 2024 | Computational EfficiencyOptical Flow Estimation | —Unverified | 0 |
| Morpheus: Benchmarking Physical Reasoning of Video Generative Models with Real Physical Experiments | Apr 3, 2025 | Physical Commonsense ReasoningVideo Generation | —Unverified | 0 |
| MotionAgent: Fine-grained Controllable Video Generation via Motion Field Agent | Feb 5, 2025 | Image to Video GenerationMotion Generation | —Unverified | 0 |
| Motion-Aware Generative Frame Interpolation | Jan 7, 2025 | Video Generation | —Unverified | 0 |
| Motion-aware Latent Diffusion Models for Video Frame Interpolation | Apr 21, 2024 | Motion EstimationVideo Frame Interpolation | —Unverified | 0 |
| MotionBooth: Motion-Aware Customized Text-to-Video Generation | Jun 25, 2024 | Text-to-Video GenerationVideo Generation | —Unverified | 0 |
| MotionBridge: Dynamic Video Inbetweening with Flexible Controls | Dec 17, 2024 | Video EditingVideo Generation | —Unverified | 0 |
| MotionCanvas: Cinematic Shot Design with Controllable Image-to-Video Generation | Feb 6, 2025 | Image to Video GenerationVideo Editing | —Unverified | 0 |
| MotionCharacter: Identity-Preserving and Motion Controllable Human Video Generation | Nov 27, 2024 | AttributeVideo Generation | —Unverified | 0 |
| Motion Control for Enhanced Complex Action Video Generation | Nov 13, 2024 | Motion GenerationVideo Generation | —Unverified | 0 |
| Motion-I2V: Consistent and Controllable Image-to-Video Generation with Explicit Motion Modeling | Jan 29, 2024 | Image to Video GenerationVideo Generation | —Unverified | 0 |
| MotionMaster: Training-free Camera Motion Transfer For Video Generation | Apr 24, 2024 | DisentanglementMotion Disentanglement | —Unverified | 0 |
| Motion Modes: What Could Happen Next? | Nov 29, 2024 | DiversityObject | —Unverified | 0 |
| MotionPro: A Precise Motion Controller for Image-to-Video Generation | May 26, 2025 | DenoisingImage to Video Generation | —Unverified | 0 |
| Motion Prompting: Controlling Video Generation with Motion Trajectories | Dec 3, 2024 | Video Generation | —Unverified | 0 |
| MotionStone: Decoupled Motion Intensity Modulation with Diffusion Transformer for Image-to-Video Generation | Dec 8, 2024 | Contrastive LearningImage to Video Generation | —Unverified | 0 |
| MotionZero:Exploiting Motion Priors for Zero-shot Text-to-Video Generation | Nov 28, 2023 | DisentanglementText-to-Video Generation | —Unverified | 0 |
| Motion-Zero: Zero-Shot Moving Object Control Framework for Diffusion-Based Video Generation | Jan 18, 2024 | DenoisingPosition | —Unverified | 0 |
| MoTrans: Customized Motion Transfer with Text-driven Video Diffusion Models | Dec 2, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| MoVideo: Motion-Aware Video Generation with Diffusion Models | Nov 19, 2023 | Image GenerationImage to Video Generation | —Unverified | 0 |
| MovieDreamer: Hierarchical Generation for Coherent Long Visual Sequence | Jul 23, 2024 | Video Generation | —Unverified | 0 |
| Movie Gen: SWOT Analysis of Meta's Generative AI Foundation Model for Transforming Media Generation, Advertising, and Entertainment Industries | Dec 5, 2024 | Video Generation | —Unverified | 0 |
| MOVi: Training-free Text-conditioned Multi-Object Video Generation | May 29, 2025 | ObjectVideo Generation | —Unverified | 0 |
| MSC: Multi-Scale Spatio-Temporal Causal Attention for Autoregressive Video Diffusion | Dec 13, 2024 | Video Generation | —Unverified | 0 |
| UniForm: A Unified Multi-Task Diffusion Transformer for Audio-Video Generation | Feb 6, 2025 | Audio GenerationDiversity | —Unverified | 0 |
| UniGeo: Taming Video Diffusion for Unified Consistent Geometry Estimation | May 30, 2025 | Video Generation | —Unverified | 0 |
| UniReal: Universal Image Generation and Editing via Learning Real-world Dynamics | Dec 10, 2024 | Image GenerationVideo Generation | —Unverified | 0 |
| UniVG: Towards UNIfied-modal Video Generation | Jan 17, 2024 | Video Generation | —Unverified | 0 |
| Unlearning Concepts from Text-to-Video Diffusion Models | Jul 19, 2024 | Text-to-Video GenerationVideo Generation | —Unverified | 0 |
| Unleashing Generalization of End-to-End Autonomous Driving with Controllable Long Video Generation | Jun 3, 2024 | Autonomous DrivingVideo Generation | —Unverified | 0 |
| Unpaired Cartoon Image Synthesis via Gated Cycle Mapping | Jan 1, 2022 | Image GenerationVideo Generation | —Unverified | 0 |
| Unsupervised Bi-directional Flow-based Video Generation from one Snapshot | Mar 3, 2019 | Video Generation | —Unverified | 0 |
| V3GAN: Decomposing Background, Foreground and Motion for Video Generation | Mar 26, 2022 | Generative Adversarial NetworkVideo Generation | —Unverified | 0 |
| VACT: A Video Automatic Causal Testing System and a Benchmark | Mar 8, 2025 | Large Language ModelVideo Generation | —Unverified | 0 |
| VAST 1.0: A Unified Framework for Controllable and Consistent Video Generation | Dec 21, 2024 | Video Generation | —Unverified | 0 |
| VD3D: Taming Large Video Diffusion Transformers for 3D Camera Control | Jul 17, 2024 | Video Generation | —Unverified | 0 |
| VEnhancer: Generative Space-Time Enhancement for Video Generation | Jul 10, 2024 | Data AugmentationSuper-Resolution | —Unverified | 0 |
| V-Express: Conditional Dropout for Progressive Training of Portrait Video Generation | Jun 4, 2024 | Video Generation | —Unverified | 0 |
| VFRTok: Variable Frame Rates Video Tokenizer with Duration-Proportional Information Assumption | May 17, 2025 | DecoderPosition | —Unverified | 0 |