| Loopy: Taming Audio-Driven Portrait Avatar with Long-Term Motion Dependency | Sep 4, 2024 | Video Generation | —Unverified | 0 | 0 |
| LoViC: Efficient Long Video Generation with Context Compression | Jul 17, 2025 | Text-to-Video GenerationVideo Generation | —Unverified | 0 | 0 |
| LuciBot: Automated Robot Policy Learning from Generated Videos | Mar 12, 2025 | Video Generation | —Unverified | 0 | 0 |
| LumiSculpt: A Consistency Lighting Control Network for Video Generation | Oct 30, 2024 | Video Generation | —Unverified | 0 | 0 |
| Lyric Video Analysis Using Text Detection and Tracking | Jun 21, 2020 | ClusteringDynamic Time Warping | —Unverified | 0 | 0 |
| M4V: Multi-Modal Mamba for Text-to-Video Generation | Jun 12, 2025 | MambaText-to-Video Generation | —Unverified | 0 | 0 |
| MagicAvatar: Multimodal Avatar Generation and Animation | Aug 28, 2023 | Video Generation | —Unverified | 0 | 0 |
| MagicComp: Training-free Dual-Phase Refinement for Compositional Video Generation | Mar 18, 2025 | DenoisingVideo Generation | —Unverified | 0 | 0 |
| MagicDrive3D: Controllable 3D Generation for Any-View Rendering in Street Scenes | May 23, 2024 | 3D GenerationAutonomous Driving | —Unverified | 0 | 0 |
| MagicDriveDiT: High-Resolution Long Video Generation for Autonomous Driving with Adaptive Control | Nov 21, 2024 | Autonomous DrivingVideo Generation | —Unverified | 0 | 0 |
| MagicInfinite: Generating Infinite Talking Videos with Your Words and Voice | Mar 7, 2025 | DenoisingPortrait Animation | —Unverified | 0 | 0 |
| MAGIC: Motion-Aware Generative Inference via Confidence-Guided LLM | May 22, 2025 | 3D GenerationVideo Generation | —Unverified | 0 | 0 |
| MagicMotion: Controllable Video Generation with Dense-to-Sparse Trajectory Guidance | Mar 20, 2025 | Image to Video GenerationObject | —Unverified | 0 | 0 |
| MagicVideo: Efficient Video Generation With Latent Diffusion Models | Nov 20, 2022 | GPUText-to-Video Generation | —Unverified | 0 | 0 |
| MagicVideo-V2: Multi-Stage High-Aesthetic Video Generation | Jan 9, 2024 | MORPHVideo Generation | —Unverified | 0 | 0 |
| Make-An-Animation: Large-Scale Text-conditional 3D Human Motion Generation | May 16, 2023 | Motion GenerationMotion Synthesis | —Unverified | 0 | 0 |
| Make-A-Protagonist: Generic Video Editing with An Ensemble of Experts | May 15, 2023 | DenoisingVideo Editing | —Unverified | 0 | 0 |
| Make Pixels Dance: High-Dynamic Video Generation | Nov 18, 2023 | Text-to-Video GenerationVideo Generation | —Unverified | 0 | 0 |
| Make-Your-Video: Customized Video Generation Using Textual and Structural Guidance | Jun 1, 2023 | Image GenerationVideo Generation | —Unverified | 0 | 0 |
| MALT Diffusion: Memory-Augmented Latent Transformers for Any-Length Video Generation | Feb 18, 2025 | Text-to-Video GenerationVideo Generation | —Unverified | 0 | 0 |
| ManipDreamer: Boosting Robotic Manipulation World Model with Action Tree and Visual Guidance | Apr 23, 2025 | Instruction FollowingSSIM | —Unverified | 0 | 0 |
| ManiVideo: Generating Hand-Object Manipulation Video with Dexterous and Generalizable Grasping | Dec 18, 2024 | ObjectVideo Generation | —Unverified | 0 | 0 |
| VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation | Jun 21, 2024 | Video GenerationVideo Quality Assessment | —Unverified | 0 | 0 |
| MarDini: Masked Autoregressive Diffusion for Video Generation at Scale | Oct 26, 2024 | Image to Video GenerationVideo Generation | —Unverified | 0 | 0 |
| Markov Decision Process for Video Generation | Sep 26, 2019 | DiversityVideo Generation | —Unverified | 0 | 0 |
| Martian World Models: Controllable Video Synthesis with Physically Accurate 3D Reconstructions | Jul 10, 2025 | Video Generation | —Unverified | 0 | 0 |
| Mask^2DiT: Dual Mask-based Diffusion Transformer for Multi-Scene Long Video Generation | Mar 25, 2025 | text annotationVideo Generation | —Unverified | 0 | 0 |
| Mask^2DiT: Dual Mask-based Diffusion Transformer for Multi-Scene Long Video Generation | Jan 1, 2025 | text annotationVideo Generation | —Unverified | 0 | 0 |
| MaskFlow: Discrete Flows For Flexible and Efficient Long Video Generation | Feb 16, 2025 | Video Generation | —Unverified | 0 | 0 |
| Matten: Video Generation with Mamba-Attention | May 5, 2024 | MambaVideo Generation | —Unverified | 0 | 0 |
| Medical Video Generation for Disease Progression Simulation | Nov 18, 2024 | PrognosisVideo Generation | —Unverified | 0 | 0 |
| MEMO: Memory-Guided Diffusion for Expressive Talking Video Generation | Dec 5, 2024 | Portrait AnimationVideo Generation | —Unverified | 0 | 0 |
| MEt3R: Measuring Multi-View Consistency in Generated Images | Jan 10, 2025 | Image GenerationVideo Generation | —Unverified | 0 | 0 |
| MG-Gen: Single Image to Motion Graphics Generation with Layer Decomposition | Apr 3, 2025 | Code GenerationImage to Video Generation | —Unverified | 0 | 0 |
| MicroCinema: A Divide-and-Conquer Approach for Text-to-Video Generation | Nov 30, 2023 | Image GenerationText to Image Generation | —Unverified | 0 | 0 |
| Mimir: Improving Video Diffusion Models for Precise Text Understanding | Dec 4, 2024 | DecoderReading Comprehension | —Unverified | 0 | 0 |
| Mind the Time: Temporally-Controlled Multi-Event Video Generation | Dec 6, 2024 | Video Generation | —Unverified | 0 | 0 |
| MinD: Unified Visual Imagination and Control via Hierarchical World Models | Jun 23, 2025 | Video GenerationVideo Prediction | —Unverified | 0 | 0 |
| MiniMax-Remover: Taming Bad Noise Helps Video Object Removal | May 30, 2025 | Video EditingVideo Generation | —Unverified | 0 | 0 |
| MJ-VIDEO: Fine-Grained Benchmarking and Rewarding Video Preferences in Video Generation | Feb 3, 2025 | BenchmarkingFairness | —Unverified | 0 | 0 |
| MobileVidFactory: Automatic Diffusion-Based Social Media Video Generation for Mobile Devices from Text | Jul 31, 2023 | Video Generation | —Unverified | 0 | 0 |
| MoCha: Towards Movie-Grade Talking Character Synthesis | Mar 30, 2025 | Video Generation | —Unverified | 0 | 0 |
| Modular-Cam: Modular Dynamic Camera-view Video Generation with LLM | Apr 16, 2025 | Large Language ModelText-to-Video Generation | —Unverified | 0 | 0 |
| Mojito: Motion Trajectory and Intensity Control for Video Generation | Dec 12, 2024 | Computational EfficiencyOptical Flow Estimation | —Unverified | 0 | 0 |
| Morpheus: Benchmarking Physical Reasoning of Video Generative Models with Real Physical Experiments | Apr 3, 2025 | Physical Commonsense ReasoningVideo Generation | —Unverified | 0 | 0 |
| MotionAgent: Fine-grained Controllable Video Generation via Motion Field Agent | Feb 5, 2025 | Image to Video GenerationMotion Generation | —Unverified | 0 | 0 |
| Motion-Aware Generative Frame Interpolation | Jan 7, 2025 | Video Generation | —Unverified | 0 | 0 |
| Motion-aware Latent Diffusion Models for Video Frame Interpolation | Apr 21, 2024 | Motion EstimationVideo Frame Interpolation | —Unverified | 0 | 0 |
| MotionBooth: Motion-Aware Customized Text-to-Video Generation | Jun 25, 2024 | Text-to-Video GenerationVideo Generation | —Unverified | 0 | 0 |
| MotionBridge: Dynamic Video Inbetweening with Flexible Controls | Dec 17, 2024 | Video EditingVideo Generation | —Unverified | 0 | 0 |