Open-Sora: Democratizing Efficient Video Production for All Dec 29, 2024 All Image Generation
Code Code Available 135 CogVideoX: Text-to-Video Diffusion Models with An Expert Transformer Aug 12, 2024 Text-to-Video Generation Video Alignment
Code Code Available 115 VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models Jan 17, 2024 Text-to-Video Generation Video Generation
Code Code Available 95 Pyramidal Flow Matching for Efficient Video Generative Modeling Oct 8, 2024 GPU Text-to-Video Generation
Code Code Available 75 CogVideo: Large-scale Pretraining for Text-to-Video Generation via Transformers May 29, 2022 Text-to-Video Generation Video Generation
Code Code Available 65 ChronoMagic-Bench: A Benchmark for Metamorphic Evaluation of Text-to-Time-lapse Video Generation Jun 26, 2024 Text-to-Video Generation Video Generation
Code Code Available 55 Mora: Enabling Generalist Video Generation via A Multi-Agent Framework Mar 20, 2024 Image to Video Generation Text-to-Video Generation
Code Code Available 55 Latte: Latent Diffusion Transformer for Video Generation Jan 5, 2024 Text-to-Video Generation Video Generation
Code Code Available 55 StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text Mar 21, 2024 Text-to-Video Generation Video Generation
Code Code Available 55 VideoCrafter1: Open Diffusion Models for High-Quality Video Generation Oct 30, 2023 Text-to-Video Generation Video Generation
Code Code Available 55 MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators Apr 7, 2024 Text-to-Video Generation Video Generation
Code Code Available 55 Video-LaVIT: Unified Video-Language Pre-training with Decoupled Visual-Motional Tokenization Feb 5, 2024 Science Question Answering Text-to-Video Generation
Code Code Available 45 MotionClone: Training-Free Motion Cloning for Controllable Video Generation Jun 8, 2024 Denoising Motion Generation
Code Code Available 45 VideoFusion: Decomposed Diffusion Models for High-Quality Video Generation Mar 15, 2023 Code Generation Denoising
Code Code Available 45 Text2Video-Zero: Text-to-Image Diffusion Models are Zero-Shot Video Generators Mar 23, 2023 Image Generation Text-to-Video Generation
Code Code Available 45 TransPixeler: Advancing Text-to-Video Generation with Transparency Jan 6, 2025 Text-to-Video Generation Video Generation
Code Code Available 45 CameraCtrl: Enabling Camera Control for Text-to-Video Generation Apr 2, 2024 Text-to-Video Generation Video Generation
Code Code Available 45 Identity-Preserving Text-to-Video Generation by Frequency Decomposition Nov 26, 2024 Human-Domain Subject-to-Video Image to Video Generation
Code Code Available 45 Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation Dec 22, 2022 Style Transfer Text-to-Video Generation
Code Code Available 45 Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models Jan 14, 2025 Benchmarking Text-to-Video Generation
Code Code Available 45 VideoTetris: Towards Compositional Text-to-Video Generation Jun 6, 2024 Denoising Text-to-Video Generation
Code Code Available 35 Lumiere: A Space-Time Diffusion Model for Video Generation Jan 23, 2024 Super-Resolution Text-to-Video Generation
Code Code Available 35 ModelScope Text-to-Video Technical Report Aug 12, 2023 Denoising Image Generation
Code Code Available 35 FlashVideo:Flowing Fidelity to Detail for Efficient High-Resolution Video Generation Feb 7, 2025 Computational Efficiency Text-to-Video Generation
Code Code Available 35 GameGen-X: Interactive Open-world Game Video Generation Nov 1, 2024 Text-to-Video Generation Video Generation
Code Code Available 35 Evaluation of Text-to-Video Generation Models: A Dynamics Perspective Jul 1, 2024 Text-to-Video Generation Video Generation
Code Code Available 35 FIFO-Diffusion: Generating Infinite Videos from Text without Training May 19, 2024 Text-to-Video Generation Video Generation
Code Code Available 35 From Sora What We Can See: A Survey of Text-to-Video Generation May 17, 2024 Text-to-Video Generation Video Generation
Code Code Available 35 Follow Your Pose: Pose-Guided Text-to-Video Generation using Pose-Free Videos Apr 3, 2023 Image Generation Text to Image Generation
Code Code Available 35 Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation Sep 27, 2023 GPU Text-to-Video Generation
Code Code Available 35 Make-An-Audio: Text-To-Audio Generation with Prompt-Enhanced Diffusion Models Jan 30, 2023 Audio Generation Text-to-Video Generation
Code Code Available 25 MAGVIT: Masked Generative Video Transformer Dec 10, 2022 Multi-Task Learning Text-to-Video Generation
Code Code Available 25 ControlVideo: Training-free Controllable Text-to-Video Generation May 22, 2023 Image Generation Text-to-Video Generation
Code Code Available 25 Magic Mirror: ID-Preserved Video Generation in Video Diffusion Transformers Jan 7, 2025 Diversity Text-to-Video Generation
Code Code Available 25 Control-A-Video: Controllable Text-to-Video Diffusion Models with Motion Prior and Reward Feedback Learning May 23, 2023 Image Generation Optical Flow Estimation
Code Code Available 25 VideoComposer: Compositional Video Synthesis with Motion Controllability Jun 3, 2023 Image Generation Text-to-Video Generation
Code Code Available 25 Video Diffusion Models: A Survey May 6, 2024 Survey Text-to-Video Generation
Code Code Available 25 VidProM: A Million-scale Real Prompt-Gallery Dataset for Text-to-Video Diffusion Models Mar 10, 2024 Copy Detection Image Generation
Code Code Available 25 T2V-CompBench: A Comprehensive Benchmark for Compositional Text-to-video Generation Jul 19, 2024 Attribute Language Modeling
Code Code Available 25 StyleCrafter: Enhancing Stylized Text-to-Video Generation with Style Adapter Dec 1, 2023 Disentanglement Text-to-Video Generation
Code Code Available 25 Latent Video Diffusion Models for High-Fidelity Long Video Generation Nov 23, 2022 Denoising Image Generation
Code Code Available 25 GenAI Arena: An Open Evaluation Platform for Generative Models Jun 6, 2024 Image Generation Instruction Following
Code Code Available 25 Generate Any Scene: Evaluating and Improving Text-to-Vision Generation with Scene Graph Programming Dec 11, 2024 Text to 3D Text-to-Image Generation
Code Code Available 25 PhyT2V: LLM-Guided Iterative Self-Refinement for Physics-Grounded Text-to-Video Generation Nov 30, 2024 Text-to-Video Generation Video Generation
Code Code Available 25 CelebV-Text: A Large-Scale Facial Text-Video Dataset Mar 26, 2023 Text Generation Text-to-Video Generation
Code Code Available 25 On-device Sora: Enabling Training-Free Diffusion-based Text-to-Video Generation for Mobile Devices Mar 31, 2025 Denoising Model Optimization
Code Code Available 25 Divot: Diffusion Powers Video Tokenizer for Comprehension and Generation Dec 5, 2024 Image Comprehension Representation Learning
Code Code Available 25 FreeInit: Bridging Initialization Gap in Video Diffusion Models Dec 12, 2023 Denoising Text-to-Video Generation
Code Code Available 25 On-device Sora: Enabling Training-Free Diffusion-based Text-to-Video Generation for Mobile Devices Feb 5, 2025 Denoising Model Optimization
Code Code Available 25 Make-A-Video: Text-to-Video Generation without Text-Video Data Sep 29, 2022 Decoder Image Generation
Code Code Available 15