GPT4Video: A Unified Multimodal Large Language Model for lnstruction-Followed Understanding and Safety-Aware Generation Nov 25, 2023 Instruction Following Language Modeling
— Unverified 0FusionFrames: Efficient Architectural Aspects for Text-to-Video Generation Pipeline Nov 22, 2023 SSIM Text-to-Video Generation
Code Code Available 1GPT4Motion: Scripting Physical Motions in Text-to-Video Generation via Blender-Oriented GPT Planning Nov 21, 2023 Image Generation Text-to-Video Generation
— Unverified 0Make Pixels Dance: High-Dynamic Video Generation Nov 18, 2023 Text-to-Video Generation Video Generation
— Unverified 0Emu Video: Factorizing Text-to-Video Generation by Explicit Image Conditioning Nov 17, 2023 Text-to-Video Generation Video Generation
— Unverified 0FETV: A Benchmark for Fine-Grained Evaluation of Open-Domain Text-to-Video Generation Nov 3, 2023 Text-to-Video Generation Video Generation
Code Code Available 1REGIS: Refining Generated Videos via Iterative Stylistic Redesigning Nov 3, 2023 Text-to-Video Generation Video Generation
Code Code Available 0VideoDreamer: Customized Multi-Subject Text-to-Video Generation with Disen-Mix Finetuning Nov 2, 2023 Attribute Text-to-Video Generation
— Unverified 0POS: A Prompts Optimization Suite for Augmenting Text-to-Video Generation Nov 2, 2023 Denoising POS
— Unverified 0VideoCrafter1: Open Diffusion Models for High-Quality Video Generation Oct 30, 2023 Text-to-Video Generation Video Generation
Code Code Available 5EvalCrafter: Benchmarking and Evaluating Large Video Generation Models Oct 17, 2023 Benchmarking Language Modelling
Code Code Available 1ConditionVideo: Training-Free Condition-Guided Text-to-Video Generation Oct 11, 2023 Image Generation Text to Image Generation
Code Code Available 1Diverse and Aligned Audio-to-Video Generation via Text-to-Video Model Adaptation Sep 28, 2023 Text-to-Video Generation Video Generation
Code Code Available 1Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation Sep 27, 2023 GPU Text-to-Video Generation
Code Code Available 3LAVIE: High-Quality Video Generation with Cascaded Latent Diffusion Models Sep 26, 2023 Super-Resolution Text-to-Video Generation
Code Code Available 1Free-Bloom: Zero-Shot Text-to-Video Generator with LLM Director and LDM Animator Sep 25, 2023 Text-to-Video Generation Video Generation
Code Code Available 1Reuse and Diffuse: Iterative Denoising for Text-to-Video Generation Sep 7, 2023 Action Recognition Decoder
Code Code Available 1VideoGen: A Reference-Guided Latent Diffusion Approach for High Definition Text-to-Video Generation Sep 1, 2023 Decoder Image Generation
— Unverified 0Dual-Stream Diffusion Net for Text-to-Video Generation Aug 16, 2023 Text-to-Video Generation Video Generation
— Unverified 0ModelScope Text-to-Video Technical Report Aug 12, 2023 Denoising Image Generation
Code Code Available 3InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding and Generation Jul 13, 2023 Action Recognition Contrastive Learning
Code Code Available 0VideoComposer: Compositional Video Synthesis with Motion Controllability Jun 3, 2023 Image Generation Text-to-Video Generation
Code Code Available 2Control-A-Video: Controllable Text-to-Video Diffusion Models with Motion Prior and Reward Feedback Learning May 23, 2023 Image Generation Optical Flow Estimation
Code Code Available 2DirecT2V: Large Language Models are Frame-Level Directors for Zero-Shot Text-to-Video Generation May 23, 2023 Text-to-Video Generation Video Generation
Code Code Available 1ControlVideo: Training-free Controllable Text-to-Video Generation May 22, 2023 Image Generation Text-to-Video Generation
Code Code Available 2Swap Attention in Spatiotemporal Diffusions for Text-to-Video Generation May 18, 2023 Image Generation Text to Image Generation
Code Code Available 1Preserve Your Own Correlation: A Noise Prior for Video Diffusion Models May 17, 2023 Image Generation Text-to-Video Generation
— Unverified 0Make-An-Animation: Large-Scale Text-conditional 3D Human Motion Generation May 16, 2023 Motion Generation Motion Synthesis
— Unverified 0Sketching the Future (STF): Applying Conditional Control Techniques to Text-to-Video Models May 10, 2023 Text-to-Video Generation Video Generation
Code Code Available 1Align your Latents: High-Resolution Video Synthesis with Latent Diffusion Models Apr 18, 2023 Image Generation Super-Resolution
Code Code Available 1Latent-Shift: Latent Diffusion with Temporal Shift for Efficient Text-to-Video Generation Apr 17, 2023 Image Generation Super-Resolution
— Unverified 0Generative Disco: Text-to-Video Generation for Music Visualization Apr 17, 2023 Text-to-Video Generation Video Generation
Code Code Available 1Follow Your Pose: Pose-Guided Text-to-Video Generation using Pose-Free Videos Apr 3, 2023 Image Generation Text to Image Generation
Code Code Available 3CelebV-Text: A Large-Scale Facial Text-Video Dataset Mar 26, 2023 Text Generation Text-to-Video Generation
Code Code Available 2Text2Video-Zero: Text-to-Image Diffusion Models are Zero-Shot Video Generators Mar 23, 2023 Image Generation Text-to-Video Generation
Code Code Available 4VideoFusion: Decomposed Diffusion Models for High-Quality Video Generation Mar 15, 2023 Code Generation Denoising
Code Code Available 4Structure and Content-Guided Video Synthesis with Diffusion Models Feb 6, 2023 Disentanglement Text-to-Video Generation
— Unverified 0Make-An-Audio: Text-To-Audio Generation with Prompt-Enhanced Diffusion Models Jan 30, 2023 Audio Generation Text-to-Video Generation
Code Code Available 2Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation Dec 22, 2022 Style Transfer Text-to-Video Generation
Code Code Available 4MAGVIT: Masked Generative Video Transformer Dec 10, 2022 Multi-Task Learning Text-to-Video Generation
Code Code Available 2Latent Video Diffusion Models for High-Fidelity Long Video Generation Nov 23, 2022 Denoising Image Generation
Code Code Available 2Tell Me What Happened: Unifying Text-guided Video Completion via Multimodal Masked Video Generation Nov 23, 2022 Text-to-Video Generation Video Generation
Code Code Available 1MagicVideo: Efficient Video Generation With Latent Diffusion Models Nov 20, 2022 GPU Text-to-Video Generation
— Unverified 0Make-A-Video: Text-to-Video Generation without Text-Video Data Sep 29, 2022 Decoder Image Generation
Code Code Available 1FlexLip: A Controllable Text-to-Lip System Jun 7, 2022 Audio Generation text-to-speech
— Unverified 0CogVideo: Large-scale Pretraining for Text-to-Video Generation via Transformers May 29, 2022 Text-to-Video Generation Video Generation
Code Code Available 6MUGEN: A Playground for Video-Audio-Text Multimodal Understanding and GENeration Apr 17, 2022 Navigate Retrieval
Code Code Available 1NÜWA: Visual Synthesis Pre-training for Neural visUal World creAtion Nov 24, 2021 Decoder Image Generation
Code Code Available 1Video Generation from Text Employing Latent Path Construction for Temporal Modeling Jul 29, 2021 Text-to-Video Generation Video Generation
— Unverified 0GODIVA: Generating Open-DomaIn Videos from nAtural Descriptions Apr 30, 2021 Text-to-Video Generation Video Generation
Code Code Available 1