VSTAR: Generative Temporal Nursing for Longer Dynamic Video Synthesis Mar 20, 2024 Generative Temporal Nursing Text-to-Video Generation
Code Code Available 15 FusionFrames: Efficient Architectural Aspects for Text-to-Video Generation Pipeline Nov 22, 2023 SSIM Text-to-Video Generation
Code Code Available 15 IV-Mixed Sampler: Leveraging Image Diffusion Models for Enhanced Video Synthesis Oct 5, 2024 Text-to-Video Generation
Code Code Available 15 Free-Bloom: Zero-Shot Text-to-Video Generator with LLM Director and LDM Animator Sep 25, 2023 Text-to-Video Generation Video Generation
Code Code Available 15 Swap Attention in Spatiotemporal Diffusions for Text-to-Video Generation May 18, 2023 Image Generation Text to Image Generation
Code Code Available 15 VidCapBench: A Comprehensive Benchmark of Video Captioning for Controllable Text-to-Video Generation Feb 18, 2025 Text-to-Video Generation Video Captioning
Code Code Available 15 AIGV-Assessor: Benchmarking and Evaluating the Perceptual Quality of Text-to-Video Generation with LMM Nov 26, 2024 Benchmarking Text-to-Video Generation
Code Code Available 15 ConditionVideo: Training-Free Condition-Guided Text-to-Video Generation Oct 11, 2023 Image Generation Text to Image Generation
Code Code Available 15 Tell Me What Happened: Unifying Text-guided Video Completion via Multimodal Masked Video Generation Nov 23, 2022 Text-to-Video Generation Video Generation
Code Code Available 15 LOVE: Benchmarking and Evaluating Text-to-Video Generation and Video-to-Text Interpretation May 17, 2025 Benchmarking Question Answering
Code Code Available 15 TALC: Time-Aligned Captions for Multi-Scene Text-to-Video Generation May 7, 2024 Text-to-Video Generation Video Generation
Code Code Available 15 Generative Disco: Text-to-Video Generation for Music Visualization Apr 17, 2023 Text-to-Video Generation Video Generation
Code Code Available 15 FETV: A Benchmark for Fine-Grained Evaluation of Open-Domain Text-to-Video Generation Nov 3, 2023 Text-to-Video Generation Video Generation
Code Code Available 15 LAVIE: High-Quality Video Generation with Cascaded Latent Diffusion Models Sep 26, 2023 Super-Resolution Text-to-Video Generation
Code Code Available 15 DirecT2V: Large Language Models are Frame-Level Directors for Zero-Shot Text-to-Video Generation May 23, 2023 Text-to-Video Generation Video Generation
Code Code Available 15 EvalCrafter: Benchmarking and Evaluating Large Video Generation Models Oct 17, 2023 Benchmarking Language Modelling
Code Code Available 15 Sketching the Future (STF): Applying Conditional Control Techniques to Text-to-Video Models May 10, 2023 Text-to-Video Generation Video Generation
Code Code Available 15 Reuse and Diffuse: Iterative Denoising for Text-to-Video Generation Sep 7, 2023 Action Recognition Decoder
Code Code Available 15 Enhancing Motion in Text-to-Video Generation with Decomposed Encoding and Conditioning Oct 31, 2024 Motion Synthesis Text-to-Video Generation
Code Code Available 15 VANE-Bench: Video Anomaly Evaluation Benchmark for Conversational LMMs Jun 14, 2024 Anomaly Detection Benchmarking
Code Code Available 15 SafeSora: Towards Safety Alignment of Text2Video Generation via a Human Preference Dataset Jun 20, 2024 Safety Alignment Text-to-Video Generation
Code Code Available 15 PEEKABOO: Interactive Video Generation via Masked-Diffusion Dec 12, 2023 Text-to-Video Generation Video Generation
Code Code Available 15 OnlyFlow: Optical Flow based Motion Conditioning for Video Diffusion Models Nov 15, 2024 Optical Flow Estimation Text-to-Video Generation
Code Code Available 15 NÜWA: Visual Synthesis Pre-training for Neural visUal World creAtion Nov 24, 2021 Decoder Image Generation
Code Code Available 15 MUGEN: A Playground for Video-Audio-Text Multimodal Understanding and GENeration Apr 17, 2022 Navigate Retrieval
Code Code Available 15 MotionCrafter: One-Shot Motion Customization of Diffusion Models Dec 8, 2023 Disentanglement Motion Disentanglement
Code Code Available 15 MMTrail: A Multimodal Trailer Video Dataset with Language and Music Descriptions Jul 30, 2024 Audio Generation Image to Video Generation
Code Code Available 15 VCapsBench: A Large-scale Fine-grained Benchmark for Video Caption Quality Evaluation May 29, 2025 Caption Generation Language Modeling
Code Code Available 15 GODIVA: Generating Open-DomaIn Videos from nAtural Descriptions Apr 30, 2021 Text-to-Video Generation Video Generation
Code Code Available 15 Align your Latents: High-Resolution Video Synthesis with Latent Diffusion Models Apr 18, 2023 Image Generation Super-Resolution
Code Code Available 15 Diverse and Aligned Audio-to-Video Generation via Text-to-Video Model Adaptation Sep 28, 2023 Text-to-Video Generation Video Generation
Code Code Available 15 Make-A-Video: Text-to-Video Generation without Text-Video Data Sep 29, 2022 Decoder Image Generation
Code Code Available 15 A Recipe for Scaling up Text-to-Video Generation with Text-free Videos Dec 25, 2023 Image Generation Text to Image Generation
Code Code Available 05 VideoUFO: A Million-Scale User-Focused Dataset for Text-to-Video Generation Mar 3, 2025 Text-to-Video Generation Video Generation
Code Code Available 05 VG-TVP: Multimodal Procedural Planning via Visually Grounded Text-Video Prompting Dec 16, 2024 Informativeness Large Language Model
Code Code Available 05 Magic 1-For-1: Generating One Minute Video Clips within One Minute Feb 11, 2025 Image Generation Image to Video Generation
Code Code Available 05 Sync-DRAW: Automatic Video Generation using Deep Recurrent Attentive Architectures Nov 30, 2016 Text-to-Video Generation Video Generation
Code Code Available 05 InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding and Generation Jul 13, 2023 Action Recognition Contrastive Learning
Code Code Available 05 InfLVG: Reinforce Inference-Time Consistent Long Video Generation with GRPO May 23, 2025 Text-to-Video Generation Video Generation
Code Code Available 05 Enabling Versatile Controls for Video Diffusion Models Mar 21, 2025 Text-to-Video Generation Video Generation
Code Code Available 05 Protecting Your Video Content: Disrupting Automated Video-based LLM Annotations Mar 26, 2025 Descriptive Text-to-Video Generation
Code Code Available 05 Hierarchical Spatio-temporal Decoupling for Text-to-Video Generation Dec 7, 2023 Spatial Reasoning Text-to-Video Generation
Code Code Available 05 Neuro-Symbolic Evaluation of Text-to-Video Models using Formal Verification Nov 22, 2024 Autonomous Driving Text-to-Video Generation
Code Code Available 05 RecTable: Fast Modeling Tabular Data with Rectified Flow Mar 26, 2025 Image Generation Text to Image Generation
Code Code Available 05 MotionAura: Generating High-Quality and Motion Consistent Videos using Discrete Diffusion Oct 10, 2024 Denoising parameter-efficient fine-tuning
Code Code Available 05 Mobius: A High Efficient Spatial-Temporal Parallel Training Paradigm for Text-to-Video Generation Task Jul 9, 2024 GPU Text-to-Video Generation
Code Code Available 05 REGIS: Refining Generated Videos via Iterative Stylistic Redesigning Nov 3, 2023 Text-to-Video Generation Video Generation
Code Code Available 05 GenMAC: Compositional Text-to-Video Generation with Multi-Agent Collaboration Dec 5, 2024 Attribute Hallucination
— Unverified 00 Gender Bias in Text-to-Video Generation Models: A case study of Sora Dec 30, 2024 Text-to-Video Generation Video Generation
— Unverified 00 DisenStudio: Customized Multi-subject Text-to-Video Generation with Disentangled Spatial Control May 21, 2024 Attribute Motion Generation
— Unverified 00