LOVE: Benchmarking and Evaluating Text-to-Video Generation and Video-to-Text Interpretation | May 17, 2025 | Tasks: Benchmarking, Question Answering | Code Available | 1
VPO: Aligning Text-to-Video Generation Models with Prompt Optimization | Mar 26, 2025 | Tasks: In-Context Learning, Safety Alignment | Code Available | 1
VidCapBench: A Comprehensive Benchmark of Video Captioning for Controllable Text-to-Video Generation | Feb 18, 2025 | Tasks: Text-to-Video Generation, Video Captioning | Code Available | 1
AIGV-Assessor: Benchmarking and Evaluating the Perceptual Quality of Text-to-Video Generation with LMM | Nov 26, 2024 | Tasks: Benchmarking, Text-to-Video Generation | Code Available | 1
OnlyFlow: Optical Flow based Motion Conditioning for Video Diffusion Models | Nov 15, 2024 | Tasks: Optical Flow Estimation, Text-to-Video Generation | Code Available | 1
Enhancing Motion in Text-to-Video Generation with Decomposed Encoding and Conditioning | Oct 31, 2024 | Tasks: Motion Synthesis, Text-to-Video Generation | Code Available | 1
IV-Mixed Sampler: Leveraging Image Diffusion Models for Enhanced Video Synthesis | Oct 5, 2024 | Tasks: Text-to-Video Generation | Code Available | 1
MMTrail: A Multimodal Trailer Video Dataset with Language and Music Descriptions | Jul 30, 2024 | Tasks: Audio Generation, Image to Video Generation | Code Available | 1
SafeSora: Towards Safety Alignment of Text2Video Generation via a Human Preference Dataset | Jun 20, 2024 | Tasks: Safety Alignment, Text-to-Video Generation | Code Available | 1
VANE-Bench: Video Anomaly Evaluation Benchmark for Conversational LMMs | Jun 14, 2024 | Tasks: Anomaly Detection, Benchmarking | Code Available | 1
TALC: Time-Aligned Captions for Multi-Scene Text-to-Video Generation | May 7, 2024 | Tasks: Text-to-Video Generation, Video Generation | Code Available | 1
VSTAR: Generative Temporal Nursing for Longer Dynamic Video Synthesis | Mar 20, 2024 | Tasks: Generative Temporal Nursing, Text-to-Video Generation | Code Available | 1
PEEKABOO: Interactive Video Generation via Masked-Diffusion | Dec 12, 2023 | Tasks: Text-to-Video Generation, Video Generation | Code Available | 1
MotionCrafter: One-Shot Motion Customization of Diffusion Models | Dec 8, 2023 | Tasks: Disentanglement, Motion Disentanglement | Code Available | 1
FusionFrames: Efficient Architectural Aspects for Text-to-Video Generation Pipeline | Nov 22, 2023 | Tasks: SSIM, Text-to-Video Generation | Code Available | 1
FETV: A Benchmark for Fine-Grained Evaluation of Open-Domain Text-to-Video Generation | Nov 3, 2023 | Tasks: Text-to-Video Generation, Video Generation | Code Available | 1
EvalCrafter: Benchmarking and Evaluating Large Video Generation Models | Oct 17, 2023 | Tasks: Benchmarking, Language Modelling | Code Available | 1
ConditionVideo: Training-Free Condition-Guided Text-to-Video Generation | Oct 11, 2023 | Tasks: Image Generation, Text to Image Generation | Code Available | 1
Diverse and Aligned Audio-to-Video Generation via Text-to-Video Model Adaptation | Sep 28, 2023 | Tasks: Text-to-Video Generation, Video Generation | Code Available | 1
LAVIE: High-Quality Video Generation with Cascaded Latent Diffusion Models | Sep 26, 2023 | Tasks: Super-Resolution, Text-to-Video Generation | Code Available | 1
Free-Bloom: Zero-Shot Text-to-Video Generator with LLM Director and LDM Animator | Sep 25, 2023 | Tasks: Text-to-Video Generation, Video Generation | Code Available | 1
Reuse and Diffuse: Iterative Denoising for Text-to-Video Generation | Sep 7, 2023 | Tasks: Action Recognition, Decoder | Code Available | 1
DirecT2V: Large Language Models are Frame-Level Directors for Zero-Shot Text-to-Video Generation | May 23, 2023 | Tasks: Text-to-Video Generation, Video Generation | Code Available | 1
Swap Attention in Spatiotemporal Diffusions for Text-to-Video Generation | May 18, 2023 | Tasks: Image Generation, Text to Image Generation | Code Available | 1
Sketching the Future (STF): Applying Conditional Control Techniques to Text-to-Video Models | May 10, 2023 | Tasks: Text-to-Video Generation, Video Generation | Code Available | 1
Align your Latents: High-Resolution Video Synthesis with Latent Diffusion Models | Apr 18, 2023 | Tasks: Image Generation, Super-Resolution | Code Available | 1
Generative Disco: Text-to-Video Generation for Music Visualization | Apr 17, 2023 | Tasks: Text-to-Video Generation, Video Generation | Code Available | 1
Tell Me What Happened: Unifying Text-guided Video Completion via Multimodal Masked Video Generation | Nov 23, 2022 | Tasks: Text-to-Video Generation, Video Generation | Code Available | 1
Make-A-Video: Text-to-Video Generation without Text-Video Data | Sep 29, 2022 | Tasks: Decoder, Image Generation | Code Available | 1
MUGEN: A Playground for Video-Audio-Text Multimodal Understanding and GENeration | Apr 17, 2022 | Tasks: Navigate, Retrieval | Code Available | 1
NÜWA: Visual Synthesis Pre-training for Neural visUal World creAtion | Nov 24, 2021 | Tasks: Decoder, Image Generation | Code Available | 1
GODIVA: Generating Open-DomaIn Videos from nAtural Descriptions | Apr 30, 2021 | Tasks: Text-to-Video Generation, Video Generation | Code Available | 1
LoViC: Efficient Long Video Generation with Context Compression | Jul 17, 2025 | Tasks: Text-to-Video Generation, Video Generation | Unverified | 0
M4V: Multi-Modal Mamba for Text-to-Video Generation | Jun 12, 2025 | Tasks: Mamba, Text-to-Video Generation | Unverified | 0
Astraea: A GPU-Oriented Token-wise Acceleration Framework for Video Diffusion Transformers | Jun 5, 2025 | Tasks: GPU, Text-to-Video Generation | Unverified | 0
InfLVG: Reinforce Inference-Time Consistent Long Video Generation with GRPO | May 23, 2025 | Tasks: Text-to-Video Generation, Video Generation | Code Available | 0
Safe-Sora: Safe Text-to-Video Generation via Graphical Watermarking | May 19, 2025 | Tasks: Image Generation, Mamba | Unverified | 0
T2VTextBench: A Human Evaluation Benchmark for Textual Control in Video Generation Models | May 8, 2025 | Tasks: Instruction Following, Text-to-Video Generation | Unverified | 0
DualReal: Adaptive Joint Training for Lossless Identity-Motion Fusion in Video Customization | May 4, 2025 | Tasks: Denoising, Text-to-Video Generation | Unverified | 0
T2VPhysBench: A First-Principles Benchmark for Physical Consistency in Text-to-Video Generation | May 1, 2025 | Tasks: counterfactual, Instruction Following | Unverified | 0
We'll Fix it in Post: Improving Text-to-Video Generation with Neuro-Symbolic Feedback | Apr 24, 2025 | Tasks: Text-to-Video Generation, Video Generation | Unverified | 0
DyST-XL: Dynamic Layout Planning and Content Control for Compositional Text-to-Video Generation | Apr 21, 2025 | Tasks: Attribute, Denoising | Unverified | 0
Tiger200K: Manually Curated High Visual Quality Video Dataset from UGC Platform | Apr 21, 2025 | Tasks: Boundary Detection, Optical Character Recognition (OCR) | Unverified | 0
The Devil is in the Prompts: Retrieval-Augmented Prompt Optimization for Text-to-Video Generation | Apr 16, 2025 | Tasks: Sentence, Text-to-Video Generation | Unverified | 0
Modular-Cam: Modular Dynamic Camera-view Video Generation with LLM | Apr 16, 2025 | Tasks: Large Language Model, Text-to-Video Generation | Unverified | 0
H3AE: High Compression, High Speed, and High Quality AutoEncoder for Video Diffusion Models | Apr 14, 2025 | Tasks: Denoising, Text-to-Video Generation | Unverified | 0
Training-free Guidance in Text-to-Video Generation via Multimodal Planning and Structured Noise Initialization | Apr 11, 2025 | Tasks: Denoising, Object | Unverified | 0
VideoMage: Multi-Subject and Motion Customization of Text-to-Video Diffusion Models | Mar 27, 2025 | Tasks: Text-to-Video Generation, Video Generation | Unverified | 0
RecTable: Fast Modeling Tabular Data with Rectified Flow | Mar 26, 2025 | Tasks: Image Generation, Text to Image Generation | Code Available | 0
Protecting Your Video Content: Disrupting Automated Video-based LLM Annotations | Mar 26, 2025 | Tasks: Descriptive, Text-to-Video Generation | Code Available | 0