Open-Sora: Democratizing Efficient Video Production for All Dec 29, 2024 All Image Generation
Code Code Available 13Follow-Your-MultiPose: Tuning-Free Multi-Character Text-to-Video Generation via Pose Guidance Dec 21, 2024 Text-to-Video Generation Video Generation
— Unverified 0VideoDPO: Omni-Preference Alignment for Video Diffusion Generation Dec 18, 2024 Image Generation Text-to-Video Generation
— Unverified 0VG-TVP: Multimodal Procedural Planning via Visually Grounded Text-Video Prompting Dec 16, 2024 Informativeness Large Language Model
Code Code Available 0LinGen: Towards High-Resolution Minute-Length Text-to-Video Generation with Linear Computational Complexity Dec 13, 2024 GPU Mamba
— Unverified 0Mojito: Motion Trajectory and Intensity Control for Video Generation Dec 12, 2024 Computational Efficiency Optical Flow Estimation
— Unverified 0T-SVG: Text-Driven Stereoscopic Video Generation Dec 12, 2024 Depth Estimation Text-to-Video Generation
— Unverified 0InstanceCap: Improving Text-to-Video Generation via Instance-aware Structured Caption Dec 12, 2024 Text-to-Video Generation Video Generation
— Unverified 0Generate Any Scene: Evaluating and Improving Text-to-Vision Generation with Scene Graph Programming Dec 11, 2024 Text to 3D Text-to-Image Generation
Code Code Available 2Multi-Shot Character Consistency for Text-to-Video Generation Dec 10, 2024 Text-to-Video Generation Video Generation
— Unverified 0GenMAC: Compositional Text-to-Video Generation with Multi-Agent Collaboration Dec 5, 2024 Attribute Hallucination
— Unverified 0Divot: Diffusion Powers Video Tokenizer for Comprehension and Generation Dec 5, 2024 Image Comprehension Representation Learning
Code Code Available 2Improving Dynamic Object Interactions in Text-to-Video Generation with AI Feedback Dec 3, 2024 Object Offline RL
— Unverified 0CPA: Camera-pose-awareness Diffusion Transformer for Video Generation Dec 2, 2024 Text-to-Video Generation Video Generation
— Unverified 0PhyT2V: LLM-Guided Iterative Self-Refinement for Physics-Grounded Text-to-Video Generation Nov 30, 2024 Text-to-Video Generation Video Generation
Code Code Available 2Free^2Guide: Gradient-Free Path Integral Control for Enhancing Text-to-Video Generation with Large Vision-Language Models Nov 26, 2024 Reinforcement Learning (RL) Text-to-Video Generation
— Unverified 0AIGV-Assessor: Benchmarking and Evaluating the Perceptual Quality of Text-to-Video Generation with LMM Nov 26, 2024 Benchmarking Text-to-Video Generation
Code Code Available 1Identity-Preserving Text-to-Video Generation by Frequency Decomposition Nov 26, 2024 Human-Domain Subject-to-Video Image to Video Generation
Code Code Available 4DreamRunner: Fine-Grained Storytelling Video Generation with Retrieval-Augmented Motion Adaptation Nov 25, 2024 Large Language Model Motion Planning
— Unverified 0Neuro-Symbolic Evaluation of Text-to-Video Models using Formal Verification Nov 22, 2024 Autonomous Driving Text-to-Video Generation
Code Code Available 0VideoRepair: Improving Text-to-Video Generation via Misalignment Evaluation and Localized Refinement Nov 22, 2024 Text-to-Video Generation Video Alignment
— Unverified 0OnlyFlow: Optical Flow based Motion Conditioning for Video Diffusion Models Nov 15, 2024 Optical Flow Estimation Text-to-Video Generation
Code Code Available 1A Survey of Emerging Approaches and Advances in Video Generation Nov 9, 2024 Image to Video Generation Language Modeling
— Unverified 0GameGen-X: Interactive Open-world Game Video Generation Nov 1, 2024 Text-to-Video Generation Video Generation
Code Code Available 3Enhancing Motion in Text-to-Video Generation with Decomposed Encoding and Conditioning Oct 31, 2024 Motion Synthesis Text-to-Video Generation
Code Code Available 1GiVE: Guiding Visual Encoder to Perceive Overlooked Information Oct 26, 2024 Object Question Answering
— Unverified 0MotionAura: Generating High-Quality and Motion Consistent Videos using Discrete Diffusion Oct 10, 2024 Denoising parameter-efficient fine-tuning
Code Code Available 0Pyramidal Flow Matching for Efficient Video Generative Modeling Oct 8, 2024 GPU Text-to-Video Generation
Code Code Available 7BroadWay: Boost Your Text-to-Video Generation Model in a Training-free Way Oct 8, 2024 Decoder Text-to-Video Generation
— Unverified 0IV-Mixed Sampler: Leveraging Image Diffusion Models for Enhanced Video Synthesis Oct 5, 2024 Text-to-Video Generation
Code Code Available 1Technical Report: Competition Solution For Modelscope-Sora Sep 24, 2024 Text-to-Video Generation Video Description
— Unverified 0Advancing Video Quality Assessment for AIGC Sep 23, 2024 Image Generation Text Generation
— Unverified 0The Art of Storytelling: Multi-Agent Generative AI for Dynamic Multimodal Narratives Sep 17, 2024 text-to-speech Text to Speech
— Unverified 0Compositional 3D-aware Video Generation with LLM Director Aug 31, 2024 Text-to-Video Generation Video Generation
— Unverified 0Kubrick: Multimodal Agent Collaborations for Synthetic Video Generation Aug 19, 2024 Instruction Following Large Language Model
— Unverified 0CogVideoX: Text-to-Video Diffusion Models with An Expert Transformer Aug 12, 2024 Text-to-Video Generation Video Alignment
Code Code Available 11VidGen-1M: A Large-Scale Dataset for Text-to-video Generation Aug 5, 2024 Text-to-Video Generation Video Generation
— Unverified 0MMTrail: A Multimodal Trailer Video Dataset with Language and Music Descriptions Jul 30, 2024 Audio Generation Image to Video Generation
Code Code Available 1T2V-CompBench: A Comprehensive Benchmark for Compositional Text-to-video Generation Jul 19, 2024 Attribute Language Modeling
Code Code Available 2Unlearning Concepts from Text-to-Video Diffusion Models Jul 19, 2024 Text-to-Video Generation Video Generation
— Unverified 0Video-to-Audio Generation with Hidden Alignment Jul 10, 2024 Audio Generation Data Augmentation
— Unverified 0Mobius: A High Efficient Spatial-Temporal Parallel Training Paradigm for Text-to-Video Generation Task Jul 9, 2024 GPU Text-to-Video Generation
Code Code Available 0VIMI: Grounding Video Generation through Multi-modal Instruction Jul 8, 2024 Text-to-Video Generation Video Generation
— Unverified 0GVDIFF: Grounded Text-to-Video Generation with Diffusion Models Jul 2, 2024 Text-to-Video Generation Video Generation
— Unverified 0OpenVid-1M: A Large-Scale High-Quality Dataset for Text-to-video Generation Jul 2, 2024 Text-to-Video Generation Video Generation
— Unverified 0Evaluation of Text-to-Video Generation Models: A Dynamics Perspective Jul 1, 2024 Text-to-Video Generation Video Generation
Code Code Available 3ChronoMagic-Bench: A Benchmark for Metamorphic Evaluation of Text-to-Time-lapse Video Generation Jun 26, 2024 Text-to-Video Generation Video Generation
Code Code Available 5MotionBooth: Motion-Aware Customized Text-to-Video Generation Jun 25, 2024 Text-to-Video Generation Video Generation
— Unverified 0SafeSora: Towards Safety Alignment of Text2Video Generation via a Human Preference Dataset Jun 20, 2024 Safety Alignment Text-to-Video Generation
Code Code Available 1VANE-Bench: Video Anomaly Evaluation Benchmark for Conversational LMMs Jun 14, 2024 Anomaly Detection Benchmarking
Code Code Available 1