Can Text-to-Video Generation help Video-Language Alignment? (Mar 24, 2025). Tasks: Text-to-Video Generation, Video Generation. Status: Unverified.
Resource-Efficient Motion Control for Video Generation via Dynamic Mask Guidance (Mar 24, 2025). Tasks: Text-to-Video Generation, Video Editing. Status: Unverified.
Enabling Versatile Controls for Video Diffusion Models (Mar 21, 2025). Tasks: Text-to-Video Generation, Video Generation. Status: Code Available.
HiTVideo: Hierarchical Tokenizers for Enhancing Text-to-Video Generation with Autoregressive Large Language Models (Mar 14, 2025). Tasks: Text-to-Video Generation, Video Generation. Status: Unverified.
WISA: World Simulator Assistant for Physics-Aware Text-to-Video Generation (Mar 11, 2025). Tasks: Text-to-Video Generation, Video Generation. Status: Unverified.
VideoUFO: A Million-Scale User-Focused Dataset for Text-to-Video Generation (Mar 3, 2025). Tasks: Text-to-Video Generation, Video Generation. Status: Code Available.
HAIC: Improving Human Action Understanding and Generation with Better Captions for Multi-modal Large Language Models (Feb 28, 2025). Tasks: Action Understanding, Text-to-Video Generation. Status: Unverified.
RelaCtrl: Relevance-Guided Efficient Control for Diffusion Transformers (Feb 20, 2025). Tasks: Text-to-Video Generation, Video Generation. Status: Unverified.
MALT Diffusion: Memory-Augmented Latent Transformers for Any-Length Video Generation (Feb 18, 2025). Tasks: Text-to-Video Generation, Video Generation. Status: Unverified.
CineMaster: A 3D-Aware and Controllable Framework for Cinematic Text-to-Video Generation (Feb 12, 2025). Tasks: Object, Text-to-Video Generation. Status: Unverified.
Magic 1-For-1: Generating One Minute Video Clips within One Minute (Feb 11, 2025). Tasks: Image Generation, Image to Video Generation. Status: Code Available.
IPO: Iterative Preference Optimization for Text-to-Video Generation (Feb 4, 2025). Tasks: Large Language Model, Text-to-Video Generation. Status: Unverified.
Harness Local Rewards for Global Benefits: Effective Text-to-Video Generation Alignment with Patch-level Reward Models (Feb 4, 2025). Tasks: Text-to-Video Generation, Video Generation. Status: Unverified.
RichSpace: Enriching Text-to-Video Prompt Space via Text Embedding Interpolation (Jan 17, 2025). Tasks: Text-to-Video Generation, Video Generation. Status: Unverified.
BlobGEN-Vid: Compositional Text-to-Video Generation with Blob Video Representations (Jan 13, 2025). Tasks: Object, Text-to-Video Generation. Status: Unverified.
ConceptMaster: Multi-Concept Video Customization on Diffusion Transformer Models Without Test-Time Tuning (Jan 8, 2025). Tasks: Text-to-Video Generation, Video Generation. Status: Unverified.
STDD: Spatio-Temporal Dual Diffusion for Video Generation (Jan 1, 2025). Tasks: Text-to-Video Generation, Video Generation. Status: Unverified.
EIDT-V: Exploiting Intersections in Diffusion Trajectories for Model-Agnostic, Zero-Shot, Training-Free Text-to-Video Generation (Jan 1, 2025). Tasks: Image Generation, Text-to-Video Generation. Status: Unverified.
ByTheWay: Boost Your Text-to-Video Generation Model to Higher Quality in a Training-free Way (Jan 1, 2025). Tasks: Text-to-Video Generation, Video Generation. Status: Unverified.
Patch Matters: Training-free Fine-grained Image Caption Enhancement via Local Perception (Jan 1, 2025). Tasks: Image Captioning, Image Generation. Status: Unverified.
IM-Zero: Instance-level Motion Controllable Video Generation in a Zero-shot Manner (Jan 1, 2025). Tasks: Motion Generation, Text-to-Video Generation. Status: Unverified.
Gender Bias in Text-to-Video Generation Models: A case study of Sora (Dec 30, 2024). Tasks: Text-to-Video Generation, Video Generation. Status: Unverified.
Follow-Your-MultiPose: Tuning-Free Multi-Character Text-to-Video Generation via Pose Guidance (Dec 21, 2024). Tasks: Text-to-Video Generation, Video Generation. Status: Unverified.
VideoDPO: Omni-Preference Alignment for Video Diffusion Generation (Dec 18, 2024). Tasks: Image Generation, Text-to-Video Generation. Status: Unverified.
VG-TVP: Multimodal Procedural Planning via Visually Grounded Text-Video Prompting (Dec 16, 2024). Tasks: Informativeness, Large Language Model. Status: Code Available.
LinGen: Towards High-Resolution Minute-Length Text-to-Video Generation with Linear Computational Complexity (Dec 13, 2024). Tasks: GPU, Mamba. Status: Unverified.
InstanceCap: Improving Text-to-Video Generation via Instance-aware Structured Caption (Dec 12, 2024). Tasks: Text-to-Video Generation, Video Generation. Status: Unverified.
Mojito: Motion Trajectory and Intensity Control for Video Generation (Dec 12, 2024). Tasks: Computational Efficiency, Optical Flow Estimation. Status: Unverified.
T-SVG: Text-Driven Stereoscopic Video Generation (Dec 12, 2024). Tasks: Depth Estimation, Text-to-Video Generation. Status: Unverified.
Multi-Shot Character Consistency for Text-to-Video Generation (Dec 10, 2024). Tasks: Text-to-Video Generation, Video Generation. Status: Unverified.
GenMAC: Compositional Text-to-Video Generation with Multi-Agent Collaboration (Dec 5, 2024). Tasks: Attribute, Hallucination. Status: Unverified.
Improving Dynamic Object Interactions in Text-to-Video Generation with AI Feedback (Dec 3, 2024). Tasks: Object, Offline RL. Status: Unverified.
CPA: Camera-pose-awareness Diffusion Transformer for Video Generation (Dec 2, 2024). Tasks: Text-to-Video Generation, Video Generation. Status: Unverified.
Free^2Guide: Gradient-Free Path Integral Control for Enhancing Text-to-Video Generation with Large Vision-Language Models (Nov 26, 2024). Tasks: Reinforcement Learning (RL), Text-to-Video Generation. Status: Unverified.
DreamRunner: Fine-Grained Storytelling Video Generation with Retrieval-Augmented Motion Adaptation (Nov 25, 2024). Tasks: Large Language Model, Motion Planning. Status: Unverified.
VideoRepair: Improving Text-to-Video Generation via Misalignment Evaluation and Localized Refinement (Nov 22, 2024). Tasks: Text-to-Video Generation, Video Alignment. Status: Unverified.
Neuro-Symbolic Evaluation of Text-to-Video Models using Formal Verification (Nov 22, 2024). Tasks: Autonomous Driving, Text-to-Video Generation. Status: Code Available.
A Survey of Emerging Approaches and Advances in Video Generation (Nov 9, 2024). Tasks: Image to Video Generation, Language Modeling. Status: Unverified.
GiVE: Guiding Visual Encoder to Perceive Overlooked Information (Oct 26, 2024). Tasks: Object, Question Answering. Status: Unverified.
MotionAura: Generating High-Quality and Motion Consistent Videos using Discrete Diffusion (Oct 10, 2024). Tasks: Denoising, parameter-efficient fine-tuning. Status: Code Available.
BroadWay: Boost Your Text-to-Video Generation Model in a Training-free Way (Oct 8, 2024). Tasks: Decoder, Text-to-Video Generation. Status: Unverified.
Technical Report: Competition Solution For Modelscope-Sora (Sep 24, 2024). Tasks: Text-to-Video Generation, Video Description. Status: Unverified.
Advancing Video Quality Assessment for AIGC (Sep 23, 2024). Tasks: Image Generation, Text Generation. Status: Unverified.
The Art of Storytelling: Multi-Agent Generative AI for Dynamic Multimodal Narratives (Sep 17, 2024). Tasks: text-to-speech, Text to Speech. Status: Unverified.
Compositional 3D-aware Video Generation with LLM Director (Aug 31, 2024). Tasks: Text-to-Video Generation, Video Generation. Status: Unverified.
Kubrick: Multimodal Agent Collaborations for Synthetic Video Generation (Aug 19, 2024). Tasks: Instruction Following, Large Language Model. Status: Unverified.
VidGen-1M: A Large-Scale Dataset for Text-to-video Generation (Aug 5, 2024). Tasks: Text-to-Video Generation, Video Generation. Status: Unverified.
Unlearning Concepts from Text-to-Video Diffusion Models (Jul 19, 2024). Tasks: Text-to-Video Generation, Video Generation. Status: Unverified.
Video-to-Audio Generation with Hidden Alignment (Jul 10, 2024). Tasks: Audio Generation, Data Augmentation. Status: Unverified.
Mobius: A High Efficient Spatial-Temporal Parallel Training Paradigm for Text-to-Video Generation Task (Jul 9, 2024). Tasks: GPU, Text-to-Video Generation. Status: Code Available.