LoViC: Efficient Long Video Generation with Context Compression Jul 17, 2025 Text-to-Video Generation Video Generation
— Unverified 0M4V: Multi-Modal Mamba for Text-to-Video Generation Jun 12, 2025 Mamba Text-to-Video Generation
— Unverified 0Astraea: A GPU-Oriented Token-wise Acceleration Framework for Video Diffusion Transformers Jun 5, 2025 GPU Text-to-Video Generation
— Unverified 0VCapsBench: A Large-scale Fine-grained Benchmark for Video Caption Quality Evaluation May 29, 2025 Caption Generation Language Modeling
Code Code Available 1InfLVG: Reinforce Inference-Time Consistent Long Video Generation with GRPO May 23, 2025 Text-to-Video Generation Video Generation
Code Code Available 0Safe-Sora: Safe Text-to-Video Generation via Graphical Watermarking May 19, 2025 Image Generation Mamba
— Unverified 0LOVE: Benchmarking and Evaluating Text-to-Video Generation and Video-to-Text Interpretation May 17, 2025 Benchmarking Question Answering
Code Code Available 1T2VTextBench: A Human Evaluation Benchmark for Textual Control in Video Generation Models May 8, 2025 Instruction Following Text-to-Video Generation
— Unverified 0DualReal: Adaptive Joint Training for Lossless Identity-Motion Fusion in Video Customization May 4, 2025 Denoising Text-to-Video Generation
— Unverified 0T2VPhysBench: A First-Principles Benchmark for Physical Consistency in Text-to-Video Generation May 1, 2025 counterfactual Instruction Following
— Unverified 0We'll Fix it in Post: Improving Text-to-Video Generation with Neuro-Symbolic Feedback Apr 24, 2025 Text-to-Video Generation Video Generation
— Unverified 0Tiger200K: Manually Curated High Visual Quality Video Dataset from UGC Platform Apr 21, 2025 Boundary Detection Optical Character Recognition (OCR)
— Unverified 0DyST-XL: Dynamic Layout Planning and Content Control for Compositional Text-to-Video Generation Apr 21, 2025 Attribute Denoising
— Unverified 0The Devil is in the Prompts: Retrieval-Augmented Prompt Optimization for Text-to-Video Generation Apr 16, 2025 Sentence Text-to-Video Generation
— Unverified 0Modular-Cam: Modular Dynamic Camera-view Video Generation with LLM Apr 16, 2025 Large Language Model Text-to-Video Generation
— Unverified 0H3AE: High Compression, High Speed, and High Quality AutoEncoder for Video Diffusion Models Apr 14, 2025 Denoising Text-to-Video Generation
— Unverified 0Training-free Guidance in Text-to-Video Generation via Multimodal Planning and Structured Noise Initialization Apr 11, 2025 Denoising Object
— Unverified 0On-device Sora: Enabling Training-Free Diffusion-based Text-to-Video Generation for Mobile Devices Mar 31, 2025 Denoising Model Optimization
Code Code Available 2VideoMage: Multi-Subject and Motion Customization of Text-to-Video Diffusion Models Mar 27, 2025 Text-to-Video Generation Video Generation
— Unverified 0Protecting Your Video Content: Disrupting Automated Video-based LLM Annotations Mar 26, 2025 Descriptive Text-to-Video Generation
Code Code Available 0VPO: Aligning Text-to-Video Generation Models with Prompt Optimization Mar 26, 2025 In-Context Learning Safety Alignment
Code Code Available 1RecTable: Fast Modeling Tabular Data with Rectified Flow Mar 26, 2025 Image Generation Text to Image Generation
Code Code Available 0Can Text-to-Video Generation help Video-Language Alignment? Mar 24, 2025 Text-to-Video Generation Video Generation
— Unverified 0Resource-Efficient Motion Control for Video Generation via Dynamic Mask Guidance Mar 24, 2025 Text-to-Video Generation Video Editing
— Unverified 0Enabling Versatile Controls for Video Diffusion Models Mar 21, 2025 Text-to-Video Generation Video Generation
Code Code Available 0HiTVideo: Hierarchical Tokenizers for Enhancing Text-to-Video Generation with Autoregressive Large Language Models Mar 14, 2025 Text-to-Video Generation Video Generation
— Unverified 0WISA: World Simulator Assistant for Physics-Aware Text-to-Video Generation Mar 11, 2025 Text-to-Video Generation Video Generation
— Unverified 0VideoUFO: A Million-Scale User-Focused Dataset for Text-to-Video Generation Mar 3, 2025 Text-to-Video Generation Video Generation
Code Code Available 0HAIC: Improving Human Action Understanding and Generation with Better Captions for Multi-modal Large Language Models Feb 28, 2025 Action Understanding Text-to-Video Generation
— Unverified 0RelaCtrl: Relevance-Guided Efficient Control for Diffusion Transformers Feb 20, 2025 Text-to-Video Generation Video Generation
— Unverified 0VidCapBench: A Comprehensive Benchmark of Video Captioning for Controllable Text-to-Video Generation Feb 18, 2025 Text-to-Video Generation Video Captioning
Code Code Available 1MALT Diffusion: Memory-Augmented Latent Transformers for Any-Length Video Generation Feb 18, 2025 Text-to-Video Generation Video Generation
— Unverified 0CineMaster: A 3D-Aware and Controllable Framework for Cinematic Text-to-Video Generation Feb 12, 2025 Object Text-to-Video Generation
— Unverified 0Magic 1-For-1: Generating One Minute Video Clips within One Minute Feb 11, 2025 Image Generation Image to Video Generation
Code Code Available 0FlashVideo:Flowing Fidelity to Detail for Efficient High-Resolution Video Generation Feb 7, 2025 Computational Efficiency Text-to-Video Generation
Code Code Available 3On-device Sora: Enabling Training-Free Diffusion-based Text-to-Video Generation for Mobile Devices Feb 5, 2025 Denoising Model Optimization
Code Code Available 2Harness Local Rewards for Global Benefits: Effective Text-to-Video Generation Alignment with Patch-level Reward Models Feb 4, 2025 Text-to-Video Generation Video Generation
— Unverified 0IPO: Iterative Preference Optimization for Text-to-Video Generation Feb 4, 2025 Large Language Model Text-to-Video Generation
— Unverified 0RichSpace: Enriching Text-to-Video Prompt Space via Text Embedding Interpolation Jan 17, 2025 Text-to-Video Generation Video Generation
— Unverified 0Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models Jan 14, 2025 Benchmarking Text-to-Video Generation
Code Code Available 4BlobGEN-Vid: Compositional Text-to-Video Generation with Blob Video Representations Jan 13, 2025 Object Text-to-Video Generation
— Unverified 0ConceptMaster: Multi-Concept Video Customization on Diffusion Transformer Models Without Test-Time Tuning Jan 8, 2025 Text-to-Video Generation Video Generation
— Unverified 0Magic Mirror: ID-Preserved Video Generation in Video Diffusion Transformers Jan 7, 2025 Diversity Text-to-Video Generation
Code Code Available 2TransPixeler: Advancing Text-to-Video Generation with Transparency Jan 6, 2025 Text-to-Video Generation Video Generation
Code Code Available 4ByTheWay: Boost Your Text-to-Video Generation Model to Higher Quality in a Training-free Way Jan 1, 2025 Text-to-Video Generation Video Generation
— Unverified 0Patch Matters: Training-free Fine-grained Image Caption Enhancement via Local Perception Jan 1, 2025 Image Captioning Image Generation
— Unverified 0EIDT-V: Exploiting Intersections in Diffusion Trajectories for Model-Agnostic, Zero-Shot, Training-Free Text-to-Video Generation Jan 1, 2025 Image Generation Text-to-Video Generation
— Unverified 0IM-Zero: Instance-level Motion Controllable Video Generation in a Zero-shot Manner Jan 1, 2025 Motion Generation Text-to-Video Generation
— Unverified 0STDD: Spatio-Temporal Dual Diffusion for Video Generation Jan 1, 2025 Text-to-Video Generation Video Generation
— Unverified 0Gender Bias in Text-to-Video Generation Models: A case study of Sora Dec 30, 2024 Text-to-Video Generation Video Generation
— Unverified 0