The Art of Storytelling: Multi-Agent Generative AI for Dynamic Multimodal Narratives Sep 17, 2024 text-to-speech Text to Speech
The Devil is in the Prompts: Retrieval-Augmented Prompt Optimization for Text-to-Video Generation Apr 16, 2025 Sentence Text-to-Video Generation
The Lost Melody: Empirical Observations on Text-to-Video Generation From A Storytelling Perspective May 13, 2024 Text-to-Video Generation Video Generation
Tiger200K: Manually Curated High Visual Quality Video Dataset from UGC Platform Apr 21, 2025 Boundary Detection Optical Character Recognition (OCR)
Towards A Better Metric for Text-to-Video Generation Jan 15, 2024 Mixture-of-Experts Text-to-Video Generation
Training-free Guidance in Text-to-Video Generation via Multimodal Planning and Structured Noise Initialization Apr 11, 2025 Denoising Object
BroadWay: Boost Your Text-to-Video Generation Model in a Training-free Way Oct 8, 2024 Decoder Text-to-Video Generation
TRIP: Temporal Residual Learning with Image Noise Prior for Image-to-Video Diffusion Models Mar 25, 2024 Image to Video Generation Relational Reasoning
T-SVG: Text-Driven Stereoscopic Video Generation Dec 12, 2024 Depth Estimation Text-to-Video Generation
BlobGEN-Vid: Compositional Text-to-Video Generation with Blob Video Representations Jan 13, 2025 Object Text-to-Video Generation
Tutorial on Diffusion Models for Imaging and Vision Mar 26, 2024 Image Generation Text to Image Generation
Unlearning Concepts from Text-to-Video Diffusion Models Jul 19, 2024 Text-to-Video Generation Video Generation
VIMI: Grounding Video Generation through Multi-modal Instruction Jul 8, 2024 Text-to-Video Generation Video Generation
WISA: World Simulator Assistant for Physics-Aware Text-to-Video Generation Mar 11, 2025 Text-to-Video Generation Video Generation
A Survey of Emerging Approaches and Advances in Video Generation Nov 9, 2024 Image to Video Generation Language Modeling
Zero-Shot Video Editing through Adaptive Sliding Score Distillation Jun 7, 2024 Denoising Text-to-Video Generation
Gender Bias in Text-to-Video Generation Models: A case study of Sora Dec 30, 2024 Text-to-Video Generation Video Generation
Free^2Guide: Gradient-Free Path Integral Control for Enhancing Text-to-Video Generation with Large Vision-Language Models Nov 26, 2024 Reinforcement Learning (RL) Text-to-Video Generation
Astraea: A GPU-Oriented Token-wise Acceleration Framework for Video Diffusion Transformers Jun 5, 2025 GPU Text-to-Video Generation
GenMAC: Compositional Text-to-Video Generation with Multi-Agent Collaboration Dec 5, 2024 Attribute Hallucination
GenTron: Diffusion Transformers for Image and Video Generation Dec 7, 2023 Text-to-Video Generation Video Generation
GiVE: Guiding Visual Encoder to Perceive Overlooked Information Oct 26, 2024 Object Question Answering
ARTV: Auto-Regressive Text-to-Video Generation with Diffusion Models Nov 30, 2023 Text-to-Video Generation Video Generation
GPT4Motion: Scripting Physical Motions in Text-to-Video Generation via Blender-Oriented GPT Planning Nov 21, 2023 Image Generation Text-to-Video Generation
IPO: Iterative Preference Optimization for Text-to-Video Generation Feb 4, 2025 Large Language Model Text-to-Video Generation
Grid Diffusion Models for Text-to-Video Generation Mar 30, 2024 GPU Image Generation
GVDIFF: Grounded Text-to-Video Generation with Diffusion Models Jul 2, 2024 Text-to-Video Generation Video Generation
H3AE: High Compression, High Speed, and High Quality AutoEncoder for Video Diffusion Models Apr 14, 2025 Denoising Text-to-Video Generation
HAIC: Improving Human Action Understanding and Generation with Better Captions for Multi-modal Large Language Models Feb 28, 2025 Action Understanding Text-to-Video Generation
Harness Local Rewards for Global Benefits: Effective Text-to-Video Generation Alignment with Patch-level Reward Models Feb 4, 2025 Text-to-Video Generation Video Generation
HiTVideo: Hierarchical Tokenizers for Enhancing Text-to-Video Generation with Autoregressive Large Language Models Mar 14, 2025 Text-to-Video Generation Video Generation
I4VGen: Image as Free Stepping Stone for Text-to-Video Generation Jun 4, 2024 Diversity Image Generation
Follow-Your-MultiPose: Tuning-Free Multi-Character Text-to-Video Generation via Pose Guidance Dec 21, 2024 Text-to-Video Generation Video Generation
Improving Dynamic Object Interactions in Text-to-Video Generation with AI Feedback Dec 3, 2024 Object Offline RL
IM-Zero: Instance-level Motion Controllable Video Generation in a Zero-shot Manner Jan 1, 2025 Motion Generation Text-to-Video Generation
InstanceCap: Improving Text-to-Video Generation via Instance-aware Structured Caption Dec 12, 2024 Text-to-Video Generation Video Generation
GPT4Video: A Unified Multimodal Large Language Model for Instruction-Followed Understanding and Safety-Aware Generation Nov 25, 2023 Instruction Following Language Modeling
A Review of Multi-Modal Large Language and Vision Models Mar 28, 2024 Image Captioning Prompt Engineering
Kubrick: Multimodal Agent Collaborations for Synthetic Video Generation Aug 19, 2024 Instruction Following Large Language Model
Latent-Shift: Latent Diffusion with Temporal Shift for Efficient Text-to-Video Generation Apr 17, 2023 Image Generation Super-Resolution
FlexLip: A Controllable Text-to-Lip System Jun 7, 2022 Audio Generation text-to-speech
FlashVideo: A Framework for Swift Inference in Text-to-Video Generation Dec 30, 2023 Text-to-Video Generation Video Generation
VideoDPO: Omni-Preference Alignment for Video Diffusion Generation Dec 18, 2024 Image Generation Text-to-Video Generation
LinGen: Towards High-Resolution Minute-Length Text-to-Video Generation with Linear Computational Complexity Dec 13, 2024 GPU Mamba
LivePhoto: Real Image Animation with Text-guided Motion Control Dec 5, 2023 Image Animation Text-to-Video Generation
VideoDreamer: Customized Multi-Subject Text-to-Video Generation with Disen-Mix Finetuning Nov 2, 2023 Attribute Text-to-Video Generation
LoViC: Efficient Long Video Generation with Context Compression Jul 17, 2025 Text-to-Video Generation Video Generation
Fine-grained Controllable Video Generation via Object Appearance and Context Dec 5, 2023 Text-to-Video Generation Video Generation
M4V: Multi-Modal Mamba for Text-to-Video Generation Jun 12, 2025 Mamba Text-to-Video Generation
FastVideoEdit: Leveraging Consistency Models for Efficient Text-to-Video Editing Mar 10, 2024 Image Generation Text-to-Video Editing