| WorldScore: A Unified Evaluation Benchmark for World Generation | Apr 1, 2025 | Scene Generation, Video Generation | —Unverified | 0 | 0 |
| WorldSimBench: Towards Video Generation Models as World Simulators | Oct 23, 2024 | Autonomous Driving, Robot Manipulation | —Unverified | 0 | 0 |
| X-Dancer: Expressive Music to Human Dance Video Generation | Feb 24, 2025 | Image Animation, Video Generation | —Unverified | 0 | 0 |
| xGen-VideoSyn-1: High-fidelity Text-to-Video Synthesis with Compressed Representations | Aug 22, 2024 | Dense Captioning, Motion Estimation | —Unverified | 0 | 0 |
| Xp-GAN: Unsupervised Multi-object Controllable Video Generation | Nov 19, 2021 | Object, Video Generation | —Unverified | 0 | 0 |
| Zero4D: Training-Free 4D Video Generation From Single Video Using Off-the-Shelf Video Diffusion Model | Mar 28, 2025 | Video Generation | —Unverified | 0 | 0 |
| ZeroHSI: Zero-Shot 4D Human-Scene Interaction by Video Generation | Dec 24, 2024 | Human-Object Interaction Detection, Video Generation | —Unverified | 0 | 0 |
| Generating Videos of Zero-Shot Compositions of Actions and Objects | Dec 5, 2019 | Human-Object Interaction Detection, Object | —Unverified | 0 | 0 |
| Zero-Shot Human-Object Interaction Synthesis with Multimodal Priors | Mar 25, 2025 | Diversity, Human-Object Interaction Detection | —Unverified | 0 | 0 |
| Zero-Shot Video Editing through Adaptive Sliding Score Distillation | Jun 7, 2024 | Denoising, Text-to-Video Generation | —Unverified | 0 | 0 |
| 0/1 Deep Neural Networks via Block Coordinate Descent | Jun 19, 2022 | 10-shot image generation | —Unverified | 0 | 0 |
| Zeroth-order Informed Fine-Tuning for Diffusion Model: A Recursive Likelihood Ratio Optimizer | Feb 2, 2025 | Reinforcement Learning (RL), Video Generation | —Unverified | 0 | 0 |
| Reenact Anything: Semantic Video Motion Transfer Using Motion-Textual Inversion | Aug 1, 2024 | Face Reenactment, Video Generation | —Unverified | 0 | 0 |
| VidGen-1M: A Large-Scale Dataset for Text-to-video Generation | Aug 5, 2024 | Text-to-Video Generation, Video Generation | —Unverified | 0 | 0 |
| Aquarius: A Family of Industry-Level Video Generation Models for Marketing Scenarios | May 14, 2025 | Marketing, Video Generation | —Unverified | 0 | 0 |
| Face Consistency Benchmark for GenAI Video | May 16, 2025 | Video Generation | —Unverified | 0 | 0 |
| 360DVD: Controllable Panorama Video Generation with 360-Degree Video Diffusion Model | Jan 12, 2024 | Video Generation | —Unverified | 0 | 0 |
| 3DDesigner: Towards Photorealistic 3D Object Generation and Editing with Text-guided Diffusion Models | Nov 25, 2022 | Denoising, NeRF | —Unverified | 0 | 0 |
| 3D Gaussian Splatting with Normal Information for Mesh Extraction and Improved Rendering | Jan 14, 2025 | Novel View Synthesis, Video Generation | —Unverified | 0 | 0 |
| 3DGS-Enhancer: Enhancing Unbounded 3D Gaussian Splatting with View-consistent 2D Diffusion Priors | Oct 21, 2024 | 3DGS, Decoder | —Unverified | 0 | 0 |
| 3DTrajMaster: Mastering 3D Trajectory for Multi-Entity Motion in Video Generation | Dec 10, 2024 | Video Generation | —Unverified | 0 | 0 |
| 4Diffusion: Multi-view Video Diffusion Model for 4D Generation | May 31, 2024 | NeRF, Video Generation | —Unverified | 0 | 0 |
| 4Real: Towards Photorealistic 4D Scene Generation via Video Diffusion Models | Jun 11, 2024 | Scene Generation, Video Generation | —Unverified | 0 | 0 |
| Abductive Ego-View Accident Video Understanding for Safe Driving Perception | Mar 1, 2024 | Object, object-detection | —Unverified | 0 | 0 |
| AC3D: Analyzing and Improving 3D Camera Control in Video Diffusion Transformers | Nov 27, 2024 | Camera Pose Estimation, Pose Estimation | —Unverified | 0 | 0 |
| Accelerating Diffusion Sampling via Exploiting Local Transition Coherence | Mar 12, 2025 | Denoising, Video Generation | —Unverified | 0 | 0 |
| Accelerating Image Generation with Sub-path Linear Approximation Model | Apr 22, 2024 | Denoising, GPU | —Unverified | 0 | 0 |
| Accelerating Video Diffusion Models via Distribution Matching | Dec 8, 2024 | Denoising, Video Generation | —Unverified | 0 | 0 |
| AccidentSim: Generating Physically Realistic Vehicle Collision Videos from Real-World Accident Reports | Mar 26, 2025 | Autonomous Driving, NeRF | —Unverified | 0 | 0 |
| ACDC: Autoregressive Coherent Multimodal Generation using Diffusion Correction | Oct 7, 2024 | multimodal generation, Story Generation | —Unverified | 0 | 0 |
| Action2Dialogue: Generating Character-Centric Narratives from Scene-Level Prompts | May 22, 2025 | Dialogue Generation, Large Language Model | —Unverified | 0 | 0 |
| Action Concept Grounding Network for Semantically-Consistent Video Generation | Sep 28, 2020 | Action Recognition, object-detection | —Unverified | 0 | 0 |
| Modular Action Concept Grounding in Semantic Video Prediction | Nov 23, 2020 | Action Recognition, Mixture-of-Experts | —Unverified | 0 | 0 |
| Action-conditioned video data improves predictability | Apr 8, 2024 | Video Generation | —Unverified | 0 | 0 |
| AdaDiff: Adaptive Step Selection for Fast Diffusion Models | Nov 24, 2023 | Denoising, Image Generation | —Unverified | 0 | 0 |
| Adapting Image-to-Video Diffusion Models for Large-Motion Frame Interpolation | Dec 22, 2024 | Video Frame Interpolation, Video Generation | —Unverified | 0 | 0 |
| Adaptive Caching for Faster Video Generation with Diffusion Transformers | Nov 4, 2024 | Denoising, Video Generation | —Unverified | 0 | 0 |
| Ada-TTA: Towards Adaptive High-Quality Text-to-Talking Avatar Synthesis | Jun 6, 2023 | Neural Rendering, text-to-speech | —Unverified | 0 | 0 |
| Advancing Auto-Regressive Continuation for Video Frames | Dec 4, 2024 | Autonomous Driving, Optical Flow Estimation | —Unverified | 0 | 0 |
| Advancing Video Quality Assessment for AIGC | Sep 23, 2024 | Image Generation, Text Generation | —Unverified | 0 | 0 |
| AesopAgent: Agent-driven Evolutionary System on Story-to-Video Production | Mar 12, 2024 | Image Generation, RAG | —Unverified | 0 | 0 |
| Aether: Geometric-Aware Unified World Modeling | Mar 24, 2025 | Dynamic Reconstruction, Prediction | —Unverified | 0 | 0 |
| A Hierarchical Variational Neural Uncertainty Model for Stochastic Video Prediction | Oct 6, 2021 | Diversity, Video Generation | —Unverified | 0 | 0 |
| AKiRa: Augmentation Kit on Rays for optical video generation | Dec 18, 2024 | Video Generation | —Unverified | 0 | 0 |
| Alignment is All You Need: A Training-free Augmentation Strategy for Pose-guided Video Generation | Aug 29, 2024 | All, Video Generation | —Unverified | 0 | 0 |
| Align Your Gaussians: Text-to-4D with Dynamic 3D Gaussians and Composed Diffusion Models | Dec 21, 2023 | Synthetic Data Generation, Video Generation | —Unverified | 0 | 0 |
| AnchorCrafter: Animate CyberAnchors Saling Your Products via Human-Object Interacting Video Generation | Nov 26, 2024 | Human-Object Interaction Detection, Object | —Unverified | 0 | 0 |
| Anchored Diffusion for Video Face Reenactment | Jul 21, 2024 | Face Reenactment, Video Generation | —Unverified | 0 | 0 |
| AniClipart: Clipart Animation with Text-to-Video Priors | Apr 18, 2024 | Image to Video Generation, Text-to-Video Generation | —Unverified | 0 | 0 |
| AniGS: Animatable Gaussian Avatar from a Single Image with Inconsistent Gaussian Reconstruction | Dec 3, 2024 | 3D Reconstruction, Video Generation | —Unverified | 0 | 0 |