| Modular Action Concept Grounding in Semantic Video Prediction | Nov 23, 2020 | Action RecognitionMixture-of-Experts | —Unverified | 0 |
| Action-conditioned video data improves predictability | Apr 8, 2024 | Video Generation | —Unverified | 0 |
| AdaDiff: Adaptive Step Selection for Fast Diffusion Models | Nov 24, 2023 | DenoisingImage Generation | —Unverified | 0 |
| Adapting Image-to-Video Diffusion Models for Large-Motion Frame Interpolation | Dec 22, 2024 | Video Frame InterpolationVideo Generation | —Unverified | 0 |
| Adaptive Caching for Faster Video Generation with Diffusion Transformers | Nov 4, 2024 | DenoisingVideo Generation | —Unverified | 0 |
| Ada-TTA: Towards Adaptive High-Quality Text-to-Talking Avatar Synthesis | Jun 6, 2023 | Neural Renderingtext-to-speech | —Unverified | 0 |
| Advancing Auto-Regressive Continuation for Video Frames | Dec 4, 2024 | Autonomous DrivingOptical Flow Estimation | —Unverified | 0 |
| Advancing Video Quality Assessment for AIGC | Sep 23, 2024 | Image GenerationText Generation | —Unverified | 0 |
| AesopAgent: Agent-driven Evolutionary System on Story-to-Video Production | Mar 12, 2024 | Image GenerationRAG | —Unverified | 0 |
| Aether: Geometric-Aware Unified World Modeling | Mar 24, 2025 | Dynamic ReconstructionPrediction | —Unverified | 0 |
| A Hierarchical Variational Neural Uncertainty Model for Stochastic Video Prediction | Oct 6, 2021 | DiversityVideo Generation | —Unverified | 0 |
| AKiRa: Augmentation Kit on Rays for optical video generation | Dec 18, 2024 | Video Generation | —Unverified | 0 |
| Alignment is All You Need: A Training-free Augmentation Strategy for Pose-guided Video Generation | Aug 29, 2024 | AllVideo Generation | —Unverified | 0 |
| Align Your Gaussians: Text-to-4D with Dynamic 3D Gaussians and Composed Diffusion Models | Dec 21, 2023 | Synthetic Data GenerationVideo Generation | —Unverified | 0 |
| AnchorCrafter: Animate CyberAnchors Saling Your Products via Human-Object Interacting Video Generation | Nov 26, 2024 | Human-Object Interaction DetectionObject | —Unverified | 0 |
| Anchored Diffusion for Video Face Reenactment | Jul 21, 2024 | Face ReenactmentVideo Generation | —Unverified | 0 |
| AniClipart: Clipart Animation with Text-to-Video Priors | Apr 18, 2024 | Image to Video GenerationText-to-Video Generation | —Unverified | 0 |
| AniGS: Animatable Gaussian Avatar from a Single Image with Inconsistent Gaussian Reconstruction | Dec 3, 2024 | 3D ReconstructionVideo Generation | —Unverified | 0 |
| AnimateAnything: Consistent and Controllable Animation for Video Generation | Nov 16, 2024 | Video Generation | —Unverified | 0 |
| AnimateDiff-Lightning: Cross-Model Diffusion Distillation | Mar 19, 2024 | modelVideo Generation | —Unverified | 0 |
| Animate Your Motion: Turning Still Images into Dynamic Videos | Mar 15, 2024 | SpecificityText-to-Video Generation | —Unverified | 0 |
| AnimateZoo: Zero-shot Video Generation of Cross-Species Animation via Subject Alignment | Apr 7, 2024 | Video EditingVideo Generation | —Unverified | 0 |
| Animating the Past: Reconstruct Trilobite via Video Generation | Oct 10, 2024 | Language ModellingLarge Language Model | —Unverified | 0 |
| Any2Caption:Interpreting Any Condition to Caption for Controllable Video Generation | Mar 31, 2025 | Video Generation | —Unverified | 0 |
| AnyCharV: Bootstrap Controllable Character Video Generation with Fine-to-Coarse Guidance | Feb 12, 2025 | Video Generation | —Unverified | 0 |
| AnyI2V: Animating Any Conditional Image with Motion Control | Jul 3, 2025 | Style TransferVideo Generation | —Unverified | 0 |
| APLA: Additional Perturbation for Latent Noise with Adversarial Training Enables Consistency | Aug 24, 2023 | Video Generation | —Unverified | 0 |
| ARDuP: Active Region Video Diffusion for Universal Policies | Jun 19, 2024 | Decision MakingSequential Decision Making | —Unverified | 0 |
| A Recipe for Scaling up Text-to-Video Generation with Text-free Videos | Dec 25, 2023 | Image GenerationText to Image Generation | —Unverified | 0 |
| A review of Generative Adversarial Networks (GANs) and its applications in a wide variety of disciplines -- From Medical to Remote Sensing | Oct 1, 2021 | AstronomyGenerative Adversarial Network | —Unverified | 0 |
| A Review of Multi-Modal Large Language and Vision Models | Mar 28, 2024 | Image CaptioningPrompt Engineering | —Unverified | 0 |
| ARLON: Boosting Diffusion Transformers with Autoregressive Models for Long Video Generation | Oct 27, 2024 | Video Generation | —Unverified | 0 |
| ArrowGAN : Learning to Generate Videos by Learning Arrow of Time | Jan 11, 2021 | Conditional Image GenerationImage Generation | —Unverified | 0 |
| ARTV: Auto-Regressive Text-to-Video Generation with Diffusion Models | Nov 30, 2023 | Text-to-Video GenerationVideo Generation | —Unverified | 0 |
| Articulate That Object Part (ATOP): 3D Part Articulation from Text and Motion Personalization | Feb 11, 2025 | Image GenerationMotion Generation | —Unverified | 0 |
| A spatiotemporal style transfer algorithm for dynamic visual stimulus generation | Mar 7, 2024 | Image GenerationObject Recognition | —Unverified | 0 |
| Astraea: A GPU-Oriented Token-wise Acceleration Framework for Video Diffusion Transformers | Jun 5, 2025 | GPUText-to-Video Generation | —Unverified | 0 |
| A Survey of Defenses against AI-generated Visual Media: Detection, Disruption, and Authentication | Jul 15, 2024 | FairnessImage Generation | —Unverified | 0 |
| A Survey of Emerging Approaches and Advances in Video Generation | Nov 9, 2024 | Image to Video GenerationLanguage Modeling | —Unverified | 0 |
| A Survey on Generative AI and LLM for Video Generation, Understanding, and Streaming | Jan 30, 2024 | Video GenerationVideo Understanding | —Unverified | 0 |
| A Survey on Long Video Generation: Challenges, Methods, and Prospects | Mar 25, 2024 | SurveyVideo Generation | —Unverified | 0 |
| A Survey on Long-Video Storytelling Generation: Architectures, Consistency, and Cinematic Quality | Jul 9, 2025 | DiversityVideo Generation | —Unverified | 0 |
| A Survey on Vision Autoregressive Model | Nov 13, 2024 | 3D GenerationBenchmarking | —Unverified | 0 |
| ASurvey: Spatiotemporal Consistency in Video Generation | Feb 25, 2025 | Image GenerationVideo Generation | —Unverified | 0 |
| AsymKV: Enabling 1-Bit Quantization of KV Cache with Layer-Wise Asymmetric Quantization Configurations | Oct 17, 2024 | DecoderQuantization | —Unverified | 0 |
| ATI: Any Trajectory Instruction for Controllable Video Generation | May 28, 2025 | Image to Video GenerationVideo Generation | —Unverified | 0 |
| AtomoVideo: High Fidelity Image-to-Video Generation | Mar 4, 2024 | Image GenerationImage to Video Generation | —Unverified | 0 |
| AudCast: Audio-Driven Human Video Generation by Cascaded Diffusion Transformers | Mar 25, 2025 | Video Generation | —Unverified | 0 |
| Audio-Driven Co-Speech Gesture Video Generation | Dec 5, 2022 | Video Generation | —Unverified | 0 |
| Audio-driven Gesture Generation via Deviation Feature in the Latent Space | Mar 27, 2025 | Gesture GenerationVideo Generation | —Unverified | 0 |