| AnimateAnything: Consistent and Controllable Animation for Video Generation | Nov 16, 2024 | Video Generation | —Unverified | 0 | 0 |
| AnimateDiff-Lightning: Cross-Model Diffusion Distillation | Mar 19, 2024 | modelVideo Generation | —Unverified | 0 | 0 |
| Animate Your Motion: Turning Still Images into Dynamic Videos | Mar 15, 2024 | SpecificityText-to-Video Generation | —Unverified | 0 | 0 |
| AnimateZoo: Zero-shot Video Generation of Cross-Species Animation via Subject Alignment | Apr 7, 2024 | Video EditingVideo Generation | —Unverified | 0 | 0 |
| Animating the Past: Reconstruct Trilobite via Video Generation | Oct 10, 2024 | Language ModellingLarge Language Model | —Unverified | 0 | 0 |
| Any2Caption:Interpreting Any Condition to Caption for Controllable Video Generation | Mar 31, 2025 | Video Generation | —Unverified | 0 | 0 |
| AnyCharV: Bootstrap Controllable Character Video Generation with Fine-to-Coarse Guidance | Feb 12, 2025 | Video Generation | —Unverified | 0 | 0 |
| AnyI2V: Animating Any Conditional Image with Motion Control | Jul 3, 2025 | Style TransferVideo Generation | —Unverified | 0 | 0 |
| APLA: Additional Perturbation for Latent Noise with Adversarial Training Enables Consistency | Aug 24, 2023 | Video Generation | —Unverified | 0 | 0 |
| ARDuP: Active Region Video Diffusion for Universal Policies | Jun 19, 2024 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| A Recipe for Scaling up Text-to-Video Generation with Text-free Videos | Dec 25, 2023 | Image GenerationText to Image Generation | —Unverified | 0 | 0 |
| A review of Generative Adversarial Networks (GANs) and its applications in a wide variety of disciplines -- From Medical to Remote Sensing | Oct 1, 2021 | AstronomyGenerative Adversarial Network | —Unverified | 0 | 0 |
| A Review of Multi-Modal Large Language and Vision Models | Mar 28, 2024 | Image CaptioningPrompt Engineering | —Unverified | 0 | 0 |
| ARLON: Boosting Diffusion Transformers with Autoregressive Models for Long Video Generation | Oct 27, 2024 | Video Generation | —Unverified | 0 | 0 |
| ArrowGAN : Learning to Generate Videos by Learning Arrow of Time | Jan 11, 2021 | Conditional Image GenerationImage Generation | —Unverified | 0 | 0 |
| ARTV: Auto-Regressive Text-to-Video Generation with Diffusion Models | Nov 30, 2023 | Text-to-Video GenerationVideo Generation | —Unverified | 0 | 0 |
| Articulate That Object Part (ATOP): 3D Part Articulation from Text and Motion Personalization | Feb 11, 2025 | Image GenerationMotion Generation | —Unverified | 0 | 0 |
| A spatiotemporal style transfer algorithm for dynamic visual stimulus generation | Mar 7, 2024 | Image GenerationObject Recognition | —Unverified | 0 | 0 |
| Astraea: A GPU-Oriented Token-wise Acceleration Framework for Video Diffusion Transformers | Jun 5, 2025 | GPUText-to-Video Generation | —Unverified | 0 | 0 |
| A Survey of Defenses against AI-generated Visual Media: Detection, Disruption, and Authentication | Jul 15, 2024 | FairnessImage Generation | —Unverified | 0 | 0 |
| A Survey of Emerging Approaches and Advances in Video Generation | Nov 9, 2024 | Image to Video GenerationLanguage Modeling | —Unverified | 0 | 0 |
| A Survey on Generative AI and LLM for Video Generation, Understanding, and Streaming | Jan 30, 2024 | Video GenerationVideo Understanding | —Unverified | 0 | 0 |
| A Survey on Long Video Generation: Challenges, Methods, and Prospects | Mar 25, 2024 | SurveyVideo Generation | —Unverified | 0 | 0 |
| A Survey on Long-Video Storytelling Generation: Architectures, Consistency, and Cinematic Quality | Jul 9, 2025 | DiversityVideo Generation | —Unverified | 0 | 0 |
| A Survey on Vision Autoregressive Model | Nov 13, 2024 | 3D GenerationBenchmarking | —Unverified | 0 | 0 |
| ASurvey: Spatiotemporal Consistency in Video Generation | Feb 25, 2025 | Image GenerationVideo Generation | —Unverified | 0 | 0 |
| AsymKV: Enabling 1-Bit Quantization of KV Cache with Layer-Wise Asymmetric Quantization Configurations | Oct 17, 2024 | DecoderQuantization | —Unverified | 0 | 0 |
| ATI: Any Trajectory Instruction for Controllable Video Generation | May 28, 2025 | Image to Video GenerationVideo Generation | —Unverified | 0 | 0 |
| AtomoVideo: High Fidelity Image-to-Video Generation | Mar 4, 2024 | Image GenerationImage to Video Generation | —Unverified | 0 | 0 |
| AudCast: Audio-Driven Human Video Generation by Cascaded Diffusion Transformers | Mar 25, 2025 | Video Generation | —Unverified | 0 | 0 |
| Audio-Driven Co-Speech Gesture Video Generation | Dec 5, 2022 | Video Generation | —Unverified | 0 | 0 |
| Audio-driven Gesture Generation via Deviation Feature in the Latent Space | Mar 27, 2025 | Gesture GenerationVideo Generation | —Unverified | 0 | 0 |
| Audio-Driven Talking Face Video Generation with Dynamic Convolution Kernels | Jan 16, 2022 | Video Generation | —Unverified | 0 | 0 |
| Audio-Sync Video Generation with Multi-Stream Temporal Control | Jun 9, 2025 | Audio-Visual SynchronizationVideo Alignment | —Unverified | 0 | 0 |
| Audio-Visual Speech Representation Expert for Enhanced Talking Face Video Generation and Evaluation | May 7, 2024 | Face GenerationTalking Face Generation | —Unverified | 0 | 0 |
| Autoencoding Video Latents for Adversarial Video Generation | Jan 18, 2022 | Image GenerationVideo Generation | —Unverified | 0 | 0 |
| AutoLV: Automatic Lecture Video Generator | Sep 19, 2022 | Speech SynthesisTalking Head Generation | —Unverified | 0 | 0 |
| Automated 3D Physical Simulation of Open-world Scene with Gaussian Splatting | Nov 19, 2024 | 3D GenerationGPU | —Unverified | 0 | 0 |
| AUTV: Creating Underwater Video Datasets with Pixel-wise Annotations | Mar 17, 2025 | Semantic SegmentationVideo Generation | —Unverified | 0 | 0 |
| AvatarShield: Visual Reinforcement Learning for Human-Centric Video Forgery Detection | May 21, 2025 | reinforcement-learningReinforcement Learning | —Unverified | 0 | 0 |
| AV-DiT: Efficient Audio-Visual Diffusion Transformer for Joint Audio and Video Generation | Jun 11, 2024 | Audio GenerationVideo Generation | —Unverified | 0 | 0 |
| AV-Link: Temporally-Aligned Diffusion Features for Cross-Modal Audio-Video Generation | Dec 19, 2024 | Video GenerationVideo Synchronization | —Unverified | 0 | 0 |
| Beyond Text: Implementing Multimodal Large Language Model-Powered Multi-Agent Systems Using a No-Code Platform | Jan 1, 2025 | Code GenerationImage Generation | —Unverified | 0 | 0 |
| Beyond the Frame: Generating 360° Panoramic Videos from Perspective Videos | Apr 10, 2025 | Question AnsweringVideo Generation | —Unverified | 0 | 0 |
| The Missing U for Efficient Diffusion Models | Oct 31, 2023 | DenoisingImage Generation | —Unverified | 0 | 0 |
| BlazeBVD: Make Scale-Time Equalization Great Again for Blind Video Deflickering | Mar 10, 2024 | Video GenerationVideo Temporal Consistency | —Unverified | 0 | 0 |
| BlobGEN-Vid: Compositional Text-to-Video Generation with Blob Video Representations | Jan 13, 2025 | ObjectText-to-Video Generation | —Unverified | 0 | 0 |
| Boosting Camera Motion Control for Video Diffusion Transformers | Oct 14, 2024 | Video Generation | —Unverified | 0 | 0 |
| Bora: Biomedical Generalist Video Generation Model | Jul 12, 2024 | Cell TrackingData Augmentation | —Unverified | 0 | 0 |
| Brick-Diffusion: Generating Long Videos with Brick-to-Wall Denoising | Jan 6, 2025 | DenoisingVideo Generation | —Unverified | 0 | 0 |