| Improving Dynamic Object Interactions in Text-to-Video Generation with AI Feedback | Dec 3, 2024 | ObjectOffline RL | —Unverified | 0 |
| Improving the Diffusability of Autoencoders | Feb 20, 2025 | DecoderImage Generation | —Unverified | 0 |
| Improving Video Generation with Human Feedback | Jan 23, 2025 | Video Generation | —Unverified | 0 |
| IM-Zero: Instance-level Motion Controllable Video Generation in a Zero-shot Manner | Jan 1, 2025 | Motion GenerationText-to-Video Generation | —Unverified | 0 |
| Individual Content and Motion Dynamics Preserved Pruning for Video Diffusion Models | Nov 27, 2024 | Model CompressionVideo Generation | —Unverified | 0 |
| Inference Optimization of Foundation Models on AI Accelerators | Jul 12, 2024 | Inference OptimizationModel Compression | —Unverified | 0 |
| InfinityDrive: Breaking Time Limits in Driving World Models | Dec 2, 2024 | Autonomous DrivingDiversity | —Unverified | 0 |
| Inflation with Diffusion: Efficient Temporal Adaptation for Text-to-Video Super-Resolution | Jan 18, 2024 | Super-ResolutionVideo Generation | —Unverified | 0 |
| InMoDeGAN: Interpretable Motion Decomposition Generative Adversarial Network for Video Generation | Jan 8, 2021 | Generative Adversarial NetworkVideo Generation | —Unverified | 0 |
| InstanceCap: Improving Text-to-Video Generation via Instance-aware Structured Caption | Dec 12, 2024 | Text-to-Video GenerationVideo Generation | —Unverified | 0 |
| Instructional Video Generation | Dec 5, 2024 | Video Generation | —Unverified | 0 |
| InstructVideo: Instructing Video Diffusion Models with Human Feedback | Dec 19, 2023 | Video Generation | —Unverified | 0 |
| Intelligent Video Editing: Incorporating Modern Talking Face Generation Algorithms in a Video Editor | Oct 16, 2021 | Face GenerationTalking Face Generation | —Unverified | 0 |
| Intention-driven Ego-to-Exo Video Generation | Mar 14, 2024 | Optical Flow EstimationStereo Matching | —Unverified | 0 |
| Interactive Video Generation via Domain Adaptation | May 30, 2025 | AttributeDenoising | —Unverified | 0 |
| InterAnimate: Taming Region-aware Diffusion Model for Realistic Human Interaction Animation | Apr 15, 2025 | DenoisingVideo Generation | —Unverified | 0 |
| InterDyn: Controllable Interactive Dynamics with Video Diffusion Models | Dec 16, 2024 | Video Generation | —Unverified | 0 |
| InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding and Generation | Jul 13, 2023 | Action RecognitionContrastive Learning | —Unverified | 0 |
| Interspatial Attention for Efficient 4D Human Video Generation | May 21, 2025 | Video Generation | —Unverified | 0 |
| Investigating Memorization in Video Diffusion Models | Oct 29, 2024 | MemorizationVideo Generation | —Unverified | 0 |
| IPO: Iterative Preference Optimization for Text-to-Video Generation | Feb 4, 2025 | Large Language ModelText-to-Video Generation | —Unverified | 0 |
| Is Your World Simulator a Good Story Presenter? A Consecutive Events-Based Benchmark for Future Long Video Generation | Dec 17, 2024 | Story CompletionVideo Generation | —Unverified | 0 |
| JavisDiT: Joint Audio-Video Diffusion Transformer with Hierarchical Spatio-Temporal Prior Synchronization | Mar 30, 2025 | Video Generation | —Unverified | 0 |
| Jointly Harnessing Prior Structures and Temporal Consistency for Sign Language Video Generation | Jul 8, 2022 | Image AnimationText Generation | —Unverified | 0 |
| Jointly Trained Image and Video Generation using Residual Vectors | Dec 17, 2019 | DisentanglementImage Generation | —Unverified | 0 |
| JointTuner: Appearance-Motion Adaptive Joint Training for Customized Video Generation | Mar 31, 2025 | Video Generation | —Unverified | 0 |
| JoyHallo: Digital human model for Mandarin | Sep 20, 2024 | modelText Generation | —Unverified | 0 |
| JPEG-LM: LLMs as Image Generators with Canonical Codec Representations | Aug 15, 2024 | Image GenerationQuantization | —Unverified | 0 |
| JVID: Joint Video-Image Diffusion for Visual-Quality and Temporal-Consistency in Video Generation | Sep 21, 2024 | Video Generation | —Unverified | 0 |
| Koala-36M: A Large-scale Video Dataset Improving Consistency between Fine-grained Conditions and Video Content | Oct 10, 2024 | Video AlignmentVideo Generation | —Unverified | 0 |
| Kubrick: Multimodal Agent Collaborations for Synthetic Video Generation | Aug 19, 2024 | Instruction FollowingLarge Language Model | —Unverified | 0 |
| Label-Conditioned Next-Frame Video Generation with Neural Flows | Oct 16, 2019 | Video Generation | —Unverified | 0 |
| LaMD: Latent Motion Diffusion for Image-Conditional Video Generation | Apr 23, 2023 | Motion GenerationVideo Generation | —Unverified | 0 |
| LAMP: Learn A Motion Pattern for Few-Shot Video Generation | Jan 1, 2024 | GPUImage Animation | —Unverified | 0 |
| Large Motion Video Autoencoding with Cross-modal Video VAE | Dec 23, 2024 | Video Generation | —Unverified | 0 |
| Latent-Reframe: Enabling Camera Control for Video Diffusion Model without Training | Dec 8, 2024 | Video Generation | —Unverified | 0 |
| Latent-Shift: Latent Diffusion with Temporal Shift for Efficient Text-to-Video Generation | Apr 17, 2023 | Image GenerationSuper-Resolution | —Unverified | 0 |
| LayerAnimate: Layer-specific Control for Animation | Jan 14, 2025 | Video Generation | —Unverified | 0 |
| Layered Controllable Video Generation | Nov 24, 2021 | Video Generation | —Unverified | 0 |
| Learning Long-Term Style-Preserving Blind Video Temporal Consistency | Mar 12, 2021 | Image ManipulationStyle Transfer | —Unverified | 0 |
| Learning Online Scale Transformation for Talking Head Video Generation | Jul 13, 2024 | Face ReenactmentVideo Generation | —Unverified | 0 |
| Learning Real-World Action-Video Dynamics with Heterogeneous Masked Autoregression | Feb 6, 2025 | Computational EfficiencyVideo Generation | —Unverified | 0 |
| Learnings from Scaling Visual Tokenizers for Reconstruction and Generation | Jan 16, 2025 | DecoderImage Generation | —Unverified | 0 |
| Learning Temporally Consistent Video Depth from Video Diffusion Priors | Jun 3, 2024 | Depth EstimationNovel View Synthesis | —Unverified | 0 |
| Learning to Deblur and Generate High Frame Rate Video with an Event Camera | Mar 2, 2020 | DeblurringVideo Generation | —Unverified | 0 |
| Learning to Generate Videos Using Neural Uncertainty Priors | Jan 1, 2021 | DiversityVideo Generation | —Unverified | 0 |
| Learning Universal Policies via Text-Guided Video Generation | Jan 31, 2023 | Decision MakingImage Generation | —Unverified | 0 |
| Learning World Models for Interactive Video Generation | May 28, 2025 | In-Context LearningRetrieval | —Unverified | 0 |
| Lets Play Music: Audio-driven Performance Video Generation | Nov 5, 2020 | Video Generation | —Unverified | 0 |
| LetsTalk: Latent Diffusion Transformer for Talking Video Synthesis | Nov 24, 2024 | DiversityImage Animation | —Unverified | 0 |