| Interspatial Attention for Efficient 4D Human Video Generation | May 21, 2025 | Video Generation | —Unverified | 0 | 0 |
| Investigating Memorization in Video Diffusion Models | Oct 29, 2024 | Memorization, Video Generation | —Unverified | 0 | 0 |
| IPO: Iterative Preference Optimization for Text-to-Video Generation | Feb 4, 2025 | Large Language Model, Text-to-Video Generation | —Unverified | 0 | 0 |
| Is Your World Simulator a Good Story Presenter? A Consecutive Events-Based Benchmark for Future Long Video Generation | Dec 17, 2024 | Story Completion, Video Generation | —Unverified | 0 | 0 |
| JavisDiT: Joint Audio-Video Diffusion Transformer with Hierarchical Spatio-Temporal Prior Synchronization | Mar 30, 2025 | Video Generation | —Unverified | 0 | 0 |
| Jointly Harnessing Prior Structures and Temporal Consistency for Sign Language Video Generation | Jul 8, 2022 | Image Animation, Text Generation | —Unverified | 0 | 0 |
| Jointly Trained Image and Video Generation using Residual Vectors | Dec 17, 2019 | Disentanglement, Image Generation | —Unverified | 0 | 0 |
| JointTuner: Appearance-Motion Adaptive Joint Training for Customized Video Generation | Mar 31, 2025 | Video Generation | —Unverified | 0 | 0 |
| JoyHallo: Digital human model for Mandarin | Sep 20, 2024 | model, Text Generation | —Unverified | 0 | 0 |
| JPEG-LM: LLMs as Image Generators with Canonical Codec Representations | Aug 15, 2024 | Image Generation, Quantization | —Unverified | 0 | 0 |
| JVID: Joint Video-Image Diffusion for Visual-Quality and Temporal-Consistency in Video Generation | Sep 21, 2024 | Video Generation | —Unverified | 0 | 0 |
| Koala-36M: A Large-scale Video Dataset Improving Consistency between Fine-grained Conditions and Video Content | Oct 10, 2024 | Video Alignment, Video Generation | —Unverified | 0 | 0 |
| Kubrick: Multimodal Agent Collaborations for Synthetic Video Generation | Aug 19, 2024 | Instruction Following, Large Language Model | —Unverified | 0 | 0 |
| Label-Conditioned Next-Frame Video Generation with Neural Flows | Oct 16, 2019 | Video Generation | —Unverified | 0 | 0 |
| LaMD: Latent Motion Diffusion for Image-Conditional Video Generation | Apr 23, 2023 | Motion Generation, Video Generation | —Unverified | 0 | 0 |
| LAMP: Learn A Motion Pattern for Few-Shot Video Generation | Jan 1, 2024 | GPU, Image Animation | —Unverified | 0 | 0 |
| Large Motion Video Autoencoding with Cross-modal Video VAE | Dec 23, 2024 | Video Generation | —Unverified | 0 | 0 |
| Latent-Reframe: Enabling Camera Control for Video Diffusion Model without Training | Dec 8, 2024 | Video Generation | —Unverified | 0 | 0 |
| Latent-Shift: Latent Diffusion with Temporal Shift for Efficient Text-to-Video Generation | Apr 17, 2023 | Image Generation, Super-Resolution | —Unverified | 0 | 0 |
| LayerAnimate: Layer-specific Control for Animation | Jan 14, 2025 | Video Generation | —Unverified | 0 | 0 |
| Layered Controllable Video Generation | Nov 24, 2021 | Video Generation | —Unverified | 0 | 0 |
| Learning Long-Term Style-Preserving Blind Video Temporal Consistency | Mar 12, 2021 | Image Manipulation, Style Transfer | —Unverified | 0 | 0 |
| Learning Online Scale Transformation for Talking Head Video Generation | Jul 13, 2024 | Face Reenactment, Video Generation | —Unverified | 0 | 0 |
| Learning Real-World Action-Video Dynamics with Heterogeneous Masked Autoregression | Feb 6, 2025 | Computational Efficiency, Video Generation | —Unverified | 0 | 0 |
| Learnings from Scaling Visual Tokenizers for Reconstruction and Generation | Jan 16, 2025 | Decoder, Image Generation | —Unverified | 0 | 0 |
| Learning Temporally Consistent Video Depth from Video Diffusion Priors | Jun 3, 2024 | Depth Estimation, Novel View Synthesis | —Unverified | 0 | 0 |
| Learning to Deblur and Generate High Frame Rate Video with an Event Camera | Mar 2, 2020 | Deblurring, Video Generation | —Unverified | 0 | 0 |
| Learning to Generate Videos Using Neural Uncertainty Priors | Jan 1, 2021 | Diversity, Video Generation | —Unverified | 0 | 0 |
| Learning Universal Policies via Text-Guided Video Generation | Jan 31, 2023 | Decision Making, Image Generation | —Unverified | 0 | 0 |
| Learning World Models for Interactive Video Generation | May 28, 2025 | In-Context Learning, Retrieval | —Unverified | 0 | 0 |
| Lets Play Music: Audio-driven Performance Video Generation | Nov 5, 2020 | Video Generation | —Unverified | 0 | 0 |
| LetsTalk: Latent Diffusion Transformer for Talking Video Synthesis | Nov 24, 2024 | Diversity, Image Animation | —Unverified | 0 | 0 |
| Leveraging Pre-Trained Visual Models for AI-Generated Video Detection | Jul 17, 2025 | Misinformation, Video Generation | —Unverified | 0 | 0 |
| License Plate Images Generation with Diffusion Models | Jan 6, 2025 | License Plate Recognition, Synthetic Data Generation | —Unverified | 0 | 0 |
| LiftImage3D: Lifting Any Single Image to 3D Gaussians with Video Generation Priors | Dec 12, 2024 | 3D Reconstruction, Image to 3D | —Unverified | 0 | 0 |
| LinGen: Towards High-Resolution Minute-Length Text-to-Video Generation with Linear Computational Complexity | Dec 13, 2024 | GPU, Mamba | —Unverified | 0 | 0 |
| LipGen: Viseme-Guided Lip Video Generation for Enhancing Visual Speech Recognition | Jan 8, 2025 | Lip Reading, speech-recognition | —Unverified | 0 | 0 |
| LivePhoto: Real Image Animation with Text-guided Motion Control | Dec 5, 2023 | Image Animation, Text-to-Video Generation | —Unverified | 0 | 0 |
| LLM as an Art Director (LaDi): Using LLMs to improve Text-to-Media Generators | Nov 7, 2023 | Image Generation, Retrieval | —Unverified | 0 | 0 |
| LLM-based Realistic Safety-Critical Driving Video Generation | Jul 2, 2025 | Autonomous Driving, Autonomous Vehicles | —Unverified | 0 | 0 |
| LLM-grounded Video Diffusion Models | Sep 29, 2023 | Language Modeling, Language Modelling | —Unverified | 0 | 0 |
| LLMPopcorn: An Empirical Study of LLMs as Assistants for Popular Micro-video Generation | Feb 18, 2025 | Benchmarking, Text Generation | —Unverified | 0 | 0 |
| LMP: Leveraging Motion Prior in Zero-Shot Video Generation with Diffusion Transformer | May 20, 2025 | Image to Video Generation, Video Generation | —Unverified | 0 | 0 |
| Long Context Tuning for Video Generation | Mar 13, 2025 | Video Generation | —Unverified | 0 | 0 |
| LongDiff: Training-Free Long Video Generation in One Go | Mar 23, 2025 | Position, Video Generation | —Unverified | 0 | 0 |
| LongDWM: Cross-Granularity Distillation for Building a Long-Term Driving World Model | Jun 2, 2025 | Video Generation | —Unverified | 0 | 0 |
| Long-Term Human Video Generation of Multiple Futures Using Poses | Apr 16, 2019 | Autonomous Driving, Pose Prediction | —Unverified | 0 | 0 |
| Long Video Diffusion Generation with Segmented Cross-Attention and Content-Rich Video Data Curation | Dec 2, 2024 | Diversity, Video Generation | —Unverified | 0 | 0 |
| Loong: Generating Minute-level Long Videos with Autoregressive Language Models | Oct 3, 2024 | Video Generation | —Unverified | 0 | 0 |
| LoopAnimate: Loopable Salient Object Animation | Apr 14, 2024 | GPU, Object | —Unverified | 0 | 0 |