| TALC: Time-Aligned Captions for Multi-Scene Text-to-Video Generation | May 7, 2024 | Text-to-Video Generation, Video Generation | Code Available | 1 | 5 |
| TAVGBench: Benchmarking Text to Audible-Video Generation | Apr 22, 2024 | Benchmarking, Contrastive Learning | Code Available | 1 | 5 |
| Click to Move: Controlling Video Generation with Sparse Motion | Aug 19, 2021 | Video Generation | Code Available | 1 | 5 |
| MagicStick: Controllable Video Editing via Control Handle Transformations | Dec 5, 2023 | Video Editing, Video Generation | Code Available | 1 | 5 |
| ClassDiffusion: More Aligned Personalization Tuning with Explicit Class Guidance | May 27, 2024 | Diffusion Personalization, Video Generation | Code Available | 1 | 5 |
| CineTechBench: A Benchmark for Cinematographic Technique Understanding and Generation | May 21, 2025 | Video Generation | Code Available | 1 | 5 |
| DyDiT++: Dynamic Diffusion Transformers for Efficient Visual Generation | Apr 9, 2025 | Image Generation, Text to Image Generation | Code Available | 1 | 5 |
| Extrapolating and Decoupling Image-to-Video Generation Models: Motion Modeling is Easier Than You Think | Mar 2, 2025 | Denoising, Image to Video Generation | Code Available | 1 | 5 |
| DwNet: Dense warp-based network for pose-guided human video generation | Oct 21, 2019 | Video Generation | Code Available | 1 | 5 |
| A Good Image Generator Is What You Need for High-Resolution Video Synthesis | Apr 30, 2021 | Video Generation | Code Available | 1 | 5 |
| DVD-Quant: Data-free Video Diffusion Transformers Quantization | May 24, 2025 | Data Free Quantization, Quantization | Code Available | 1 | 5 |
| DualDiff+: Dual-Branch Diffusion for High-Fidelity Video Generation with Reward Guidance | Mar 5, 2025 | 3D Object Detection, BEV Segmentation | Code Available | 1 | 5 |
| DTVNet: Dynamic Time-lapse Video Generation via Single Still Image | Aug 11, 2020 | Decoder, Optical Flow Estimation | Code Available | 1 | 5 |
| Ophora: A Large-Scale Data-Driven Text-Guided Ophthalmic Surgical Video Generation Model | May 12, 2025 | Video Generation | Code Available | 1 | 5 |
| LOVE: Benchmarking and Evaluating Text-to-Video Generation and Video-to-Text Interpretation | May 17, 2025 | Benchmarking, Question Answering | Code Available | 1 | 5 |
| SyncVP: Joint Diffusion for Synchronous Multi-Modal Video Prediction | Mar 24, 2025 | Video Generation, Video Prediction | Code Available | 1 | 5 |
| DropletVideo: A Dataset and Approach to Explore Integral Spatio-Temporal Consistent Video Generation | Mar 8, 2025 | Video Generation | Code Available | 1 | 5 |
| Learn the Force We Can: Enabling Sparse Motion Control in Multi-Object Video Generation | Jun 6, 2023 | Object, Video Generation | Code Available | 1 | 5 |
| C-Drag: Chain-of-Thought Driven Motion Controller for Video Generation | Feb 27, 2025 | Object, Video Generation | Code Available | 1 | 5 |
| SViMo: Synchronized Diffusion for Video and Motion Generation in Hand-object Interaction Scenarios | Jun 3, 2025 | Motion Generation, Video Generation | Code Available | 1 | 5 |
| CCVS: Context-aware Controllable Video Synthesis | Jul 16, 2021 | Decoder, Optical Flow Estimation | Code Available | 1 | 5 |
| Latent Video Transformer | Jun 18, 2020 | Video Generation, Video Prediction | Code Available | 1 | 5 |
| DriveCamSim: Generalizable Camera Simulation via Explicit Camera Modeling for Autonomous Driving | May 26, 2025 | Autonomous Driving, Video Generation | Code Available | 1 | 5 |
| Latent Neural Differential Equations for Video Generation | Nov 7, 2020 | Unconditional Video Generation, Video Generation | Code Available | 1 | 5 |
| DirecT2V: Large Language Models are Frame-Level Directors for Zero-Shot Text-to-Video Generation | May 23, 2023 | Text-to-Video Generation, Video Generation | Code Available | 1 | 5 |
| FEAT: Full-Dimensional Efficient Attention Transformer for Medical Video Generation | Jun 5, 2025 | Denoising, Video Generation | Code Available | 1 | 5 |
| Latent Image Animator: Learning to animate image via latent space navigation | Sep 29, 2021 | Image Animation, Video Generation | Code Available | 1 | 5 |
| FETV: A Benchmark for Fine-Grained Evaluation of Open-Domain Text-to-Video Generation | Nov 3, 2023 | Text-to-Video Generation, Video Generation | Code Available | 1 | 5 |
| Playable Video Generation | Jan 28, 2021 | Decoder, Video Generation | Code Available | 1 | 5 |
| PoM: Efficient Image and Video Generation with the Polynomial Mixer | Nov 19, 2024 | Video Generation | Code Available | 1 | 5 |
| LAVIE: High-Quality Video Generation with Cascaded Latent Diffusion Models | Sep 26, 2023 | Super-Resolution, Text-to-Video Generation | Code Available | 1 | 5 |
| Text2Video: Text-driven Talking-head Video Synthesis with Personalized Phoneme-Pose Dictionary | Apr 29, 2021 | Face Generation, Generative Adversarial Network | Code Available | 1 | 5 |
| INR-V: A Continuous Representation Space for Video-based Generative Tasks | Oct 29, 2022 | Video Generation, Video Inpainting | Code Available | 1 | 5 |
| CascadeV: An Implementation of Wurstchen Architecture for Video Generation | Jan 28, 2025 | 2k, Video Generation | Code Available | 1 | 5 |
| Infrared Small Target Detection in Satellite Videos: A New Dataset and A Novel Recurrent Feature Refinement Framework | Sep 19, 2024 | Motion Compensation, Video Generation | Code Available | 1 | 5 |
| InTraGen: Trajectory-controlled Video Generation for Object Interactions | Nov 25, 2024 | Object, Video Generation | Code Available | 1 | 5 |
| Inference-Time Text-to-Video Alignment with Diffusion Latent Beam Search | Jan 31, 2025 | Denoising, Video Alignment | Code Available | 1 | 5 |
| Constrained Synthesis with Projected Diffusion Models | Feb 5, 2024 | Motion Synthesis, Video Generation | Code Available | 1 | 5 |
| StyleFaceV: Face Video Generation via Decomposing and Recomposing Pretrained StyleGAN3 | Aug 16, 2022 | Image Generation, Video Generation | Code Available | 1 | 5 |
| StyleGAN-V: A Continuous Video Generator with the Price, Image Quality and Perks of StyleGAN2 | Dec 29, 2021 | Video Generation | Code Available | 1 | 5 |
| Structure-Aware Human-Action Generation | Jul 4, 2020 | Action Generation, graph construction | Code Available | 1 | 5 |
| DragVideo: Interactive Drag-style Video Editing | Dec 3, 2023 | Video Editing, Video Generation | Code Available | 1 | 5 |
| Improved Training Technique for Latent Consistency Models | Feb 3, 2025 | Video Generation | Code Available | 1 | 5 |
| StyleCineGAN: Landscape Cinemagraph Generation using a Pre-trained StyleGAN | Mar 21, 2024 | Unconditional Video Generation, Video Generation | Code Available | 1 | 5 |
| FlexDiT: Dynamic Token Density Control for Diffusion Transformer | Dec 8, 2024 | Computational Efficiency, Denoising | Code Available | 1 | 5 |
| DLFR-VAE: Dynamic Latent Frame Rate VAE for Video Generation | Feb 17, 2025 | Video Generation | Code Available | 1 | 5 |
| CamContextI2V: Context-aware Controllable Video Generation | Apr 8, 2025 | Diversity, Scene Understanding | Code Available | 1 | 5 |
| Diverse Video Generation using a Gaussian Process Trigger | Jul 9, 2021 | Diversity, Video Generation | Code Available | 1 | 5 |
| Diverse Video Generation from a Single Video | May 11, 2022 | Video Generation | Code Available | 1 | 5 |
| Diverse Generation from a Single Video Made Possible | Sep 17, 2021 | Video Generation, Video Inpainting | Code Available | 1 | 5 |