| VCapsBench: A Large-scale Fine-grained Benchmark for Video Caption Quality Evaluation | May 29, 2025 | Caption GenerationLanguage Modeling | CodeCode Available | 1 | 5 |
| CascadeV: An Implementation of Wurstchen Architecture for Video Generation | Jan 28, 2025 | 2kVideo Generation | CodeCode Available | 1 | 5 |
| Learn the Force We Can: Enabling Sparse Motion Control in Multi-Object Video Generation | Jun 6, 2023 | ObjectVideo Generation | CodeCode Available | 1 | 5 |
| LAVIE: High-Quality Video Generation with Cascaded Latent Diffusion Models | Sep 26, 2023 | Super-ResolutionText-to-Video Generation | CodeCode Available | 1 | 5 |
| Latent Image Animator: Learning to animate image via latent space navigation | Sep 29, 2021 | Image AnimationVideo Generation | CodeCode Available | 1 | 5 |
| Latent Neural Differential Equations for Video Generation | Nov 7, 2020 | Unconditional Video GenerationVideo Generation | CodeCode Available | 1 | 5 |
| DragVideo: Interactive Drag-style Video Editing | Dec 3, 2023 | Video EditingVideo Generation | CodeCode Available | 1 | 5 |
| DirecT2V: Large Language Models are Frame-Level Directors for Zero-Shot Text-to-Video Generation | May 23, 2023 | Text-to-Video GenerationVideo Generation | CodeCode Available | 1 | 5 |
| Latent Video Transformer | Jun 18, 2020 | Video GenerationVideo Prediction | CodeCode Available | 1 | 5 |
| Sliced Wasserstein Generative Models | Apr 10, 2019 | Image GenerationVideo Generation | CodeCode Available | 1 | 5 |
| InTraGen: Trajectory-controlled Video Generation for Object Interactions | Nov 25, 2024 | ObjectVideo Generation | CodeCode Available | 1 | 5 |
| DLFR-VAE: Dynamic Latent Frame Rate VAE for Video Generation | Feb 17, 2025 | Video Generation | CodeCode Available | 1 | 5 |
| CamContextI2V: Context-aware Controllable Video Generation | Apr 8, 2025 | DiversityScene Understanding | CodeCode Available | 1 | 5 |
| INR-V: A Continuous Representation Space for Video-based Generative Tasks | Oct 29, 2022 | Video GenerationVideo Inpainting | CodeCode Available | 1 | 5 |
| Diverse Video Generation using a Gaussian Process Trigger | Jul 9, 2021 | DiversityVideo Generation | CodeCode Available | 1 | 5 |
| Diverse Video Generation from a Single Video | May 11, 2022 | Video Generation | CodeCode Available | 1 | 5 |
| Diverse Generation from a Single Video Made Possible | Sep 17, 2021 | Video GenerationVideo Inpainting | CodeCode Available | 1 | 5 |
| Diverse and Aligned Audio-to-Video Generation via Text-to-Video Model Adaptation | Sep 28, 2023 | Text-to-Video GenerationVideo Generation | CodeCode Available | 1 | 5 |
| TrailBlazer: Trajectory Control for Diffusion-Based Video Generation | Dec 31, 2023 | Video Generation | CodeCode Available | 1 | 5 |
| Free-Editor: Zero-shot Text-driven 3D Scene Editing | Dec 21, 2023 | 3D scene EditingStyle Transfer | CodeCode Available | 1 | 5 |
| Inference-Time Text-to-Video Alignment with Diffusion Latent Beam Search | Jan 31, 2025 | DenoisingVideo Alignment | CodeCode Available | 1 | 5 |
| BusterX: MLLM-Powered AI-Generated Video Forgery Detection and Explanation | May 19, 2025 | Binary ClassificationDeepFake Detection | CodeCode Available | 1 | 5 |
| Improved Training Technique for Latent Consistency Models | Feb 3, 2025 | Video Generation | CodeCode Available | 1 | 5 |
| FreeNoise: Tuning-Free Longer Video Diffusion via Noise Rescheduling | Oct 23, 2023 | Video Generation | CodeCode Available | 1 | 5 |
| Infrared Small Target Detection in Satellite Videos: A New Dataset and A Novel Recurrent Feature Refinement Framework | Sep 19, 2024 | Motion CompensationVideo Generation | CodeCode Available | 1 | 5 |
| 3D-Aware Video Generation | Jun 29, 2022 | Image GenerationVideo Generation | CodeCode Available | 1 | 5 |
| Towards High Resolution Video Generation with Progressive Growing of Sliced Wasserstein GANs | Oct 4, 2018 | Action RecognitionImage Generation | CodeCode Available | 1 | 5 |
| Towards Accurate Guided Diffusion Sampling through Symplectic Adjoint Method | Dec 19, 2023 | Video Generation | CodeCode Available | 1 | 5 |
| Towards End-to-End Generative Modeling of Long Videos with Memory-Efficient Bidirectional Transformers | Mar 20, 2023 | Video Generation | CodeCode Available | 1 | 5 |
| Towards Smooth Video Composition | Dec 14, 2022 | Image Generationsingle-image-generation | CodeCode Available | 1 | 5 |
| Diffusion Transformers for Tabular Data Time Series Generation | Apr 10, 2025 | Tabular Data GenerationTime Series | CodeCode Available | 1 | 5 |
| Timestep Embedding Tells: It's Time to Cache for Video Diffusion Model | Nov 28, 2024 | DenoisingVideo Generation | CodeCode Available | 1 | 5 |
| Diffusion Probabilistic Modeling for Video Generation | Mar 16, 2022 | DenoisingImage Generation | CodeCode Available | 1 | 5 |
| Diffusion Models for Video Prediction and Infilling | Jun 15, 2022 | PredictionVideo Generation | CodeCode Available | 1 | 5 |
| D'ARTAGNAN: Counterfactual Video Generation | Jun 3, 2022 | Anatomycounterfactual | CodeCode Available | 1 | 5 |
| StyleFaceV: Face Video Generation via Decomposing and Recomposing Pretrained StyleGAN3 | Aug 16, 2022 | Image GenerationVideo Generation | CodeCode Available | 1 | 5 |
| Hierarchical Patch VAE-GAN: Generating Diverse Videos from a Single Sample | Jun 22, 2020 | DiversityVideo Generation | CodeCode Available | 1 | 5 |
| The Best of Both Worlds: Integrating Language Models and Diffusion Models for Video Generation | Mar 6, 2025 | Semantic CompressionVideo Generation | CodeCode Available | 1 | 5 |
| Toward Lightweight and Fast Decoders for Diffusion Models in Image and Video Generation | Mar 6, 2025 | DecoderGPU | CodeCode Available | 1 | 5 |
| DDLP: Unsupervised Object-Centric Video Prediction with Deep Dynamic Latent Particles | Jun 9, 2023 | ObjectPosition | CodeCode Available | 1 | 5 |
| GODIVA: Generating Open-DomaIn Videos from nAtural Descriptions | Apr 30, 2021 | Text-to-Video GenerationVideo Generation | CodeCode Available | 1 | 5 |
| SyncVP: Joint Diffusion for Synchronous Multi-Modal Video Prediction | Mar 24, 2025 | Video GenerationVideo Prediction | CodeCode Available | 1 | 5 |
| Generative Recommendation: Towards Next-generation Recommender Paradigm | Apr 7, 2023 | Recommendation SystemsRetrieval | CodeCode Available | 1 | 5 |
| BIVDiff: A Training-Free Framework for General-Purpose Video Synthesis via Bridging Image and Video Diffusion Models | Dec 5, 2023 | Image GenerationModel Selection | CodeCode Available | 1 | 5 |
| Generative Modeling of Weights: Generalization or Memorization? | Jun 9, 2025 | MemorizationVideo Generation | CodeCode Available | 1 | 5 |
| LOVE: Benchmarking and Evaluating Text-to-Video Generation and Video-to-Text Interpretation | May 17, 2025 | BenchmarkingQuestion Answering | CodeCode Available | 1 | 5 |
| Generative Adversarial Graph Convolutional Networks for Human Action Synthesis | Oct 21, 2021 | Action GenerationDisentanglement | CodeCode Available | 1 | 5 |
| Bidirectionally Deformable Motion Modulation For Video-based Human Pose Transfer | Jul 15, 2023 | motion predictionPose Transfer | CodeCode Available | 1 | 5 |
| Generating Videos with Dynamics-aware Implicit Generative Adversarial Networks | Feb 21, 2022 | Generative Adversarial NetworkVideo Generation | CodeCode Available | 1 | 5 |
| Generative Disco: Text-to-Video Generation for Music Visualization | Apr 17, 2023 | Text-to-Video GenerationVideo Generation | CodeCode Available | 1 | 5 |