| Hierarchical Patch VAE-GAN: Generating Diverse Videos from a Single Sample | Jun 22, 2020 | Diversity, Video Generation | Code Available | 1 | 5 |
| DirecT2V: Large Language Models are Frame-Level Directors for Zero-Shot Text-to-Video Generation | May 23, 2023 | Text-to-Video Generation, Video Generation | Code Available | 1 | 5 |
| Generative Adversarial Graph Convolutional Networks for Human Action Synthesis | Oct 21, 2021 | Action Generation, Disentanglement | Code Available | 1 | 5 |
| Generating Videos with Dynamics-aware Implicit Generative Adversarial Networks | Feb 21, 2022 | Generative Adversarial Network, Video Generation | Code Available | 1 | 5 |
| Swap Attention in Spatiotemporal Diffusions for Text-to-Video Generation | May 18, 2023 | Image Generation, Text to Image Generation | Code Available | 1 | 5 |
| WoVoGen: World Volume-aware Diffusion for Controllable Multi-camera Driving Scene Generation | Dec 5, 2023 | Autonomous Driving, Diversity | Code Available | 1 | 5 |
| VF-Eval: Evaluating Multimodal LLMs for Generating Feedback on AIGC Videos | May 29, 2025 | Question Answering, Video Generation | Code Available | 0 | 5 |
| VGMShield: Mitigating Misuse of Video Generative Models | Feb 20, 2024 | Video Generation | Code Available | 0 | 5 |
| VG-TVP: Multimodal Procedural Planning via Visually Grounded Text-Video Prompting | Dec 16, 2024 | Informativeness, Large Language Model | Code Available | 0 | 5 |
| DeepLandscape: Adversarial Modeling of Landscape Videos | Aug 1, 2020 | Video Generation | Code Available | 0 | 5 |
| Benchmarking Generative Latent Variable Models for Speech | Feb 22, 2022 | Benchmarking, Image Generation | Code Available | 0 | 5 |
| Vid3D: Synthesis of Dynamic 3D Scenes using 2D Video Diffusion | Jun 17, 2024 | Video Generation | Code Available | 0 | 5 |
| Benchmarking Multi-dimensional AIGC Video Quality Assessment: A Dataset and Unified Model | Jul 31, 2024 | Benchmarking, Large Language Model | Code Available | 0 | 5 |
| Long Context Question Answering via Supervised Contrastive Learning | Dec 16, 2021 | Contrastive Learning, Question Answering | Code Available | 0 | 5 |
| GD-VDM: Generated Depth for better Diffusion-based Video Generation | Jun 19, 2023 | Image Generation, Video Generation | Code Available | 0 | 5 |
| Fundus2Video: Cross-Modal Angiography Video Generation from Static Fundus Photography with Clinical Knowledge Guidance | Aug 27, 2024 | Clinical Knowledge, Lesion Segmentation | Code Available | 0 | 5 |
| UFO: Enhancing Diffusion-Based Video Generation with a Uniform Frame Organizer | Dec 12, 2024 | Video Generation | Code Available | 0 | 5 |
| Unsupervised Learning for Physical Interaction through Video Prediction | May 23, 2016 | Object, Prediction | Code Available | 0 | 5 |
| Trans4D: Realistic Geometry-Aware Transition for Compositional Text-to-4D Synthesis | Oct 9, 2024 | Video Generation | Code Available | 0 | 5 |
| Towards Understanding Unsafe Video Generation | Jul 17, 2024 | Image Generation, Video Generation | Code Available | 0 | 5 |
| Towards Using Clothes Style Transfer for Scenario-aware Person Video Generation | Oct 14, 2021 | Decoder, Style Transfer | Code Available | 0 | 5 |
| Unsupervised object-centric video generation and decomposition in 3D | Jul 7, 2020 | 3D Object Detection, Depth Estimation | Code Available | 0 | 5 |
| Time-Conditioned Generative Modeling of Object-Centric Representations for Video Decomposition and Prediction | Jan 21, 2023 | Disentanglement, Gaussian Processes | Code Available | 0 | 5 |
| CustomTTT: Motion and Appearance Customized Video Generation via Test-Time Training | Dec 20, 2024 | parameter-efficient fine-tuning, Video Generation | Code Available | 0 | 5 |
| Frame-wise Conditioning Adaptation for Fine-Tuning Diffusion Models in Text-to-Video Prediction | Mar 17, 2025 | Video Generation, Video Prediction | Code Available | 0 | 5 |
| Train Sparsely, Generate Densely: Memory-efficient Unsupervised Training of High-resolution Temporal GAN | Nov 22, 2018 | Generative Adversarial Network, Video Generation | Code Available | 0 | 5 |
| Talking Face Generation by Conditional Recurrent Adversarial Network | Apr 13, 2018 | Constrained Lip-synchronization, Face Generation | Code Available | 0 | 5 |
| Sync-DRAW: Automatic Video Generation using Deep Recurrent Attentive Architectures | Nov 30, 2016 | Text-to-Video Generation, Video Generation | Code Available | 0 | 5 |
| A Misleading Gallery of Fluid Motion by Generative Artificial Intelligence | May 24, 2024 | Text Generation, Video Generation | Code Available | 0 | 5 |
| Synthesizing Audio from Silent Video using Sequence to Sequence Modeling | Apr 25, 2024 | Decoder, Diversity | Code Available | 0 | 5 |
| StoryGAN: A Sequential Conditional GAN for Story Visualization | Dec 6, 2018 | Sentence, Story Visualization | Code Available | 0 | 5 |
| Stochastic Video Generation with a Learned Prior | Feb 21, 2018 | Video Generation, Video Prediction | Code Available | 0 | 5 |
| Stochastic Adversarial Video Prediction | Apr 4, 2018 | Prediction, Representation Learning | Code Available | 0 | 5 |
| FIND: Fine-tuning Initial Noise Distribution with Policy Optimization for Diffusion Models | Jul 28, 2024 | Denoising, Video Generation | Code Available | 0 | 5 |
| Stochastic Talking Face Generation Using Latent Distribution Matching | Nov 21, 2020 | Face Generation, Talking Face Generation | Code Available | 0 | 5 |
| Stable Video Diffusion: Scaling Latent Video Diffusion Models to Large Datasets | Nov 25, 2023 | Image Generation, Image to Video Generation | Code Available | 0 | 5 |
| Sounding Video Generator: A Unified Framework for Text-guided Sounding Video Generation | Mar 29, 2023 | Audio Generation, Contrastive Learning | Code Available | 0 | 5 |
| Source Camera Verification from Strongly Stabilized Videos | Nov 26, 2019 | Video Generation | Code Available | 0 | 5 |
| 3-D PET Image Generation with tumour masks using TGAN | Nov 2, 2021 | Image Generation, Image Segmentation | Code Available | 0 | 5 |
| Factorized-Dreamer: Training A High-Quality Video Generator with Limited and Low-Quality Data | Aug 19, 2024 | Descriptive, Image to Video Generation | Code Available | 0 | 5 |
| TwoStreamVAN: Improving Motion Modeling in Video Generation | Dec 3, 2018 | Motion Generation, Video Generation | Code Available | 0 | 5 |
| Attentive Semantic Video Generation using Captions | Aug 20, 2017 | Action Recognition, Style Transfer | Code Available | 0 | 5 |
| ExVideo: Extending Video Diffusion Models via Parameter-Efficient Post-Tuning | Jun 20, 2024 | GPU, Video Generation | Code Available | 0 | 5 |
| Scalable Adaptive Computation for Iterative Generation | Dec 22, 2022 | Image Generation, Video Generation | Code Available | 0 | 5 |
| Consistent Human Image and Video Generation with Spatially Conditioned Diffusion | Dec 19, 2024 | Computational Efficiency, Denoising | Code Available | 0 | 5 |
| RoboScape: Physics-informed Embodied World Model | Jun 29, 2025 | 3D geometry, Depth Estimation | Code Available | 0 | 5 |
| REGIS: Refining Generated Videos via Iterative Stylistic Redesigning | Nov 3, 2023 | Text-to-Video Generation, Video Generation | Code Available | 0 | 5 |
| Everybody Dance Now | Aug 22, 2018 | Face Generation, Image-to-Image Translation | Code Available | 0 | 5 |
| ConMo: Controllable Motion Disentanglement and Recomposition for Zero-Shot Motion Transfer | Apr 3, 2025 | Disentanglement, Motion Disentanglement | Code Available | 0 | 5 |
| Recycle-GAN: Unsupervised Video Retargeting | Aug 15, 2018 | Face to Face Translation, Translation | Code Available | 0 | 5 |