| LIFI: Towards Linguistically Informed Frame Interpolation | Oct 30, 2020 | Video Generation | CodeCode Available | 0 | 5 |
| Generative Video Bi-flow | Mar 9, 2025 | Unconditional Video GenerationVideo Generation | CodeCode Available | 0 | 5 |
| Explainable and Controllable Motion Curve Guided Cardiac Ultrasound Video Generation | Jul 31, 2024 | PositionVideo Generation | CodeCode Available | 0 | 5 |
| Learning to Forecast and Refine Residual Motion for Image-to-Video Generation | Jul 26, 2018 | Human Pose ForecastingImage to Video Generation | CodeCode Available | 0 | 5 |
| Train Sparsely, Generate Densely: Memory-efficient Unsupervised Training of High-resolution Temporal GAN | Nov 22, 2018 | Generative Adversarial NetworkVideo Generation | CodeCode Available | 0 | 5 |
| Generating time-consistent dynamics with discriminator-guided image diffusion models | May 14, 2025 | Video Generation | —Unverified | 0 | 0 |
| Generating Persuasive Visual Storylines for Promotional Videos | Aug 30, 2019 | ClusteringPersuasiveness | —Unverified | 0 | 0 |
| Deep Video Generation, Prediction and Completion of Human Action Sequences | Nov 23, 2017 | Human action generationPrediction | —Unverified | 0 | 0 |
| Generating Human Action Videos by Coupling 3D Game Engines and Probabilistic Graphical Models | Oct 12, 2019 | Action RecognitionOptical Flow Estimation | —Unverified | 0 | 0 |
| DeepVerse: 4D Autoregressive Video Generation as a World Model | Jun 1, 2025 | Video Generation | —Unverified | 0 | 0 |
| Generating, Fast and Slow: Scalable Parallel Video Generation with Video Interface Networks | Mar 21, 2025 | DenoisingOptical Flow Estimation | —Unverified | 0 | 0 |
| Gender Bias in Text-to-Video Generation Models: A case study of Sora | Dec 30, 2024 | Text-to-Video GenerationVideo Generation | —Unverified | 0 | 0 |
| GenDeF: Learning Generative Deformation Field for Video Generation | Dec 7, 2023 | DisentanglementVideo Editing | —Unverified | 0 | 0 |
| GenDDS: Generating Diverse Driving Video Scenarios with Prompt-to-Video Generative Model | Aug 28, 2024 | Autonomous DrivingData Augmentation | —Unverified | 0 | 0 |
| DeepRhythm: Exposing DeepFakes with Attentional Visual Heartbeat Rhythms | Jun 13, 2020 | DeepFake DetectionFace Swapping | —Unverified | 0 | 0 |
| DeepHS-HDRVideo: Deep High Speed High Dynamic Range Video Reconstruction | Oct 10, 2022 | Optical Flow EstimationVideo Frame Interpolation | —Unverified | 0 | 0 |
| Gen2Act: Human Video Generation in Novel Scenarios enables Generalizable Robot Manipulation | Sep 24, 2024 | Robot ManipulationVideo Generation | —Unverified | 0 | 0 |
| Decouple Content and Motion for Conditional Image-to-Video Generation | Nov 24, 2023 | Image to Video GenerationVideo Generation | —Unverified | 0 | 0 |
| AnimateAnything: Consistent and Controllable Animation for Video Generation | Nov 16, 2024 | Video Generation | —Unverified | 0 | 0 |
| Modular Action Concept Grounding in Semantic Video Prediction | Nov 23, 2020 | Action RecognitionMixture-of-Experts | —Unverified | 0 | 0 |
| GameFactory: Creating New Games with Generative Interactive Videos | Jan 14, 2025 | Domain GeneralizationMinecraft | —Unverified | 0 | 0 |
| GAIA-2: A Controllable Multi-View Generative World Model for Autonomous Driving | Mar 26, 2025 | Autonomous DrivingVideo Generation | —Unverified | 0 | 0 |
| G3PT: Unleash the power of Autoregressive Modeling in 3D Generation via Cross-scale Querying Transformer | Sep 10, 2024 | 3D GenerationVideo Generation | —Unverified | 0 | 0 |
| FVD: A new Metric for Video Generation | Mar 27, 2019 | DiversityRepresentation Learning | —Unverified | 0 | 0 |
| AV-Link: Temporally-Aligned Diffusion Features for Cross-Modal Audio-Video Generation | Dec 19, 2024 | Video GenerationVideo Synchronization | —Unverified | 0 | 0 |
| FuXi-RTM: A Physics-Guided Prediction Framework with Radiative Transfer Modeling | Mar 25, 2025 | Deep LearningVideo Generation | —Unverified | 0 | 0 |
| Fundus to Fluorescein Angiography Video Generation as a Retinal Generative Foundation Model | Oct 17, 2024 | Disease PredictionGenerative Adversarial Network | —Unverified | 0 | 0 |
| AV-DiT: Efficient Audio-Visual Diffusion Transformer for Joint Audio and Video Generation | Jun 11, 2024 | Audio GenerationVideo Generation | —Unverified | 0 | 0 |
| AniGS: Animatable Gaussian Avatar from a Single Image with Inconsistent Gaussian Reconstruction | Dec 3, 2024 | 3D ReconstructionVideo Generation | —Unverified | 0 | 0 |
| FullDiT: Multi-Task Video Generative Foundation Model with Full Attention | Mar 25, 2025 | Video Generation | —Unverified | 0 | 0 |
| FullDiT2: Efficient In-Context Conditioning for Video Diffusion Transformers | Jun 4, 2025 | Video EditingVideo Generation | —Unverified | 0 | 0 |
| Full-body High-resolution Anime Generation with Progressive Structure-conditional Generative Adversarial Networks | Sep 6, 2018 | UnityVideo Generation | —Unverified | 0 | 0 |
| DAPE: Dual-Stage Parameter-Efficient Fine-Tuning for Consistent Video Editing with Diffusion Models | May 11, 2025 | parameter-efficient fine-tuningVideo Alignment | —Unverified | 0 | 0 |
| AvatarShield: Visual Reinforcement Learning for Human-Centric Video Forgery Detection | May 21, 2025 | reinforcement-learningReinforcement Learning | —Unverified | 0 | 0 |
| From Slow Bidirectional to Fast Autoregressive Video Diffusion Models | Dec 10, 2024 | GPUVideo Generation | —Unverified | 0 | 0 |
| From Single Images to Motion Policies via Video-Generation Environment Representations | May 25, 2025 | Depth EstimationMonocular Depth Estimation | —Unverified | 0 | 0 |
| From Generation to Generalization: Emergent Few-Shot Learning in Video Diffusion Models | Jun 8, 2025 | ARCFew-Shot Learning | —Unverified | 0 | 0 |
| Dance Any Beat: Blending Beats with Visuals in Dance Video Generation | May 15, 2024 | Image to Video GenerationOptical Flow Estimation | —Unverified | 0 | 0 |
| AUTV: Creating Underwater Video Datasets with Pixel-wise Annotations | Mar 17, 2025 | Semantic SegmentationVideo Generation | —Unverified | 0 | 0 |
| AniClipart: Clipart Animation with Text-to-Video Priors | Apr 18, 2024 | Image to Video GenerationText-to-Video Generation | —Unverified | 0 | 0 |
| Action Concept Grounding Network for Semantically-Consistent Video Generation | Sep 28, 2020 | Action Recognitionobject-detection | —Unverified | 0 | 0 |
| 4Diffusion: Multi-view Video Diffusion Model for 4D Generation | May 31, 2024 | NeRFVideo Generation | —Unverified | 0 | 0 |
| FreqPrior: Improving Video Diffusion Models with Frequency Filtering Gaussian Noise | Feb 5, 2025 | Video Generation | —Unverified | 0 | 0 |
| CyberHost: Taming Audio-driven Avatar Diffusion Model with Region Codebook Attention | Sep 3, 2024 | Human AnimationVideo Generation | —Unverified | 0 | 0 |
| FreeLong: Training-Free Long Video Generation with SpectralBlend Temporal Attention | Jul 29, 2024 | DenoisingVideo Generation | —Unverified | 0 | 0 |
| Free-Form Motion Control: A Synthetic Video Generation Dataset with Controllable Camera and Object Motions | Jan 2, 2025 | FormVideo Generation | —Unverified | 0 | 0 |
| CustomVideoX: 3D Reference Attention Driven Dynamic Adaptation for Zero-Shot Customized Video Diffusion Transformers | Feb 10, 2025 | Image GenerationVideo Generation | —Unverified | 0 | 0 |
| Anchored Diffusion for Video Face Reenactment | Jul 21, 2024 | Face ReenactmentVideo Generation | —Unverified | 0 | 0 |
| Free^2Guide: Gradient-Free Path Integral Control for Enhancing Text-to-Video Generation with Large Vision-Language Models | Nov 26, 2024 | Reinforcement Learning (RL)Text-to-Video Generation | —Unverified | 0 | 0 |
| CustomVideo: Customizing Text-to-Video Generation with Multiple Subjects | Jan 18, 2024 | ObjectText-to-Video Generation | —Unverified | 0 | 0 |