SOTAVerified|Agents Browse Leaderboard About

Image to Video Generation

Image to Video Generation refers to the task of generating a sequence of video frames based on a single still image or a set of still images. The goal is to produce a video that is coherent and consistent in terms of appearance, motion, and style, while also being temporally consistent, meaning that the generated video should look like a coherent sequence of frames that are temporally ordered. This task is typically tackled using deep generative models, such as Generative Adversarial Networks (GANs) or Variational Autoencoders (VAEs), that are trained on large datasets of videos. The models learn to generate plausible video frames that are conditioned on the input image, as well as on any other auxiliary information, such as a sound or text track.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 71–80 of 85 papers

Title	Date	Tasks	Status
AtomoVideo: High Fidelity Image-to-Video Generation	Mar 4, 2024	Image GenerationImage to Video Generation	—Unverified
Self-Training for Domain Adaptive Scene Text Detection	May 23, 2020	Image to Video GenerationScene Text Detection	—Unverified
SG-I2V: Self-Guided Trajectory Control in Image-to-Video Generation	Nov 7, 2024	Image to Video GenerationVideo Generation	—Unverified
ATI: Any Trajectory Instruction for Controllable Video Generation	May 28, 2025	Image to Video GenerationVideo Generation	—Unverified
SubstationAI: Multimodal Large Model-Based Approaches for Analyzing Substation Equipment Faults	Dec 22, 2024	Data AugmentationFault Diagnosis	—Unverified
Through-The-Mask: Mask-based Motion Trajectories for Image-to-Video Generation	Jan 6, 2025	Image to Video GenerationObject	—Unverified
A Survey of Emerging Approaches and Advances in Video Generation	Nov 9, 2024	Image to Video GenerationLanguage Modeling	—Unverified
TIP-I2V: A Million-Scale Real Text and Image Prompt Dataset for Image-to-Video Generation	Nov 5, 2024	Image to Video GenerationMisinformation	—Unverified
TIV-Diffusion: Towards Object-Centric Movement for Text-driven Image to Video Generation	Dec 13, 2024	Image to Video GenerationObject	—Unverified
Learning to Forecast and Refine Residual Motion for Image-to-Video Generation	Jul 26, 2018	Human Pose ForecastingImage to Video Generation	CodeCode Available

Show:10 25 50

← PrevPage 8 of 9Next →

No leaderboard results yet.