SOTAVerified

Image to Video Generation

Image to Video Generation refers to the task of generating a sequence of video frames based on a single still image or a set of still images. The goal is to produce a video that is coherent and consistent in terms of appearance, motion, and style, while also being temporally consistent, meaning that the generated video should look like a coherent sequence of frames that are temporally ordered. This task is typically tackled using deep generative models, such as Generative Adversarial Networks (GANs) or Variational Autoencoders (VAEs), that are trained on large datasets of videos. The models learn to generate plausible video frames that are conditioned on the input image, as well as on any other auxiliary information, such as a sound or text track.

Title	Date	Tasks	Status
I2V3D: Controllable image-to-video generation with 3D guidance	Mar 12, 2025	3D geometryImage to Video Generation	—Unverified
I2VGuard: Safeguarding Images against Misuse in Diffusion-based Image-to-Video Models	Jan 1, 2025	Adversarial AttackImage to Video Generation	—Unverified
Identifying and Solving Conditional Image Leakage in Image-to-Video Diffusion Model	Jun 22, 2024	AttributeImage to Video Generation	—Unverified
Dynamic-I2V: Exploring Image-to-Video Generaion Models via Multimodal LLM	May 26, 2025	Image to Video GenerationVideo Generation	—Unverified
Image-to-Video Generation via 3D Facial Dynamics	May 31, 2021	Image to Video GenerationVideo Generation	—Unverified

Title

Status

Hype

I2V3D: Controllable image-to-video generation with 3D guidance

—Unverified

I2VGuard: Safeguarding Images against Misuse in Diffusion-based Image-to-Video Models

—Unverified

Identifying and Solving Conditional Image Leakage in Image-to-Video Diffusion Model

—Unverified

Dynamic-I2V: Exploring Image-to-Video Generaion Models via Multimodal LLM

—Unverified

Image-to-Video Generation via 3D Facial Dynamics

—Unverified

No leaderboard results yet.

Image to Video Generation

Papers