
Image to Video Generation

Image-to-video generation is the task of synthesizing a sequence of video frames from a single still image or a small set of still images. The goal is a video that matches the input in appearance and style, exhibits plausible motion, and is temporally coherent, so that the frames read as a smooth, ordered sequence. The task is typically tackled with deep generative models, such as diffusion models, generative adversarial networks (GANs), or variational autoencoders (VAEs), trained on large video datasets. These models learn to generate plausible frames conditioned on the input image and, optionally, on auxiliary signals such as text or audio.
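
As one concrete illustration of image-conditioned video synthesis with a diffusion model, here is a minimal sketch assuming the Hugging Face diffusers library and its publicly documented StableVideoDiffusionPipeline. The checkpoint ID, input path, and sampling parameters are illustrative examples and are not tied to any paper listed below.

    # Minimal sketch: image-to-video sampling with a latent video diffusion
    # model, assuming the `diffusers` library and the public Stable Video
    # Diffusion checkpoint. Parameters here are illustrative defaults.
    import torch
    from diffusers import StableVideoDiffusionPipeline
    from diffusers.utils import load_image, export_to_video

    pipe = StableVideoDiffusionPipeline.from_pretrained(
        "stabilityai/stable-video-diffusion-img2vid-xt",
        torch_dtype=torch.float16,
        variant="fp16",
    )
    pipe.to("cuda")

    # The conditioning signal: a single still image, resized to the
    # resolution the checkpoint expects.
    image = load_image("input.png")  # any local path or URL
    image = image.resize((1024, 576))

    # Sample a short clip conditioned on the image. decode_chunk_size trades
    # peak memory for speed when decoding latents back into frames.
    generator = torch.manual_seed(42)
    frames = pipe(image, decode_chunk_size=8, generator=generator).frames[0]

    export_to_video(frames, "generated.mp4", fps=7)

GAN- and VAE-based approaches follow the same interface conceptually: encode the input image, then roll out a frame sequence conditioned on that encoding.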

Papers

Showing 21–30 of 85 papers

Title | Status | Hype
Conditional Image-to-Video Generation with Latent Flow Diffusion Models | Code | 2
SceneRF: Self-Supervised Monocular 3D Scene Reconstruction with Radiance Fields | Code | 2
Collaborative Neural Rendering using Anime Character Sheets | Code | 2
DualDiff+: Dual-Branch Diffusion for High-Fidelity Video Generation with Reward Guidance | Code | 1
Extrapolating and Decoupling Image-to-Video Generation Models: Motion Modeling is Easier Than You Think | Code | 1
Object-Centric Image to Video Generation with Language Guidance | Code | 1
Redefining Temporal Modeling in Video Diffusion: The Vectorized Timestep Approach | Code | 1
MMTrail: A Multimodal Trailer Video Dataset with Language and Music Descriptions | Code | 1
MVOC: a training-free multiple video object composition method with diffusion models | Code | 1
TC-Bench: Benchmarking Temporal Compositionality in Text-to-Video and Image-to-Video Generation | Code | 1

Leaderboard

No leaderboard results yet.