Video Generation

(Various Video Generation Tasks. GIF credit: MaGViT)

Papers

Showing 1001–1050 of 1466 papers

Title	Date	Tasks	Status
Framer: Interactive Frame Interpolation	Oct 24, 2024	Image Morphing, Video Generation	Unverified
Free^2Guide: Gradient-Free Path Integral Control for Enhancing Text-to-Video Generation with Large Vision-Language Models	Nov 26, 2024	Reinforcement Learning (RL), Text-to-Video Generation	Unverified
Free-Form Motion Control: A Synthetic Video Generation Dataset with Controllable Camera and Object Motions	Jan 2, 2025	Form, Video Generation	Unverified
FreeLong: Training-Free Long Video Generation with SpectralBlend Temporal Attention	Jul 29, 2024	Denoising, Video Generation	Unverified
FreqPrior: Improving Video Diffusion Models with Frequency Filtering Gaussian Noise	Feb 5, 2025	Video Generation	Unverified
From Generation to Generalization: Emergent Few-Shot Learning in Video Diffusion Models	Jun 8, 2025	ARC, Few-Shot Learning	Unverified
From Single Images to Motion Policies via Video-Generation Environment Representations	May 25, 2025	Depth Estimation, Monocular Depth Estimation	Unverified
From Slow Bidirectional to Fast Autoregressive Video Diffusion Models	Dec 10, 2024	GPU, Video Generation	Unverified
Full-body High-resolution Anime Generation with Progressive Structure-conditional Generative Adversarial Networks	Sep 6, 2018	Unity, Video Generation	Unverified
FullDiT2: Efficient In-Context Conditioning for Video Diffusion Transformers	Jun 4, 2025	Video Editing, Video Generation	Unverified
FullDiT: Multi-Task Video Generative Foundation Model with Full Attention	Mar 25, 2025	Video Generation	Unverified
Fundus to Fluorescein Angiography Video Generation as a Retinal Generative Foundation Model	Oct 17, 2024	Disease Prediction, Generative Adversarial Network	Unverified
FuXi-RTM: A Physics-Guided Prediction Framework with Radiative Transfer Modeling	Mar 25, 2025	Deep Learning, Video Generation	Unverified
FVD: A new Metric for Video Generation	Mar 27, 2019	Diversity, Representation Learning	Unverified
G3PT: Unleash the power of Autoregressive Modeling in 3D Generation via Cross-scale Querying Transformer	Sep 10, 2024	3D Generation, Video Generation	Unverified
GAIA-2: A Controllable Multi-View Generative World Model for Autonomous Driving	Mar 26, 2025	Autonomous Driving, Video Generation	Unverified
GameFactory: Creating New Games with Generative Interactive Videos	Jan 14, 2025	Domain Generalization, Minecraft	Unverified
Gen2Act: Human Video Generation in Novel Scenarios enables Generalizable Robot Manipulation	Sep 24, 2024	Robot Manipulation, Video Generation	Unverified
GenDDS: Generating Diverse Driving Video Scenarios with Prompt-to-Video Generative Model	Aug 28, 2024	Autonomous Driving, Data Augmentation	Unverified
GenDeF: Learning Generative Deformation Field for Video Generation	Dec 7, 2023	Disentanglement, Video Editing	Unverified
Gender Bias in Text-to-Video Generation Models: A case study of Sora	Dec 30, 2024	Text-to-Video Generation, Video Generation	Unverified
Generating, Fast and Slow: Scalable Parallel Video Generation with Video Interface Networks	Mar 21, 2025	Denoising, Optical Flow Estimation	Unverified
Generating Human Action Videos by Coupling 3D Game Engines and Probabilistic Graphical Models	Oct 12, 2019	Action Recognition, Optical Flow Estimation	Unverified
Generating Persuasive Visual Storylines for Promotional Videos	Aug 30, 2019	Clustering, Persuasiveness	Unverified
Generating time-consistent dynamics with discriminator-guided image diffusion models	May 14, 2025	Video Generation	Unverified
Generating Videos with Scene Dynamics	Sep 8, 2016	Action Classification, Future prediction	Unverified
Generative AI for Autonomous Driving: A Review	May 21, 2025	Autonomous Driving, Image Generation	Unverified
Generative Ghost: Investigating Ranking Bias Hidden in AI-Generated Videos	Feb 11, 2025	Contrastive Learning, Image Retrieval	Unverified
Generative Pre-trained Autoregressive Diffusion Transformer	May 12, 2025	Few-Shot Learning, Video Generation	Unverified
Generative Rendering: Controllable 4D-Guided Video Generation with 2D Diffusion Models	Dec 3, 2023	Image Generation, Text to Image Generation	Unverified
Generative Video Propagation	Dec 27, 2024	Image to Video Generation, Video Generation	Unverified
Generative Video Transformer: Can Objects be the Words?	Jul 20, 2021	GPU, Scene Understanding	Unverified
GenLit: Reformulating Single-Image Relighting as Video Generation	Dec 15, 2024	Image Generation, Image Relighting	Unverified
GenMAC: Compositional Text-to-Video Generation with Multi-Agent Collaboration	Dec 5, 2024	Attribute, Hallucination	Unverified
GenTron: Diffusion Transformers for Image and Video Generation	Dec 7, 2023	Text-to-Video Generation, Video Generation	Unverified
GenVidBench: A Challenging Benchmark for Detecting AI-Generated Video	Jan 20, 2025	Video Classification, Video Generation	Unverified
GenWorld: Towards Detecting AI-generated Real-world Simulation Videos	Jun 12, 2025	Video Generation	Unverified
GeoMan: Temporally Consistent Human Geometry Estimation using Image-to-Video Diffusion	May 29, 2025	Depth Estimation, Image to Video Generation	Unverified
Geometry-aware 4D Video Generation for Robot Manipulation	Jul 1, 2025	Robot Manipulation, Video Generation	Unverified
GEVRM: Goal-Expressive Video Generation Model For Robust Visual Manipulation	Feb 13, 2025	Contrastive Learning, Video Generation	Unverified
GigaVideo-1: Advancing Video Generation via Automatic Feedback with 4 GPU-Hours Fine-Tuning	Jun 12, 2025	GPU, Video Generation	Unverified
GiVE: Guiding Visual Encoder to Perceive Overlooked Information	Oct 26, 2024	Object, Question Answering	Unverified
GlocalNet: Class-aware Long-term Human Motion Synthesis	Dec 19, 2020	Motion Synthesis, Pedestrian Trajectory Prediction	Unverified
Goal-Conditioned Video Prediction	Sep 25, 2019	Imitation Learning, Prediction	Unverified
GPT4Motion: Scripting Physical Motions in Text-to-Video Generation via Blender-Oriented GPT Planning	Nov 21, 2023	Image Generation, Text-to-Video Generation	Unverified
GPT4Video: A Unified Multimodal Large Language Model for Instruction-Followed Understanding and Safety-Aware Generation	Nov 25, 2023	Instruction Following, Language Modeling	Unverified
GR-2: A Generative Video-Language-Action Model with Web-Scale Knowledge for Robot Manipulation	Oct 8, 2024	Multi-Task Learning, Robot Manipulation	Unverified
Grid Diffusion Models for Text-to-Video Generation	Mar 30, 2024	GPU, Image Generation	Unverified
GS-DiT: Advancing Video Generation with Dynamic 3D Gaussian Fields through Efficient Dense 3D Point Tracking	Jan 1, 2025	Novel View Synthesis, Point Tracking	Unverified
GS-DiT: Advancing Video Generation with Pseudo 4D Gaussian Fields through Efficient Dense 3D Point Tracking	Jan 5, 2025	Novel View Synthesis, Point Tracking	Unverified

Datasets: UCF-101; BAIR Robot Pushing; Sky Time-lapse; UCF-101 16 frames, 64x64, Unconditional; UCF-101 16 frames, Unconditional, Single GPU; LAION-400M; Taichi; UCF-101 16 frames, 128x128, Unconditional; Kinetics-600 12 frames, 64x64; How2Sign; Kinetics-600 12 frames, 128x128; Kinetics-600 48 frames, 64x64

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	MCVD	FVD16	2,460	—	Unverified
2	VDM	FVD16	1,396	—	Unverified
3	TGAN-v2 (128x128)	FVD16	1,209	—	Unverified
4	MCVD (64x64)	FVD16	1,143	—	Unverified
5	MoCoGAN-HD (256x256, unconditional)	FVD16	700	—	Unverified
6	MagicVideo (256x256, text-conditional)	FVD16	699	—	Unverified
7	TATS (256x256)	FVD16	635	—	Unverified
8	FIFO-Diffusion	FVD128	596.64	—	Unverified
9	DIGAN (128x128, unconditional)	FVD16	577	—	Unverified
10	LVDM (256x256, unconditional)	FVD16	552	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MoCoGAN	FVD score	503	—	Unverified
2	Baseline (from LVT)	FVD score	320.9	—	Unverified
3	SVG-FP (from FVD)	FVD score	315.5	—	Unverified
4	CDNA (from FVD)	FVD score	296.5	—	Unverified
5	SV2P (from FVD)	FVD score	262.5	—	Unverified
6	SVG-LP (from vRNN)	FVD score	256.62	—	Unverified
7	WAM	FVD score	159.6	—	Unverified
8	VRNN 1L	FVD score	149.22	—	Unverified
9	SAVP (from vRNN)	FVD score	143.43	—	Unverified
10	Hier-VRNN	FVD score	143.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MoCoGAN-HD (128x128)	FVD 16	183.6	—	Unverified
2	TATS (128x128)	FVD 16	132.6	—	Unverified
3	Long-video GAN (256x256)	FVD 16	116.5	—	Unverified
4	DIGAN (128x128)	FVD 16	114.6	—	Unverified
5	Long-video GAN (128x128)	FVD 16	107.5	—	Unverified
6	LVDM (256x256)	FVD 16	95.2	—	Unverified
7	DDMI	FVD 16	66.25	—	Unverified
8	Latte + LeanVAE	FVD 16	49.59	—	Unverified
9	StyleSV (256x256)	FVD 16	49	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Video Diffusion Model	Inception Score	57	—	Unverified
2	TGAN-ODE	Inception Score	15.2	—	Unverified
3	TGAN-F	Inception Score	13.62	—	Unverified
4	MoCoGAN	Inception Score	12.42	—	Unverified
5	MoCoGAN-MDP	Inception Score	11.86	—	Unverified
6	TGAN-SVC	Inception Score	11.85	—	Unverified
7	VGAN	Inception Score	8.18	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TGAN-F	Inception Score	22.91	—	Unverified
2	TGANv2	Inception Score	21.45	—	Unverified
3	TGANv2-ODE	Inception Score	21.02	—	Unverified
4	MoCoGAN	Inception Score	12.42	—	Unverified
5	MoCoGAN-MDP	Inception Score	11.86	—	Unverified
6	TGAN-SVC	Inception Score	11.85	—	Unverified
7	VGAN	Inception Score	8.18	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Imagen original (constant=6)	CLIP R-Precision	92.12	—	Unverified
2	Imagen fully distilled (oscillate (15,1))	CLIP R-Precision	90.97	—	Unverified
3	Imagen distilled (constant=6)	CLIP R-Precision	90.88	—	Unverified
4	Imagen original (oscillate(15,1))	CLIP R-Precision	89.91	—	Unverified
5	Imagen fully distilled (constant=6)	CLIP R-Precision	89.68	—	Unverified
6	Imagen distilled (oscillate (15,1))	CLIP R-Precision	88.78	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	DIGAN (256x256)	FVD16	156.7	—	Unverified
2	MoCoGAN-HD (128x128)	FVD16	144.7	—	Unverified
3	DIGAN (128x128)	FVD16	128.1	—	Unverified
4	LVDM (256x256)	FVD16	99	—	Unverified
5	TATS (128x128)	FVD16	94.6	—	Unverified
6	StyleSV (256x256)	FVD16	82.6	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TGANv2 (2020)	Inception Score	28.87	—	Unverified
2	DVD-GAN	Inception Score	27.38	—	Unverified
3	VideoGPT	Inception Score	24.69	—	Unverified
4	TGANv2	Inception Score	24.34	—	Unverified
5	TGAN-F	Inception Score	22.91	—	Unverified
6	TGANv2-ODE	Inception Score	21.02	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	DVD-GAN	FVD	31.1	—	Unverified
2	MAGVIT	FVD	9.9	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	INR-V	FVD16	144	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	DVD-GAN	FID	2.16	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	DVD-GAN	FID	12.92	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	DiT-XL/2 + CVAE-FT-SE	FID	8.59	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	VideoAssembler (Zero-Shot, 256x256, class-conditional)	FVD16	252	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	PG-SWGAN-3D	FID	404.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	StyleSV	FVD16	207.2	—	Unverified