Video Generation

(Various video generation tasks. GIF credit: MaGViT)

Papers

Showing 1351–1400 of 1466 papers

Title	Date	Tasks	Status
MotionCanvas: Cinematic Shot Design with Controllable Image-to-Video Generation	Feb 6, 2025	Image to Video Generation, Video Editing	Unverified
MotionCharacter: Identity-Preserving and Motion Controllable Human Video Generation	Nov 27, 2024	Attribute, Video Generation	Unverified
Motion Control for Enhanced Complex Action Video Generation	Nov 13, 2024	Motion Generation, Video Generation	Unverified
Motion-I2V: Consistent and Controllable Image-to-Video Generation with Explicit Motion Modeling	Jan 29, 2024	Image to Video Generation, Video Generation	Unverified
MotionMaster: Training-free Camera Motion Transfer For Video Generation	Apr 24, 2024	Disentanglement, Motion Disentanglement	Unverified
Motion Modes: What Could Happen Next?	Nov 29, 2024	Diversity, Object	Unverified
MotionPro: A Precise Motion Controller for Image-to-Video Generation	May 26, 2025	Denoising, Image to Video Generation	Unverified
Motion Prompting: Controlling Video Generation with Motion Trajectories	Dec 3, 2024	Video Generation	Unverified
MotionStone: Decoupled Motion Intensity Modulation with Diffusion Transformer for Image-to-Video Generation	Dec 8, 2024	Contrastive Learning, Image to Video Generation	Unverified
MotionZero:Exploiting Motion Priors for Zero-shot Text-to-Video Generation	Nov 28, 2023	Disentanglement, Text-to-Video Generation	Unverified
Motion-Zero: Zero-Shot Moving Object Control Framework for Diffusion-Based Video Generation	Jan 18, 2024	Denoising, Position	Unverified
MoTrans: Customized Motion Transfer with Text-driven Video Diffusion Models	Dec 2, 2024	Language Modeling, Language Modelling	Unverified
MoVideo: Motion-Aware Video Generation with Diffusion Models	Nov 19, 2023	Image Generation, Image to Video Generation	Unverified
MovieDreamer: Hierarchical Generation for Coherent Long Visual Sequence	Jul 23, 2024	Video Generation	Unverified
Movie Gen: SWOT Analysis of Meta's Generative AI Foundation Model for Transforming Media Generation, Advertising, and Entertainment Industries	Dec 5, 2024	Video Generation	Unverified
MOVi: Training-free Text-conditioned Multi-Object Video Generation	May 29, 2025	Object, Video Generation	Unverified
MSC: Multi-Scale Spatio-Temporal Causal Attention for Autoregressive Video Diffusion	Dec 13, 2024	Video Generation	Unverified
MSG score: A Comprehensive Evaluation for Multi-Scene Video Generation	Nov 28, 2024	Video Generation	Unverified
MEVG: Multi-event Video Generation with Text-to-Video Models	Dec 7, 2023	Video Generation	Unverified
Multi-Frame Content Integration with a Spatio-Temporal Attention Mechanism for Person Video Motion Transfer	Aug 12, 2019	Video Generation	Unverified
Multi Modal Adaptive Normalization for Audio to Video Generation	Dec 14, 2020	Optical Flow Estimation, SSIM	Unverified
Multi-Modal Generative AI: Multi-modal LLM, Diffusion and Beyond	Sep 23, 2024	Language Modelling, Large Language Model	Unverified
Multi-object Video Generation from Single Frame Layouts	May 6, 2023	Image Generation, Object	Unverified
Multi-sentence Video Grounding for Long Video Generation	Jul 18, 2024	Moment Retrieval, Retrieval	Unverified
Multi-Shot Character Consistency for Text-to-Video Generation	Dec 10, 2024	Text-to-Video Generation, Video Generation	Unverified
Multi-subject Open-set Personalization in Video Generation	Jan 10, 2025	Video Generation	Unverified
Music Consistency Models	Apr 20, 2024	Computational Efficiency, Music Generation	Unverified
MusicInfuser: Making Video Diffusion Listen and Dance	Mar 18, 2025	Video Generation	Unverified
MVPortrait: Text-Guided Motion and Emotion Control for Multi-view Vivid Portrait Animation	Jan 1, 2025	Portrait Animation, Video Generation	Unverified
MyGo: Consistent and Controllable Multi-View Driving Video Generation with Camera Control	Sep 10, 2024	Autonomous Driving, Video Generation	Unverified
Neural Cell Video Synthesis via Optical-Flow Diffusion	Dec 6, 2022	Cultural Vocal Bursts Intensity Prediction, Denoising	Unverified
Neural Rendering and Its Hardware Acceleration: A Review	Jan 6, 2024	3D Reconstruction, Deep Learning	Unverified
NeuroCine: Decoding Vivid Video Sequences from Human Brain Activties	Feb 2, 2024	Contrastive Learning, SSIM	Unverified
Next Block Prediction: Video Generation via Semi-Auto-Regressive Modeling	Feb 11, 2025	Video Generation	Unverified
NLDF: Neural Light Dynamic Fields for Efficient 3D Talking Head Generation	Jun 17, 2024	Knowledge Distillation, NeRF	Unverified
Noise Crystallization and Liquid Noise: Zero-shot Video Generation using Image Diffusion Models	Oct 5, 2024	Image Generation, Style Transfer	Unverified
NUWA-XL: Diffusion over Diffusion for eXtremely Long Video Generation	Mar 22, 2023	Video Generation	Unverified
Object-Centric World Model for Language-Guided Manipulation	Mar 8, 2025	Autonomous Driving, model	Unverified
ObjectMover: Generative Object Movement with Video Prior	Mar 11, 2025	Multi-Task Learning, Object	Unverified
OD-VAE: An Omni-dimensional Video Compressor for Improving Latent Video Diffusion Model	Sep 2, 2024	GPU, Video Generation	Unverified
OmniAvatar: Efficient Audio-Driven Avatar Video Generation with Adaptive Body Animation	Jun 23, 2025	Human Animation, Video Generation	Unverified
OmniCam: Unified Multimodal Video Generation via Camera Control	Apr 3, 2025	Video Generation	Unverified
OmniCreator: Self-Supervised Unified Generation with Universal Editing	Dec 3, 2024	Denoising, Semantic correspondence	Unverified
OmniDrag: Enabling Motion Control for Omnidirectional Image-to-Video Generation	Dec 12, 2024	Image to Video Generation, Video Generation	Unverified
OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models	Feb 3, 2025	Human Animation, Human-Object Interaction Detection	Unverified
OmniVDiff: Omni Controllable Video Diffusion for Generation and Understanding	Apr 15, 2025	Semantic Segmentation, Video Generation	Unverified
One-Minute Video Generation with Test-Time Training	Apr 7, 2025	Mamba, Video Generation	Unverified
One Shot Audio to Animated Video Generation	Feb 19, 2021	Video Generation	Unverified
One-Shot Face Video Re-enactment using Hybrid Latent Spaces of StyleGAN2	Feb 15, 2023	Attribute, Disentanglement	Unverified
One-Shot Learning Meets Depth Diffusion in Multi-Object Videos	Aug 29, 2024	One-Shot Learning, Video Generation	Unverified

Datasets: UCF-101; BAIR Robot Pushing; Sky Time-lapse; UCF-101 16 frames, 64x64, Unconditional; UCF-101 16 frames, Unconditional, Single GPU; LAION-400M; Taichi; UCF-101 16 frames, 128x128, Unconditional; Kinetics-600 12 frames, 64x64; How2Sign; Kinetics-600 12 frames, 128x128; Kinetics-600 48 frames, 64x64

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	MCVD	FVD16	2,460	—	Unverified
2	VDM	FVD16	1,396	—	Unverified
3	TGAN-v2 (128x128)	FVD16	1,209	—	Unverified
4	MCVD (64x64)	FVD16	1,143	—	Unverified
5	MoCoGAN-HD (256x256, unconditional)	FVD16	700	—	Unverified
6	MagicVideo (256x256, text-conditional)	FVD16	699	—	Unverified
7	TATS (256x256)	FVD16	635	—	Unverified
8	FIFO-Diffusion	FVD128	596.64	—	Unverified
9	DIGAN (128x128, unconditional)	FVD16	577	—	Unverified
10	LVDM (256x256, unconditional)	FVD16	552	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MoCoGAN	FVD score	503	—	Unverified
2	Baseline (from LVT)	FVD score	320.9	—	Unverified
3	SVG-FP (from FVD)	FVD score	315.5	—	Unverified
4	CDNA (from FVD)	FVD score	296.5	—	Unverified
5	SV2P (from FVD)	FVD score	262.5	—	Unverified
6	SVG-LP (from vRNN)	FVD score	256.62	—	Unverified
7	WAM	FVD score	159.6	—	Unverified
8	VRNN 1L	FVD score	149.22	—	Unverified
9	SAVP (from vRNN)	FVD score	143.43	—	Unverified
10	Hier-VRNN	FVD score	143.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MoCoGAN-HD (128x128)	FVD 16	183.6	—	Unverified
2	TATS (128x128)	FVD 16	132.6	—	Unverified
3	Long-video GAN (256x256)	FVD 16	116.5	—	Unverified
4	DIGAN (128x128)	FVD 16	114.6	—	Unverified
5	Long-video GAN (128x128)	FVD 16	107.5	—	Unverified
6	LVDM (256x256)	FVD 16	95.2	—	Unverified
7	DDMI	FVD 16	66.25	—	Unverified
8	Latte + LeanVAE	FVD 16	49.59	—	Unverified
9	StyleSV (256x256)	FVD 16	49	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Video Diffusion Model	Inception Score	57	—	Unverified
2	TGAN-ODE	Inception Score	15.2	—	Unverified
3	TGAN-F	Inception Score	13.62	—	Unverified
4	MoCoGAN	Inception Score	12.42	—	Unverified
5	MoCoGAN-MDP	Inception Score	11.86	—	Unverified
6	TGAN-SVC	Inception Score	11.85	—	Unverified
7	VGAN	Inception Score	8.18	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TGAN-F	Inception Score	22.91	—	Unverified
2	TGANv2	Inception Score	21.45	—	Unverified
3	TGANv2-ODE	Inception Score	21.02	—	Unverified
4	MoCoGAN	Inception Score	12.42	—	Unverified
5	MoCoGAN-MDP	Inception Score	11.86	—	Unverified
6	TGAN-SVC	Inception Score	11.85	—	Unverified
7	VGAN	Inception Score	8.18	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Imagen original (constant=6)	CLIP R-Precision	92.12	—	Unverified
2	Imagen fully distilled (oscillate (15,1))	CLIP R-Precision	90.97	—	Unverified
3	Imagen distilled (constant=6)	CLIP R-Precision	90.88	—	Unverified
4	Imagen original (oscillate(15,1))	CLIP R-Precision	89.91	—	Unverified
5	Imagen fully distilled (constant=6)	CLIP R-Precision	89.68	—	Unverified
6	Imagen distilled (oscillate (15,1))	CLIP R-Precision	88.78	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	DIGAN (256x256)	FVD16	156.7	—	Unverified
2	MoCoGAN-HD (128x128)	FVD16	144.7	—	Unverified
3	DIGAN (128x128)	FVD16	128.1	—	Unverified
4	LVDM (256x256)	FVD16	99	—	Unverified
5	TATS (128x128)	FVD16	94.6	—	Unverified
6	StyleSV (256x256)	FVD16	82.6	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TGANv2 (2020)	Inception Score	28.87	—	Unverified
2	DVD-GAN	Inception Score	27.38	—	Unverified
3	VideoGPT	Inception Score	24.69	—	Unverified
4	TGANv2	Inception Score	24.34	—	Unverified
5	TGAN-F	Inception Score	22.91	—	Unverified
6	TGANv2-ODE	Inception Score	21.02	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	DVD-GAN	FVD	31.1	—	Unverified
2	MAGVIT	FVD	9.9	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	INR-V	FVD16	144	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	DVD-GAN	FID	2.16	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	DVD-GAN	FID	12.92	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	DiT-XL/2 + CVAE-FT-SE	FID	8.59	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	VideoAssembler (Zero-Shot, 256x256, class-conditional)	FVD16	252	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	PG-SWGAN-3D	FID	404.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	StyleSV	FVD16	207.2	—	Unverified