SOTAVerified

Video Generation

( Various Video Generation Tasks. Gif credit: MaGViT )

Papers

Showing 11511200 of 1466 papers

TitleStatusHype
Exposing AI-generated Videos: A Benchmark Dataset and a Local-and-Global Temporal Defect Based Detection Method0
Audio-Visual Speech Representation Expert for Enhanced Talking Face Video Generation and Evaluation0
Vidu: a Highly Consistent, Dynamic and Skilled Text-to-Video Generator with Diffusion Models0
Matten: Video Generation with Mamba-Attention0
Bridge to Non-Barrier Communication: Gloss-Prompted Fine-grained Cued Speech Gesture Generation with Diffusion Model0
Synthesizing Audio from Silent Video using Sequence to Sequence ModelingCode0
MotionMaster: Training-free Camera Motion Transfer For Video Generation0
Accelerating Image Generation with Sub-path Linear Approximation Model0
Motion-aware Latent Diffusion Models for Video Frame Interpolation0
Music Consistency Models0
PhysDreamer: Physics-Based Interaction with 3D Objects via Video Generation0
AniClipart: Clipart Animation with Text-to-Video Priors0
SparseDM: Toward Sparse Efficient Diffusion Models0
Ctrl-Adapter: An Efficient and Versatile Framework for Adapting Diverse Controls to Any Diffusion Model0
LoopAnimate: Loopable Salient Object Animation0
Action-conditioned video data improves predictability0
AnimateZoo: Zero-shot Video Generation of Cross-Species Animation via Subject Alignment0
Grid Diffusion Models for Text-to-Video Generation0
A Review of Multi-Modal Large Language and Vision Models0
Frame by Familiar Frame: Understanding Replication in Video Diffusion Models0
Annotated Biomedical Video Generation using Denoising Diffusion Probabilistic Models and Flow FieldsCode0
TC4D: Trajectory-Conditioned Text-to-4D Generation0
Tutorial on Diffusion Models for Imaging and Vision0
A Survey on Long Video Generation: Challenges, Methods, and Prospects0
TRIP: Temporal Residual Learning with Image Noise Prior for Image-to-Video Diffusion Models0
Opportunities and challenges in the application of large artificial intelligence models in radiology0
Spectral Motion Alignment for Video Motion Transfer using Diffusion Models0
Explorative Inbetweening of Time and Space0
Efficient Video Diffusion Models via Content-Frame Motion-Latent Decomposition0
Enabling Visual Composition and Animation in Unsupervised Video Generation0
S2DM: Sector-Shaped Diffusion Models for Video Generation0
AnimateDiff-Lightning: Cross-Model Diffusion Distillation0
Endora: Video Generation Models as Endoscopy Simulators0
Animate Your Motion: Turning Still Images into Dynamic Videos0
Video Editing via Factorized Diffusion Distillation0
Intention-driven Ego-to-Exo Video Generation0
VLOGGER: Multimodal Diffusion for Embodied Avatar Synthesis0
AesopAgent: Agent-driven Evolutionary System on Story-to-Video Production0
Video Generation with Consistency Tuning0
BlazeBVD: Make Scale-Time Equalization Great Again for Blind Video Deflickering0
WorldGPT: A Sora-Inspired Video AI Agent as Rich World Models from Text and Image Inputs0
FastVideoEdit: Leveraging Consistency Models for Efficient Text-to-Video Editing0
Sora as an AGI World Model? A Complete Survey on Text-to-Video Generation0
A spatiotemporal style transfer algorithm for dynamic visual stimulus generation0
Tuning-Free Noise Rectification for High Fidelity Image-to-Video Generation0
AtomoVideo: High Fidelity Image-to-Video Generation0
Abductive Ego-View Accident Video Understanding for Safe Driving Perception0
Context-aware Talking Face Video Generation0
Video as the New Language for Real-World Decision Making0
EMO: Emote Portrait Alive -- Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions0
Show:102550
← PrevPage 24 of 30Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1MCVDFVD162,460Unverified
2VDMFVD161,396Unverified
3TGAN-v2 (128x128)FVD161,209Unverified
4MCVD (64x64)FVD161,143Unverified
5MoCoGAN-HD (256x256, unconditional)FVD16700Unverified
6MagicVideo (256x256, text-conditional)FVD16699Unverified
7TATS (256x256)FVD16635Unverified
8FIFO-DiffusionFVD128596.64Unverified
9DIGAN (128x128, unconditional)FVD16577Unverified
10LVDM (256x256, unconditional)FVD16552Unverified
#ModelMetricClaimedVerifiedStatus
1MoCoGANFVD score503Unverified
2Baseline (from LVT)FVD score320.9Unverified
3SVG-FP (from FVD)FVD score315.5Unverified
4CDNA (from FVD)FVD score296.5Unverified
5SV2P (from FVD)FVD score262.5Unverified
6SVG-LP (from vRNN)FVD score256.62Unverified
7WAMFVD score159.6Unverified
8VRNN 1LFVD score149.22Unverified
9SAVP (from vRNN)FVD score143.43Unverified
10Hier-VRNNFVD score143.4Unverified
#ModelMetricClaimedVerifiedStatus
1MoCoGAN-HD (128x128)FVD 16183.6Unverified
2TATS (128x128)FVD 16132.6Unverified
3Long-video GAN (256x256)FVD 16116.5Unverified
4DIGAN (128x128)FVD 16114.6Unverified
5Long-video GAN (128x128)FVD 16107.5Unverified
6LVDM (256x256)FVD 1695.2Unverified
7DDMIFVD 1666.25Unverified
8Latte + LeanVAEFVD 1649.59Unverified
9StyleSV (256x256)FVD 1649Unverified
#ModelMetricClaimedVerifiedStatus
1Video Diffusion ModelInception Score57Unverified
2TGAN-ODEInception Score15.2Unverified
3TGAN-FInception Score13.62Unverified
4MoCoGANInception Score12.42Unverified
5MoCoGAN-MDPInception Score11.86Unverified
6TGAN-SVCInception Score11.85Unverified
7VGANInception Score8.18Unverified
#ModelMetricClaimedVerifiedStatus
1TGAN-FInception Score22.91Unverified
2TGANv2Inception Score21.45Unverified
3TGANv2-ODEInception Score21.02Unverified
4MoCoGANInception Score12.42Unverified
5MoCoGAN-MDPInception Score11.86Unverified
6TGAN-SVCInception Score11.85Unverified
7VGANInception Score8.18Unverified
#ModelMetricClaimedVerifiedStatus
1Imagen original (constant=6)CLIP R-Precision92.12Unverified
2Imagen fully distilled (oscillate (15,1))CLIP R-Precision90.97Unverified
3Imagen distilled (constant=6)CLIP R-Precision90.88Unverified
4Imagen original (oscillate(15,1))CLIP R-Precision89.91Unverified
5Imagen fully distilled (constant=6)CLIP R-Precision89.68Unverified
6Imagen distilled (oscillate (15,1))CLIP R-Precision88.78Unverified
#ModelMetricClaimedVerifiedStatus
1DIGAN (256x256)FVD16156.7Unverified
2MoCoGAN-HD (128x128)FVD16144.7Unverified
3DIGAN (128x128)FVD16128.1Unverified
4LVDM (256x256)FVD1699Unverified
5TATS (128x128)FVD1694.6Unverified
6StyleSV (256x256)FVD1682.6Unverified
#ModelMetricClaimedVerifiedStatus
1TGANv2 (2020)Inception Score28.87Unverified
2DVD-GANInception Score27.38Unverified
3VideoGPTInception Score24.69Unverified
4TGANv2Inception Score24.34Unverified
5TGAN-FInception Score22.91Unverified
6TGANv2-ODEInception Score21.02Unverified
#ModelMetricClaimedVerifiedStatus
1DVD-GANFVD31.1Unverified
2MAGVITFVD9.9Unverified
#ModelMetricClaimedVerifiedStatus
1INR-VFVD16144Unverified
#ModelMetricClaimedVerifiedStatus
1DVD-GANFID2.16Unverified
#ModelMetricClaimedVerifiedStatus
1DVD-GANFID12.92Unverified
#ModelMetricClaimedVerifiedStatus
1DiT-XL/2 + CVAE-FT-SEFID8.59Unverified
#ModelMetricClaimedVerifiedStatus
1VideoAssembler (Zero-Shot, 256x256, class-conditional)FVD16252Unverified
#ModelMetricClaimedVerifiedStatus
1PG-SWGAN-3DFID404.1Unverified
#ModelMetricClaimedVerifiedStatus
1StyleSVFVD16207.2Unverified