SOTAVerified

Video Generation

(Various video generation tasks. GIF credit: MAGVIT)

Papers

Showing 1351–1400 of 1466 papers

| Title | Status | Hype |
|---|---|---|
| Xp-GAN: Unsupervised Multi-object Controllable Video Generation |  | 0 |
| 3-D PET Image Generation with tumour masks using TGAN | Code | 0 |
| Image Comes Dancing with Collaborative Parsing-Flow Video Synthesis | Code | 0 |
| ViDA-MAN: Visual Dialog with Digital Humans |  | 0 |
| Intelligent Video Editing: Incorporating Modern Talking Face Generation Algorithms in a Video Editor |  | 0 |
| Towards Using Clothes Style Transfer for Scenario-aware Person Video Generation | Code | 0 |
| A Hierarchical Variational Neural Uncertainty Model for Stochastic Video Prediction |  | 0 |
| Video Autoencoder: self-supervised disentanglement of static 3D structure and motion |  | 0 |
| A review of Generative Adversarial Networks (GANs) and its applications in a wide variety of disciplines -- From Medical to Remote Sensing |  | 0 |
| Towards Generative Latent Variable Models for Speech |  | 0 |
| Conditional MoCoGAN for Zero-Shot Video Generation |  | 0 |
| Simple Video Generation using Neural ODEs |  | 0 |
| iButter: Neural Interactive Bullet Time Generator for Human Free-viewpoint Rendering |  | 0 |
| RockGPT: Reconstructing three-dimensional digital rocks from single two-dimensional slice from the perspective of video generation |  | 0 |
| Video Generation from Text Employing Latent Path Construction for Temporal Modeling |  | 0 |
| Generative Video Transformer: Can Objects be the Words? |  | 0 |
| Speech2Video: Cross-Modal Distillation for Speech to Video Generation |  | 0 |
| Cross-View Exocentric to Egocentric Video Synthesis |  | 0 |
| Hierarchical Video Prediction Using Relational Layouts for Human-Object Interactions |  | 0 |
| NWT: Towards natural audio-to-video generation with representation learning | Code | 0 |
| Efficient training for future video generation based on hierarchical disentangled representation of latent variables |  | 0 |
| Hierarchical Video Generation for Complex Data |  | 0 |
| Image-to-Video Generation via 3D Facial Dynamics |  | 0 |
| Adaptive Appearance Rendering | Code | 0 |
| Learning Long-Term Style-Preserving Blind Video Temporal Consistency |  | 0 |
| Dual-MTGAN: Stochastic and Deterministic Motion Transfer for Image-to-Video Synthesis |  | 0 |
| One Shot Audio to Animated Video Generation |  | 0 |
| Disentangled Recurrent Wasserstein Autoencoder |  | 0 |
| ArrowGAN : Learning to Generate Videos by Learning Arrow of Time |  | 0 |
| InMoDeGAN: Interpretable Motion Decomposition Generative Adversarial Network for Video Generation |  | 0 |
| VideoGen: Generative Modeling of Videos using VQ-VAE and Transformers |  | 0 |
| Contrastive Video Textures |  | 0 |
| Learning to Generate Videos Using Neural Uncertainty Priors |  | 0 |
| Can Everybody Sign Now? Exploring Sign Language Video Generation from 2D Poses |  | 0 |
| GlocalNet: Class-aware Long-term Human Motion Synthesis |  | 0 |
| Infinite Nature: Perpetual View Generation of Natural Scenes from a Single Image | Code | 0 |
| Multi Modal Adaptive Normalization for Audio to Video Generation |  | 0 |
| Robust One Shot Audio to Video Generation |  | 0 |
| Modular Action Concept Grounding in Semantic Video Prediction |  | 0 |
| Stochastic Talking Face Generation Using Latent Distribution Matching | Code | 0 |
| Everybody Sign Now: Translating Spoken Language to Photo Realistic Sign Language Video |  | 0 |
| Lets Play Music: Audio-driven Performance Video Generation |  | 0 |
| LIFI: Towards Linguistically Informed Frame Interpolation | Code | 0 |
| EEG to fMRI Synthesis: Is Deep Learning a candidate? |  | 0 |
| Action Concept Grounding Network for Semantically-Consistent Video Generation |  | 0 |
| TiVGAN: Text to Image to Video Generation with Step-by-Step Evolutionary Generator |  | 0 |
| Pose-Guided High-Resolution Appearance Transfer via Progressive Training |  | 0 |
| How Do the Hearts of Deep Fakes Beat? Deep Fake Source Detection via Interpreting Residuals with Biological Signals |  | 0 |
| HRVGAN: High Resolution Video Generation using Spatio-Temporal GAN |  | 0 |
| Recurrent Deconvolutional Generative Adversarial Networks with Application to Text Guided Video Generation |  | 0 |

Benchmark Results

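| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | MCVD | FVD16 | 2,460 |  | Unverified |
| 2 | VDM | FVD16 | 1,396 |  | Unverified |
| 3 | TGAN-v2 (128x128) | FVD16 | 1,209 |  | Unverified |
| 4 | MCVD (64x64) | FVD16 | 1,143 |  | Unverified |
| 5 | MoCoGAN-HD (256x256, unconditional) | FVD16 | 700 |  | Unverified |
| 6 | MagicVideo (256x256, text-conditional) | FVD16 | 699 |  | Unverified |
| 7 | TATS (256x256) | FVD16 | 635 |  | Unverified |
| 8 | FIFO-Diffusion | FVD128 | 596.64 |  | Unverified |
| 9 | DIGAN (128x128, unconditional) | FVD16 | 577 |  | Unverified |
| 10 | LVDM (256x256, unconditional) | FVD16 | 552 |  | Unverified |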
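
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | MoCoGAN | FVD score | 503 |  | Unverified |
| 2 | Baseline (from LVT) | FVD score | 320.9 |  | Unverified |
| 3 | SVG-FP (from FVD) | FVD score | 315.5 |  | Unverified |
| 4 | CDNA (from FVD) | FVD score | 296.5 |  | Unverified |
| 5 | SV2P (from FVD) | FVD score | 262.5 |  | Unverified |
| 6 | SVG-LP (from vRNN) | FVD score | 256.62 |  | Unverified |
| 7 | WAM | FVD score | 159.6 |  | Unverified |
| 8 | VRNN 1L | FVD score | 149.22 |  | Unverified |
| 9 | SAVP (from vRNN) | FVD score | 143.43 |  | Unverified |
| 10 | Hier-VRNN | FVD score | 143.4 |  | Unverified |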
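
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | MoCoGAN-HD (128x128) | FVD 16 | 183.6 |  | Unverified |
| 2 | TATS (128x128) | FVD 16 | 132.6 |  | Unverified |
| 3 | Long-video GAN (256x256) | FVD 16 | 116.5 |  | Unverified |
| 4 | DIGAN (128x128) | FVD 16 | 114.6 |  | Unverified |
| 5 | Long-video GAN (128x128) | FVD 16 | 107.5 |  | Unverified |
| 6 | LVDM (256x256) | FVD 16 | 95.2 |  | Unverified |
| 7 | DDMI | FVD 16 | 66.25 |  | Unverified |
| 8 | Latte + LeanVAE | FVD 16 | 49.59 |  | Unverified |
| 9 | StyleSV (256x256) | FVD 16 | 49 |  | Unverified |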
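
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Video Diffusion Model | Inception Score | 57 |  | Unverified |
| 2 | TGAN-ODE | Inception Score | 15.2 |  | Unverified |
| 3 | TGAN-F | Inception Score | 13.62 |  | Unverified |
| 4 | MoCoGAN | Inception Score | 12.42 |  | Unverified |
| 5 | MoCoGAN-MDP | Inception Score | 11.86 |  | Unverified |
| 6 | TGAN-SVC | Inception Score | 11.85 |  | Unverified |
| 7 | VGAN | Inception Score | 8.18 |  | Unverified |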
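
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | TGAN-F | Inception Score | 22.91 |  | Unverified |
| 2 | TGANv2 | Inception Score | 21.45 |  | Unverified |
| 3 | TGANv2-ODE | Inception Score | 21.02 |  | Unverified |
| 4 | MoCoGAN | Inception Score | 12.42 |  | Unverified |
| 5 | MoCoGAN-MDP | Inception Score | 11.86 |  | Unverified |
| 6 | TGAN-SVC | Inception Score | 11.85 |  | Unverified |
| 7 | VGAN | Inception Score | 8.18 |  | Unverified |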

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Imagen original (constant=6) | CLIP R-Precision | 92.12 |  | Unverified |
| 2 | Imagen fully distilled (oscillate (15,1)) | CLIP R-Precision | 90.97 |  | Unverified |
| 3 | Imagen distilled (constant=6) | CLIP R-Precision | 90.88 |  | Unverified |
| 4 | Imagen original (oscillate(15,1)) | CLIP R-Precision | 89.91 |  | Unverified |
| 5 | Imagen fully distilled (constant=6) | CLIP R-Precision | 89.68 |  | Unverified |
| 6 | Imagen distilled (oscillate (15,1)) | CLIP R-Precision | 88.78 |  | Unverified |
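
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | DIGAN (256x256) | FVD16 | 156.7 |  | Unverified |
| 2 | MoCoGAN-HD (128x128) | FVD16 | 144.7 |  | Unverified |
| 3 | DIGAN (128x128) | FVD16 | 128.1 |  | Unverified |
| 4 | LVDM (256x256) | FVD16 | 99 |  | Unverified |
| 5 | TATS (128x128) | FVD16 | 94.6 |  | Unverified |
| 6 | StyleSV (256x256) | FVD16 | 82.6 |  | Unverified |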
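
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | TGANv2 (2020) | Inception Score | 28.87 |  | Unverified |
| 2 | DVD-GAN | Inception Score | 27.38 |  | Unverified |
| 3 | VideoGPT | Inception Score | 24.69 |  | Unverified |
| 4 | TGANv2 | Inception Score | 24.34 |  | Unverified |
| 5 | TGAN-F | Inception Score | 22.91 |  | Unverified |
| 6 | TGANv2-ODE | Inception Score | 21.02 |  | Unverified |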

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | DVD-GAN | FVD | 31.1 |  | Unverified |
| 2 | MAGVIT | FVD | 9.9 |  | Unverified |
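
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | INR-V | FVD16 | 144 |  | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | DVD-GAN | FID | 2.16 |  | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | DVD-GAN | FID | 12.92 |  | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | DiT-XL/2 + CVAE-FT-SE | FID | 8.59 |  | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | VideoAssembler (Zero-Shot, 256x256, class-conditional) | FVD16 | 252 |  | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | PG-SWGAN-3D | FID | 404.1 |  | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | StyleSV | FVD16 | 207.2 |  | Unverified |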