SOTAVerified

Video Generation

(Various Video Generation Tasks. Gif credit: MaGViT)

Papers

Showing 1351–1400 of 1466 papers

Title | Status | Hype
Transformation-based Adversarial Video Prediction on Large-Scale Data | | 0
Transformers for Learning on Noisy and Task-Level Manifolds: Approximation and Generalization Insights | | 0
Transframer: Arbitrary Frame Prediction with Generative Models | | 0
TR-DQ: Time-Rotation Diffusion Quantization | | 0
TRIP: Temporal Residual Learning with Image Noise Prior for Image-to-Video Diffusion Models | | 0
T-SVG: Text-Driven Stereoscopic Video Generation | | 0
Tuning-Free Long Video Generation via Global-Local Collaborative Diffusion | | 0
Tuning-Free Noise Rectification for High Fidelity Image-to-Video Generation | | 0
Turbo2K: Towards Ultra-Efficient and High-Quality 2K Video Synthesis | | 0
Tutorial on Diffusion Models for Imaging and Vision | | 0
TVG: A Training-free Transition Video Generation Method with Diffusion Models | | 0
UltraVideo: High-Quality UHD Video Dataset with Comprehensive Captions | | 0
Unconditional Priors Matter! Improving Conditional Generation of Fine-Tuned Diffusion Models | | 0
Understanding World or Predicting Future? A Comprehensive Survey of World Models | | 0
UniCP: A Unified Caching and Pruning Framework for Efficient Video Generation | | 0
Unified Dense Prediction of Video Diffusion | | 0
Unified Video Action Model | | 0
Time-Conditioned Generative Modeling of Object-Centric Representations for Video Decomposition and Prediction | Code | 0
Train Sparsely, Generate Densely: Memory-efficient Unsupervised Training of High-resolution Temporal GAN | Code | 0
Frame-wise Conditioning Adaptation for Fine-Tuning Diffusion Models in Text-to-Video Prediction | Code | 0
Talking Face Generation by Conditional Recurrent Adversarial Network | Code | 0
Causally Steered Diffusion for Automated Video Counterfactual Generation | Code | 0
Synthesizing Audio from Silent Video using Sequence to Sequence Modeling | Code | 0
FIND: Fine-tuning Initial Noise Distribution with Policy Optimization for Diffusion Models | Code | 0
Sync-DRAW: Automatic Video Generation using Deep Recurrent Attentive Architectures | Code | 0
G3AN: Disentangling Appearance and Motion for Video Generation | Code | 0
Magic 1-For-1: Generating One Minute Video Clips within One Minute | Code | 0
High Frame Rate Video Reconstruction based on an Event Camera | Code | 0
Lumos-1: On Autoregressive Video Generation from a Unified Model Perspective | Code | 0
StoryGAN: A Sequential Conditional GAN for Story Visualization | Code | 0
Towards Understanding Unsafe Video Generation | Code | 0
Towards Using Clothes Style Transfer for Scenario-aware Person Video Generation | Code | 0
Stochastic Video Generation with a Learned Prior | Code | 0
Lower Dimensional Kernels for Video Discriminators | Code | 0
Stochastic Talking Face Generation Using Latent Distribution Matching | Code | 0
LIFI: Towards Linguistically Informed Frame Interpolation | Code | 0
Stochastic Adversarial Video Prediction | Code | 0
Learning to navigate image manifolds induced by generative adversarial networks for unsupervised video generation | Code | 0
Learning to Forecast and Refine Residual Motion for Image-to-Video Generation | Code | 0
Stable Video Diffusion: Scaling Latent Video Diffusion Models to Large Datasets | Code | 0
InfLVG: Reinforce Inference-Time Consistent Long Video Generation with GRPO | Code | 0
Annotated Biomedical Video Generation using Denoising Diffusion Probabilistic Models and Flow Fields | Code | 0
Adaptive Appearance Rendering | Code | 0
Infinite Nature: Perpetual View Generation of Natural Scenes from a Single Image | Code | 0
Trans4D: Realistic Geometry-Aware Transition for Compositional Text-to-4D Synthesis | Code | 0
Factorized-Dreamer: Training A High-Quality Video Generator with Limited and Low-Quality Data | Code | 0
3-D PET Image Generation with tumour masks using TGAN | Code | 0
Improving Video Generation for Multi-functional Applications | Code | 0
Improved Conditional VRNNs for Video Prediction | Code | 0
Source Camera Verification from Strongly Stabilized Videos | Code | 0
Page 28 of 30

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | MCVD | FVD16 | 2,460 | | Unverified
2 | VDM | FVD16 | 1,396 | | Unverified
3 | TGAN-v2 (128x128) | FVD16 | 1,209 | | Unverified
4 | MCVD (64x64) | FVD16 | 1,143 | | Unverified
5 | MoCoGAN-HD (256x256, unconditional) | FVD16 | 700 | | Unverified
6 | MagicVideo (256x256, text-conditional) | FVD16 | 699 | | Unverified
7 | TATS (256x256) | FVD16 | 635 | | Unverified
8 | FIFO-Diffusion | FVD128 | 596.64 | | Unverified
9 | DIGAN (128x128, unconditional) | FVD16 | 577 | | Unverified
10 | LVDM (256x256, unconditional) | FVD16 | 552 | | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | MoCoGAN | FVD score | 503 | | Unverified
2 | Baseline (from LVT) | FVD score | 320.9 | | Unverified
3 | SVG-FP (from FVD) | FVD score | 315.5 | | Unverified
4 | CDNA (from FVD) | FVD score | 296.5 | | Unverified
5 | SV2P (from FVD) | FVD score | 262.5 | | Unverified
6 | SVG-LP (from vRNN) | FVD score | 256.62 | | Unverified
7 | WAM | FVD score | 159.6 | | Unverified
8 | VRNN 1L | FVD score | 149.22 | | Unverified
9 | SAVP (from vRNN) | FVD score | 143.43 | | Unverified
10 | Hier-VRNN | FVD score | 143.4 | | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | MoCoGAN-HD (128x128) | FVD 16 | 183.6 | | Unverified
2 | TATS (128x128) | FVD 16 | 132.6 | | Unverified
3 | Long-video GAN (256x256) | FVD 16 | 116.5 | | Unverified
4 | DIGAN (128x128) | FVD 16 | 114.6 | | Unverified
5 | Long-video GAN (128x128) | FVD 16 | 107.5 | | Unverified
6 | LVDM (256x256) | FVD 16 | 95.2 | | Unverified
7 | DDMI | FVD 16 | 66.25 | | Unverified
8 | Latte + LeanVAE | FVD 16 | 49.59 | | Unverified
9 | StyleSV (256x256) | FVD 16 | 49 | | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | Video Diffusion Model | Inception Score | 57 | | Unverified
2 | TGAN-ODE | Inception Score | 15.2 | | Unverified
3 | TGAN-F | Inception Score | 13.62 | | Unverified
4 | MoCoGAN | Inception Score | 12.42 | | Unverified
5 | MoCoGAN-MDP | Inception Score | 11.86 | | Unverified
6 | TGAN-SVC | Inception Score | 11.85 | | Unverified
7 | VGAN | Inception Score | 8.18 | | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | TGAN-F | Inception Score | 22.91 | | Unverified
2 | TGANv2 | Inception Score | 21.45 | | Unverified
3 | TGANv2-ODE | Inception Score | 21.02 | | Unverified
4 | MoCoGAN | Inception Score | 12.42 | | Unverified
5 | MoCoGAN-MDP | Inception Score | 11.86 | | Unverified
6 | TGAN-SVC | Inception Score | 11.85 | | Unverified
7 | VGAN | Inception Score | 8.18 | | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | Imagen original (constant=6) | CLIP R-Precision | 92.12 | | Unverified
2 | Imagen fully distilled (oscillate (15,1)) | CLIP R-Precision | 90.97 | | Unverified
3 | Imagen distilled (constant=6) | CLIP R-Precision | 90.88 | | Unverified
4 | Imagen original (oscillate(15,1)) | CLIP R-Precision | 89.91 | | Unverified
5 | Imagen fully distilled (constant=6) | CLIP R-Precision | 89.68 | | Unverified
6 | Imagen distilled (oscillate (15,1)) | CLIP R-Precision | 88.78 | | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | DIGAN (256x256) | FVD16 | 156.7 | | Unverified
2 | MoCoGAN-HD (128x128) | FVD16 | 144.7 | | Unverified
3 | DIGAN (128x128) | FVD16 | 128.1 | | Unverified
4 | LVDM (256x256) | FVD16 | 99 | | Unverified
5 | TATS (128x128) | FVD16 | 94.6 | | Unverified
6 | StyleSV (256x256) | FVD16 | 82.6 | | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | TGANv2 (2020) | Inception Score | 28.87 | | Unverified
2 | DVD-GAN | Inception Score | 27.38 | | Unverified
3 | VideoGPT | Inception Score | 24.69 | | Unverified
4 | TGANv2 | Inception Score | 24.34 | | Unverified
5 | TGAN-F | Inception Score | 22.91 | | Unverified
6 | TGANv2-ODE | Inception Score | 21.02 | | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | DVD-GAN | FVD | 31.1 | | Unverified
2 | MAGVIT | FVD | 9.9 | | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | INR-V | FVD16 | 144 | | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | DVD-GAN | FID | 2.16 | | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | DVD-GAN | FID | 12.92 | | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | DiT-XL/2 + CVAE-FT-SE | FID | 8.59 | | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | VideoAssembler (Zero-Shot, 256x256, class-conditional) | FVD16 | 252 | | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | PG-SWGAN-3D | FID | 404.1 | | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | StyleSV | FVD16 | 207.2 | | Unverified