SOTAVerified

Text-to-Video Generation

Ma grand-mère m’a raconté que quand elle était étudiante, elle avait un petit-ami. À l’âge de 18 ans, il a dû partir pour le service militaire, elle ne l’a pas attendu et elle a épousé quelqu’un d’autre. Quand ma grand-mère avait 58-59 ans, un homme (son premier amour) lui a envoyé une demande d’amis sur un réseau social, ils ont commencé à parler... En moins de six mois, ils ont décidé de se voir. Le trajet en train a duré deux jours et ils se sont finalement rencontrés. Cela fait maintenant deux ans qu’ils habitent ensemble et qu’ils nous rendent visite de temps en temps. Je réalise maintenant que leur amour l’un envers l’autre n’a jamais cessé.

Papers

Showing 76100 of 201 papers

TitleStatusHype
GODIVA: Generating Open-DomaIn Videos from nAtural DescriptionsCode1
Make-A-Video: Text-to-Video Generation without Text-Video DataCode1
Align your Latents: High-Resolution Video Synthesis with Latent Diffusion ModelsCode1
Diverse and Aligned Audio-to-Video Generation via Text-to-Video Model AdaptationCode1
MMTrail: A Multimodal Trailer Video Dataset with Language and Music DescriptionsCode1
TALC: Time-Aligned Captions for Multi-Scene Text-to-Video GenerationCode1
VPO: Aligning Text-to-Video Generation Models with Prompt OptimizationCode1
GenMAC: Compositional Text-to-Video Generation with Multi-Agent Collaboration0
Gender Bias in Text-to-Video Generation Models: A case study of Sora0
DisenStudio: Customized Multi-subject Text-to-Video Generation with Disentangled Spatial Control0
CustomVideo: Customizing Text-to-Video Generation with Multiple Subjects0
NewMove: Customizing text-to-video models with novel motions0
ByTheWay: Boost Your Text-to-Video Generation Model to Higher Quality in a Training-free Way0
Animate Your Motion: Turning Still Images into Dynamic Videos0
CPA: Camera-pose-awareness Diffusion Transformer for Video Generation0
Free^2Guide: Gradient-Free Path Integral Control for Enhancing Text-to-Video Generation with Large Vision-Language Models0
BroadWay: Boost Your Text-to-Video Generation Model in a Training-free Way0
Follow-Your-MultiPose: Tuning-Free Multi-Character Text-to-Video Generation via Pose Guidance0
MagicVideo: Efficient Video Generation With Latent Diffusion Models0
FlexLip: A Controllable Text-to-Lip System0
M4V: Multi-Modal Mamba for Text-to-Video Generation0
FlashVideo: A Framework for Swift Inference in Text-to-Video Generation0
BlobGEN-Vid: Compositional Text-to-Video Generation with Blob Video Representations0
VidGen-1M: A Large-Scale Dataset for Text-to-video Generation0
Make Pixels Dance: High-Dynamic Video Generation0
Show:102550
← PrevPage 4 of 9Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1MagicVideoFVD998Unverified
2VideoComposerFVD580Unverified
3ModelScopeT2VFVD550Unverified
4Show-1FVD538Unverified
5TF-T2VFVD441Unverified
6HiGenFVD406Unverified
7PixelDanceFVD381Unverified
8VideoPoetFVD213Unverified
9Video-LaVITFVD188.36Unverified
10Snap Video (288×288)FVD110.4Unverified
#ModelMetricClaimedVerifiedStatus
1MagicVideo (Zero-shot, 256x256)FVD16699Unverified
2Video LDM (Zero-shot, 320x512)FVD16550.61Unverified
3LAVIE (Zero-shot, 320x512)FVD16526.3Unverified
4PYoCo (Zero-shot, 64x64)FVD16355.19Unverified
5VideoPoetFVD16355Unverified
6Lumiere (Zero-shot, 1024x1024)FVD16332.49Unverified
7Snap Video (Zero-shot, 288×288)FVD16260.1Unverified
8W.A.L.T 3BFVD16258.1Unverified
9PixelDance (Zero-shot, 256x256)FVD16242.82Unverified
10Snap Video (Zero-shot, 512x288)FVD16200.2Unverified
#ModelMetricClaimedVerifiedStatus
1VideoCrafter2Visual Quality54.82Unverified
2Show-1Visual Quality53.74Unverified
3VideoCrafter1Visual Quality53.08Unverified
4LavieVisual Quality52.83Unverified
5ModelScopeVisual Quality52.47Unverified
#ModelMetricClaimedVerifiedStatus
1MAGVITFVD79.1Unverified
2MAGVITFVD28.5Unverified
#ModelMetricClaimedVerifiedStatus
1NUWA (128×128)Accuracy77.9Unverified
#ModelMetricClaimedVerifiedStatus
1VideoFactoryFVD292.35Unverified