SOTAVerified

Text-to-Video Generation

My grandmother told me that when she was a student, she had a boyfriend. When he turned 18, he had to leave for military service; she did not wait for him and married someone else. When my grandmother was 58 or 59, a man (her first love) sent her a friend request on a social network, and they began talking... Less than six months later, they decided to meet. The train journey took two days, and they finally met. They have now been living together for two years and visit us from time to time. I now realize that their love for each other never ended.

Papers

Showing 176–200 of 201 papers

| Title | Status | Hype |
|---|---|---|
| Grid Diffusion Models for Text-to-Video Generation | | 0 |
| GVDIFF: Grounded Text-to-Video Generation with Diffusion Models | | 0 |
| H3AE: High Compression, High Speed, and High Quality AutoEncoder for Video Diffusion Models | | 0 |
| HAIC: Improving Human Action Understanding and Generation with Better Captions for Multi-modal Large Language Models | | 0 |
| Harness Local Rewards for Global Benefits: Effective Text-to-Video Generation Alignment with Patch-level Reward Models | | 0 |
| HiTVideo: Hierarchical Tokenizers for Enhancing Text-to-Video Generation with Autoregressive Large Language Models | | 0 |
| I4VGen: Image as Free Stepping Stone for Text-to-Video Generation | | 0 |
| Follow-Your-MultiPose: Tuning-Free Multi-Character Text-to-Video Generation via Pose Guidance | | 0 |
| Improving Dynamic Object Interactions in Text-to-Video Generation with AI Feedback | | 0 |
| IM-Zero: Instance-level Motion Controllable Video Generation in a Zero-shot Manner | | 0 |
| InstanceCap: Improving Text-to-Video Generation via Instance-aware Structured Caption | | 0 |
| GPT4Video: A Unified Multimodal Large Language Model for Instruction-Followed Understanding and Safety-Aware Generation | | 0 |
| A Review of Multi-Modal Large Language and Vision Models | | 0 |
| Kubrick: Multimodal Agent Collaborations for Synthetic Video Generation | | 0 |
| Latent-Shift: Latent Diffusion with Temporal Shift for Efficient Text-to-Video Generation | | 0 |
| FlexLip: A Controllable Text-to-Lip System | | 0 |
| FlashVideo: A Framework for Swift Inference in Text-to-Video Generation | | 0 |
| VideoDPO: Omni-Preference Alignment for Video Diffusion Generation | | 0 |
| LinGen: Towards High-Resolution Minute-Length Text-to-Video Generation with Linear Computational Complexity | | 0 |
| LivePhoto: Real Image Animation with Text-guided Motion Control | | 0 |
| VideoDreamer: Customized Multi-Subject Text-to-Video Generation with Disen-Mix Finetuning | | 0 |
| LoViC: Efficient Long Video Generation with Context Compression | | 0 |
| Fine-grained Controllable Video Generation via Object Appearance and Context | | 0 |
| M4V: Multi-Modal Mamba for Text-to-Video Generation | | 0 |
| FastVideoEdit: Leveraging Consistency Models for Efficient Text-to-Video Editing | | 0 |

Benchmark Results

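| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | MagicVideo | FVD | 998 | | Unverified |
| 2 | VideoComposer | FVD | 580 | | Unverified |
| 3 | ModelScopeT2V | FVD | 550 | | Unverified |
| 4 | Show-1 | FVD | 538 | | Unverified |
| 5 | TF-T2V | FVD | 441 | | Unverified |
| 6 | HiGen | FVD | 406 | | Unverified |
| 7 | PixelDance | FVD | 381 | | Unverified |
| 8 | VideoPoet | FVD | 213 | | Unverified |
| 9 | Video-LaVIT | FVD | 188.36 | | Unverified |
| 10 | Snap Video (288×288) | FVD | 110.4 | | Unverified |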

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | MagicVideo (Zero-shot, 256×256) | FVD16 | 699 | | Unverified |
| 2 | Video LDM (Zero-shot, 320×512) | FVD16 | 550.61 | | Unverified |
| 3 | LAVIE (Zero-shot, 320×512) | FVD16 | 526.3 | | Unverified |
| 4 | PYoCo (Zero-shot, 64×64) | FVD16 | 355.19 | | Unverified |
| 5 | VideoPoet | FVD16 | 355 | | Unverified |
| 6 | Lumiere (Zero-shot, 1024×1024) | FVD16 | 332.49 | | Unverified |
| 7 | Snap Video (Zero-shot, 288×288) | FVD16 | 260.1 | | Unverified |
| 8 | W.A.L.T 3B | FVD16 | 258.1 | | Unverified |
| 9 | PixelDance (Zero-shot, 256×256) | FVD16 | 242.82 | | Unverified |
| 10 | Snap Video (Zero-shot, 512×288) | FVD16 | 200.2 | | Unverified |
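
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | VideoCrafter2 | Visual Quality | 54.82 | | Unverified |
| 2 | Show-1 | Visual Quality | 53.74 | | Unverified |
| 3 | VideoCrafter1 | Visual Quality | 53.08 | | Unverified |
| 4 | Lavie | Visual Quality | 52.83 | | Unverified |
| 5 | ModelScope | Visual Quality | 52.47 | | Unverified |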

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | MAGVIT | FVD | 79.1 | | Unverified |
| 2 | MAGVIT | FVD | 28.5 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | NUWA (128×128) | Accuracy | 77.9 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | VideoFactory | FVD | 292.35 | | Unverified |