SOTAVerified

Text-to-Video Generation

Ma grand-mère m’a raconté que quand elle était étudiante, elle avait un petit-ami. À l’âge de 18 ans, il a dû partir pour le service militaire, elle ne l’a pas attendu et elle a épousé quelqu’un d’autre. Quand ma grand-mère avait 58-59 ans, un homme (son premier amour) lui a envoyé une demande d’amis sur un réseau social, ils ont commencé à parler... En moins de six mois, ils ont décidé de se voir. Le trajet en train a duré deux jours et ils se sont finalement rencontrés. Cela fait maintenant deux ans qu’ils habitent ensemble et qu’ils nous rendent visite de temps en temps. Je réalise maintenant que leur amour l’un envers l’autre n’a jamais cessé.

Papers

Showing 101150 of 201 papers

TitleStatusHype
Can Text-to-Video Generation help Video-Language Alignment?0
Resource-Efficient Motion Control for Video Generation via Dynamic Mask Guidance0
Enabling Versatile Controls for Video Diffusion ModelsCode0
HiTVideo: Hierarchical Tokenizers for Enhancing Text-to-Video Generation with Autoregressive Large Language Models0
WISA: World Simulator Assistant for Physics-Aware Text-to-Video Generation0
VideoUFO: A Million-Scale User-Focused Dataset for Text-to-Video GenerationCode0
HAIC: Improving Human Action Understanding and Generation with Better Captions for Multi-modal Large Language Models0
RelaCtrl: Relevance-Guided Efficient Control for Diffusion Transformers0
MALT Diffusion: Memory-Augmented Latent Transformers for Any-Length Video Generation0
CineMaster: A 3D-Aware and Controllable Framework for Cinematic Text-to-Video Generation0
Magic 1-For-1: Generating One Minute Video Clips within One MinuteCode0
IPO: Iterative Preference Optimization for Text-to-Video Generation0
Harness Local Rewards for Global Benefits: Effective Text-to-Video Generation Alignment with Patch-level Reward Models0
RichSpace: Enriching Text-to-Video Prompt Space via Text Embedding Interpolation0
BlobGEN-Vid: Compositional Text-to-Video Generation with Blob Video Representations0
ConceptMaster: Multi-Concept Video Customization on Diffusion Transformer Models Without Test-Time Tuning0
STDD: Spatio-Temporal Dual Diffusion for Video Generation0
EIDT-V: Exploiting Intersections in Diffusion Trajectories for Model-Agnostic, Zero-Shot, Training-Free Text-to-Video Generation0
ByTheWay: Boost Your Text-to-Video Generation Model to Higher Quality in a Training-free Way0
Patch Matters: Training-free Fine-grained Image Caption Enhancement via Local Perception0
IM-Zero: Instance-level Motion Controllable Video Generation in a Zero-shot Manner0
Gender Bias in Text-to-Video Generation Models: A case study of Sora0
Follow-Your-MultiPose: Tuning-Free Multi-Character Text-to-Video Generation via Pose Guidance0
VideoDPO: Omni-Preference Alignment for Video Diffusion Generation0
VG-TVP: Multimodal Procedural Planning via Visually Grounded Text-Video PromptingCode0
LinGen: Towards High-Resolution Minute-Length Text-to-Video Generation with Linear Computational Complexity0
InstanceCap: Improving Text-to-Video Generation via Instance-aware Structured Caption0
Mojito: Motion Trajectory and Intensity Control for Video Generation0
T-SVG: Text-Driven Stereoscopic Video Generation0
Multi-Shot Character Consistency for Text-to-Video Generation0
GenMAC: Compositional Text-to-Video Generation with Multi-Agent Collaboration0
Improving Dynamic Object Interactions in Text-to-Video Generation with AI Feedback0
CPA: Camera-pose-awareness Diffusion Transformer for Video Generation0
Free^2Guide: Gradient-Free Path Integral Control for Enhancing Text-to-Video Generation with Large Vision-Language Models0
DreamRunner: Fine-Grained Storytelling Video Generation with Retrieval-Augmented Motion Adaptation0
VideoRepair: Improving Text-to-Video Generation via Misalignment Evaluation and Localized Refinement0
Neuro-Symbolic Evaluation of Text-to-Video Models using Formal VerificationCode0
A Survey of Emerging Approaches and Advances in Video Generation0
GiVE: Guiding Visual Encoder to Perceive Overlooked Information0
MotionAura: Generating High-Quality and Motion Consistent Videos using Discrete DiffusionCode0
BroadWay: Boost Your Text-to-Video Generation Model in a Training-free Way0
Technical Report: Competition Solution For Modelscope-Sora0
Advancing Video Quality Assessment for AIGC0
The Art of Storytelling: Multi-Agent Generative AI for Dynamic Multimodal Narratives0
Compositional 3D-aware Video Generation with LLM Director0
Kubrick: Multimodal Agent Collaborations for Synthetic Video Generation0
VidGen-1M: A Large-Scale Dataset for Text-to-video Generation0
Unlearning Concepts from Text-to-Video Diffusion Models0
Video-to-Audio Generation with Hidden Alignment0
Mobius: A High Efficient Spatial-Temporal Parallel Training Paradigm for Text-to-Video Generation TaskCode0
Show:102550
← PrevPage 3 of 5Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1MagicVideoFVD998Unverified
2VideoComposerFVD580Unverified
3ModelScopeT2VFVD550Unverified
4Show-1FVD538Unverified
5TF-T2VFVD441Unverified
6HiGenFVD406Unverified
7PixelDanceFVD381Unverified
8VideoPoetFVD213Unverified
9Video-LaVITFVD188.36Unverified
10Snap Video (288×288)FVD110.4Unverified
#ModelMetricClaimedVerifiedStatus
1MagicVideo (Zero-shot, 256x256)FVD16699Unverified
2Video LDM (Zero-shot, 320x512)FVD16550.61Unverified
3LAVIE (Zero-shot, 320x512)FVD16526.3Unverified
4PYoCo (Zero-shot, 64x64)FVD16355.19Unverified
5VideoPoetFVD16355Unverified
6Lumiere (Zero-shot, 1024x1024)FVD16332.49Unverified
7Snap Video (Zero-shot, 288×288)FVD16260.1Unverified
8W.A.L.T 3BFVD16258.1Unverified
9PixelDance (Zero-shot, 256x256)FVD16242.82Unverified
10Snap Video (Zero-shot, 512x288)FVD16200.2Unverified
#ModelMetricClaimedVerifiedStatus
1VideoCrafter2Visual Quality54.82Unverified
2Show-1Visual Quality53.74Unverified
3VideoCrafter1Visual Quality53.08Unverified
4LavieVisual Quality52.83Unverified
5ModelScopeVisual Quality52.47Unverified
#ModelMetricClaimedVerifiedStatus
1MAGVITFVD79.1Unverified
2MAGVITFVD28.5Unverified
#ModelMetricClaimedVerifiedStatus
1NUWA (128×128)Accuracy77.9Unverified
#ModelMetricClaimedVerifiedStatus
1VideoFactoryFVD292.35Unverified