SOTAVerified

Text-to-Video Generation

My grandmother told me that when she was a student, she had a boyfriend. At 18, he had to leave for military service; she did not wait for him and married someone else. When my grandmother was 58 or 59, a man (her first love) sent her a friend request on a social network, and they started talking... Within six months, they decided to meet. The train journey took two days, and they finally met. They have now been living together for two years and visit us from time to time. I now realize that their love for each other never ended.

Papers

Showing 101-150 of 201 papers

| Title | Status | Hype |
| --- | --- | --- |
| We'll Fix it in Post: Improving Text-to-Video Generation with Neuro-Symbolic Feedback | | 0 |
| Make Pixels Dance: High-Dynamic Video Generation | | 0 |
| MALT Diffusion: Memory-Augmented Latent Transformers for Any-Length Video Generation | | 0 |
| MicroCinema: A Divide-and-Conquer Approach for Text-to-Video Generation | | 0 |
| VideoGen: A Reference-Guided Latent Diffusion Approach for High Definition Text-to-Video Generation | | 0 |
| DyST-XL: Dynamic Layout Planning and Content Control for Compositional Text-to-Video Generation | | 0 |
| Modular-Cam: Modular Dynamic Camera-view Video Generation with LLM | | 0 |
| Mojito: Motion Trajectory and Intensity Control for Video Generation | | 0 |
| Dual-Stream Diffusion Net for Text-to-Video Generation | | 0 |
| MotionBooth: Motion-Aware Customized Text-to-Video Generation | | 0 |
| DualReal: Adaptive Joint Training for Lossless Identity-Motion Fusion in Video Customization | | 0 |
| Video Generation from Text Employing Latent Path Construction for Temporal Modeling | | 0 |
| MotionMaster: Training-free Camera Motion Transfer For Video Generation | | 0 |
| MotionZero: Exploiting Motion Priors for Zero-shot Text-to-Video Generation | | 0 |
| Animate Your Motion: Turning Still Images into Dynamic Videos | | 0 |
| Multi-Shot Character Consistency for Text-to-Video Generation | | 0 |
| VideoMage: Multi-Subject and Motion Customization of Text-to-Video Diffusion Models | | 0 |
| DreamRunner: Fine-Grained Storytelling Video Generation with Retrieval-Augmented Motion Adaptation | | 0 |
| DisenStudio: Customized Multi-subject Text-to-Video Generation with Disentangled Spatial Control | | 0 |
| VidGen-1M: A Large-Scale Dataset for Text-to-video Generation | | 0 |
| CustomVideo: Customizing Text-to-Video Generation with Multiple Subjects | | 0 |
| OpenVid-1M: A Large-Scale High-Quality Dataset for Text-to-video Generation | | 0 |
| POS: A Prompts Optimization Suite for Augmenting Text-to-Video Generation | | 0 |
| Patch Matters: Training-free Fine-grained Image Caption Enhancement via Local Perception | | 0 |
| VideoPoet: A Large Language Model for Zero-Shot Video Generation | | 0 |
| Photorealistic Video Generation with Diffusion Models | | 0 |
| NewMove: Customizing text-to-video models with novel motions | | 0 |
| Preserve Your Own Correlation: A Noise Prior for Video Diffusion Models | | 0 |
| CPA: Camera-pose-awareness Diffusion Transformer for Video Generation | | 0 |
| RelaCtrl: Relevance-Guided Efficient Control for Diffusion Transformers | | 0 |
| Resource-Efficient Motion Control for Video Generation via Dynamic Mask Guidance | | 0 |
| VideoRepair: Improving Text-to-Video Generation via Misalignment Evaluation and Localized Refinement | | 0 |
| RichSpace: Enriching Text-to-Video Prompt Space via Text Embedding Interpolation | | 0 |
| S2DM: Sector-Shaped Diffusion Models for Video Generation | | 0 |
| Safe-Sora: Safe Text-to-Video Generation via Graphical Watermarking | | 0 |
| AniClipart: Clipart Animation with Text-to-Video Priors | | 0 |
| ConceptMaster: Multi-Concept Video Customization on Diffusion Transformer Models Without Test-Time Tuning | | 0 |
| Video-to-Audio Generation with Hidden Alignment | | 0 |
| Snap Video: Scaled Spatiotemporal Transformers for Text-to-Video Synthesis | | 0 |
| Sora as an AGI World Model? A Complete Survey on Text-to-Video Generation | | 0 |
| STDD: Spatio-Temporal Dual Diffusion for Video Generation | | 0 |
| Compositional 3D-aware Video Generation with LLM Director | | 0 |
| Structure and Content-Guided Video Synthesis with Diffusion Models | | 0 |
| CineMaster: A 3D-Aware and Controllable Framework for Cinematic Text-to-Video Generation | | 0 |
| Can Text-to-Video Generation help Video-Language Alignment? | | 0 |
| T2VPhysBench: A First-Principles Benchmark for Physical Consistency in Text-to-Video Generation | | 0 |
| T2VTextBench: A Human Evaluation Benchmark for Textual Control in Video Generation Models | | 0 |
| Technical Report: Competition Solution For Modelscope-Sora | | 0 |
| Advancing Video Quality Assessment for AIGC | | 0 |
| ByTheWay: Boost Your Text-to-Video Generation Model to Higher Quality in a Training-free Way | | 0 |

Benchmark Results

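| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | MagicVideo | FVD | 998 | | Unverified |
| 2 | VideoComposer | FVD | 580 | | Unverified |
| 3 | ModelScopeT2V | FVD | 550 | | Unverified |
| 4 | Show-1 | FVD | 538 | | Unverified |
| 5 | TF-T2V | FVD | 441 | | Unverified |
| 6 | HiGen | FVD | 406 | | Unverified |
| 7 | PixelDance | FVD | 381 | | Unverified |
| 8 | VideoPoet | FVD | 213 | | Unverified |
| 9 | Video-LaVIT | FVD | 188.36 | | Unverified |
| 10 | Snap Video (288×288) | FVD | 110.4 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | MagicVideo (Zero-shot, 256x256) | FVD16 | 699 | | Unverified |
| 2 | Video LDM (Zero-shot, 320x512) | FVD16 | 550.61 | | Unverified |
| 3 | LAVIE (Zero-shot, 320x512) | FVD16 | 526.3 | | Unverified |
| 4 | PYoCo (Zero-shot, 64x64) | FVD16 | 355.19 | | Unverified |
| 5 | VideoPoet | FVD16 | 355 | | Unverified |
| 6 | Lumiere (Zero-shot, 1024x1024) | FVD16 | 332.49 | | Unverified |
| 7 | Snap Video (Zero-shot, 288×288) | FVD16 | 260.1 | | Unverified |
| 8 | W.A.L.T 3B | FVD16 | 258.1 | | Unverified |
| 9 | PixelDance (Zero-shot, 256x256) | FVD16 | 242.82 | | Unverified |
| 10 | Snap Video (Zero-shot, 512x288) | FVD16 | 200.2 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | VideoCrafter2 | Visual Quality | 54.82 | | Unverified |
| 2 | Show-1 | Visual Quality | 53.74 | | Unverified |
| 3 | VideoCrafter1 | Visual Quality | 53.08 | | Unverified |
| 4 | Lavie | Visual Quality | 52.83 | | Unverified |
| 5 | ModelScope | Visual Quality | 52.47 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | MAGVIT | FVD | 79.1 | | Unverified |
| 2 | MAGVIT | FVD | 28.5 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | NUWA (128×128) | Accuracy | 77.9 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | VideoFactory | FVD | 292.35 | | Unverified |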