SOTAVerified

Audio-Visual Synchronization

Papers

Showing 110 of 32 papers

TitleStatusHype
Kling-Foley: Multimodal Diffusion Transformer for High-Quality Video-to-Audio Generation0
Audio-Sync Video Generation with Multi-Stream Temporal Control0
OmniResponse: Online Multimodal Conversational Response Generation in Dyadic Interactions0
CoGenAV: Versatile Audio-Visual Representation Learning via Contrastive-Generative SynchronizationCode2
DeepAudio-V1:Towards Multi-Modal Multi-Stage End-to-End Video to Speech and Audio Generation0
UniSync: A Unified Framework for Audio-Visual Synchronization0
FREAK: Frequency-modulated High-fidelity and Real-time Audio-driven Talking Portrait Synthesis0
MMAudio: Taming Multimodal Joint Training for High-Quality Video-to-Audio SynthesisCode7
MuseTalk: Real-Time High-Fidelity Video Dubbing via Spatio-Temporal SamplingCode9
Draw an Audio: Leveraging Multi-Instruction for Video-to-Audio Synthesis0
Show:102550
← PrevPage 1 of 4Next →

No leaderboard results yet.