SOTAVerified

Video Synchronization

Papers

Showing 125 of 30 papers

TitleStatusHype
Beyond Audio and Pose: A General-Purpose Framework for Video SynchronizationCode0
AlignDiT: Multimodal Aligned Diffusion Transformer for Synchronized Speech Generation0
DRIFT open dataset: A drone-derived intelligence for traffic analysis in urban environmenCode1
Multimodal Cinematic Video Synthesis Using Text-to-Image and Audio Generation Models0
OmniTalker: Real-Time Text-Driven Talking Head Generation with In-Context Audio-Visual Style Replication0
AV-Link: Temporally-Aligned Diffusion Features for Cross-Modal Audio-Video Generation0
FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized SoundsCode4
Collaborative Video Diffusion: Consistent Multi-video Generation with Camera Control0
Dance Any Beat: Blending Beats with Visuals in Dance Video Generation0
Context-aware Talking Face Video Generation0
PoseSync: Robust pose based video synchronizationCode0
Video alignment using unsupervised learning of local and global features0
Deep learning-based stereo camera multi-video synchronizationCode0
ModEFormer: Modality-Preserving Embedding for Audio-Video Synchronization using Transformers0
Bronchoscopic video synchronization for interactive multimodal inspection of bronchial lesions0
Applying Automated Machine Translation to Educational Video Courses0
SIDGAN: High-Resolution Dubbed Video Generation via Shift-Invariant Learning0
ReVISE: Self-Supervised Speech Resynthesis With Visual Input for Universal and Generalized Speech Regeneration0
ReVISE: Self-Supervised Speech Resynthesis with Visual Input for Universal and Generalized Speech Enhancement0
A subjective study of the perceptual acceptability of audio-video desynchronization in sports videosCode0
Sub-millisecond Video Synchronization of Multiple Android Smartphones0
Representation Learning via Global Temporal Alignment and Cycle-ConsistencyCode1
Detection of Audio-Video Synchronization Errors Via Event Detection0
Self-supervised learning for audio-visual speaker diarization0
Multi-Task Learning for Audio Visual Active Speaker Detection0
Show:102550
← PrevPage 1 of 2Next →

No leaderboard results yet.