SOTAVerified

Video Synchronization

Papers

Showing 1-30 of 30 papers

Title | Status | Hype
FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds | Code | 4
DRIFT open dataset: A drone-derived intelligence for traffic analysis in urban environmen | Code | 1
Representation Learning via Global Temporal Alignment and Cycle-Consistency | Code | 1
Technical Report of the Video Event Reconstruction and Analysis (VERA) System -- Shooter Localization, Models, Interface, and Beyond | Code | 0
A subjective study of the perceptual acceptability of audio-video desynchronization in sports videos | Code | 0
Beyond Audio and Pose: A General-Purpose Framework for Video Synchronization | Code | 0
Deep learning-based stereo camera multi-video synchronization | Code | 0
PoseSync: Robust pose based video synchronization | Code | 0
Rolling Shutter Camera Synchronization with Sub-millisecond Accuracy | Code | 0
Collaborative Video Diffusion: Consistent Multi-video Generation with Camera Control | - | 0
Context-aware Talking Face Video Generation | - | 0
Dance Any Beat: Blending Beats with Visuals in Dance Video Generation | - | 0
SIDGAN: High-Resolution Dubbed Video Generation via Shift-Invariant Learning | - | 0
Detection of Audio-Video Synchronization Errors Via Event Detection | - | 0
AlignDiT: Multimodal Aligned Diffusion Transformer for Synchronized Speech Generation | - | 0
ACCURATE METHOD OF TEMPORAL-SHIFT ESTIMATION FOR 3D VIDEO | - | 0
Learning Robust Video Synchronization without Annotations | - | 0
ModEFormer: Modality-Preserving Embedding for Audio-Video Synchronization using Transformers | - | 0
Multi-Task Learning for Audio Visual Active Speaker Detection | - | 0
OmniTalker: Real-Time Text-Driven Talking Head Generation with In-Context Audio-Visual Style Replication | - | 0
Perfect match: Improved cross-modal embeddings for audio-visual synchronisation | - | 0
Sub-millisecond Video Synchronization of Multiple Android Smartphones | - | 0
Multimodal Cinematic Video Synthesis Using Text-to-Image and Audio Generation Models | - | 0
ReVISE: Self-Supervised Speech Resynthesis with Visual Input for Universal and Generalized Speech Enhancement | - | 0
ReVISE: Self-Supervised Speech Resynthesis With Visual Input for Universal and Generalized Speech Regeneration | - | 0
Applying Automated Machine Translation to Educational Video Courses | - | 0
Video alignment using unsupervised learning of local and global features | - | 0
AV-Link: Temporally-Aligned Diffusion Features for Cross-Modal Audio-Video Generation | - | 0
Self-supervised learning for audio-visual speaker diarization | - | 0
Bronchoscopic video synchronization for interactive multimodal inspection of bronchial lesions | - | 0

No leaderboard results yet.