| FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds | Jul 1, 2024 | Audio Generation, Video Alignment | Code Available | 4 | 5 |
| DRIFT open dataset: A drone-derived intelligence for traffic analysis in urban environmen | Apr 15, 2025 | Object Detection | Code Available | 1 | 5 |
| Representation Learning via Global Temporal Alignment and Cycle-Consistency | May 11, 2021 | Action Classification, Dynamic Time Warping | Code Available | 1 | 5 |
| Technical Report of the Video Event Reconstruction and Analysis (VERA) System -- Shooter Localization, Models, Interface, and Beyond | May 26, 2019 | Gunshot Detection, Shooter Localization | Code Available | 0 | 5 |
| A subjective study of the perceptual acceptability of audio-video desynchronization in sports videos | Dec 3, 2022 | Video Synchronization | Code Available | 0 | 5 |
| Beyond Audio and Pose: A General-Purpose Framework for Video Synchronization | Jun 19, 2025 | Pose Estimation, Video Synchronization | Code Available | 0 | 5 |
| Deep learning-based stereo camera multi-video synchronization | Mar 22, 2023 | Deep Learning, Video Synchronization | Code Available | 0 | 5 |
| PoseSync: Robust pose based video synchronization | Aug 24, 2023 | Dynamic Time Warping, Video Synchronization | Code Available | 0 | 5 |
| Rolling Shutter Camera Synchronization with Sub-millisecond Accuracy | Feb 28, 2019 | Video Synchronization | Code Available | 0 | 5 |
| Collaborative Video Diffusion: Consistent Multi-video Generation with Camera Control | May 27, 2024 | Scene Generation, Video Generation | Unverified | 0 | 0 |
| Context-aware Talking Face Video Generation | Feb 28, 2024 | Video Generation, Video Synchronization | Unverified | 0 | 0 |
| Dance Any Beat: Blending Beats with Visuals in Dance Video Generation | May 15, 2024 | Image to Video Generation, Optical Flow Estimation | Unverified | 0 | 0 |
| SIDGAN: High-Resolution Dubbed Video Generation via Shift-Invariant Learning | Jan 1, 2023 | Image Generation, Video Generation | Unverified | 0 | 0 |
| Detection of Audio-Video Synchronization Errors Via Event Detection | Apr 20, 2021 | Event Detection, Video Synchronization | Unverified | 0 | 0 |
| AlignDiT: Multimodal Aligned Diffusion Transformer for Synchronized Speech Generation | Apr 29, 2025 | In-Context Learning, Speech Synthesis | Unverified | 0 | 0 |
| ACCURATE METHOD OF TEMPORAL-SHIFT ESTIMATION FOR 3D VIDEO | Jun 3, 2018 | Video Alignment, Video Synchronization | Unverified | 0 | 0 |
| Learning Robust Video Synchronization without Annotations | Oct 19, 2016 | Video Alignment, Video Synchronization | Unverified | 0 | 0 |
| ModEFormer: Modality-Preserving Embedding for Audio-Video Synchronization using Transformers | Mar 21, 2023 | Contrastive Learning, Video Synchronization | Unverified | 0 | 0 |
| Multi-Task Learning for Audio Visual Active Speaker Detection | Jun 1, 2019 | Active Speaker Detection, Audio-Visual Active Speaker Detection | Unverified | 0 | 0 |
| OmniTalker: Real-Time Text-Driven Talking Head Generation with In-Context Audio-Visual Style Replication | Apr 3, 2025 | Talking Head Generation, Video Synchronization | Unverified | 0 | 0 |
| Perfect match: Improved cross-modal embeddings for audio-visual synchronisation | Sep 21, 2018 | Binary Classification, Cross-Modal Retrieval | Unverified | 0 | 0 |
| Sub-millisecond Video Synchronization of Multiple Android Smartphones | Jul 2, 2021 | Video Synchronization | Unverified | 0 | 0 |
| Multimodal Cinematic Video Synthesis Using Text-to-Image and Audio Generation Models | Apr 6, 2025 | Audio Generation, GPU | Unverified | 0 | 0 |
| ReVISE: Self-Supervised Speech Resynthesis with Visual Input for Universal and Generalized Speech Enhancement | Dec 21, 2022 | Audio-Visual Speech Recognition, Resynthesis | Unverified | 0 | 0 |
| ReVISE: Self-Supervised Speech Resynthesis With Visual Input for Universal and Generalized Speech Regeneration | Jan 1, 2023 | Audio-Visual Speech Recognition, Resynthesis | Unverified | 0 | 0 |
| Applying Automated Machine Translation to Educational Video Courses | Jan 9, 2023 | Machine Translation, Speech Synthesis | Unverified | 0 | 0 |
| Video alignment using unsupervised learning of local and global features | Apr 13, 2023 | Dynamic Time Warping, Human Detection | Unverified | 0 | 0 |
| AV-Link: Temporally-Aligned Diffusion Features for Cross-Modal Audio-Video Generation | Dec 19, 2024 | Video Generation, Video Synchronization | Unverified | 0 | 0 |
| Self-supervised learning for audio-visual speaker diarization | Feb 13, 2020 | Self-Supervised Learning, Speaker Diarization | Unverified | 0 | 0 |
| Bronchoscopic video synchronization for interactive multimodal inspection of bronchial lesions | Mar 20, 2023 | Computed Tomography (CT), Video Synchronization | Unverified | 0 | 0 |