| FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds | Jul 1, 2024 | Audio GenerationVideo Alignment | CodeCode Available | 4 |
| DRIFT open dataset: A drone-derived intelligence for traffic analysis in urban environmen | Apr 15, 2025 | object-detectionObject Detection | CodeCode Available | 1 |
| Representation Learning via Global Temporal Alignment and Cycle-Consistency | May 11, 2021 | Action ClassificationDynamic Time Warping | CodeCode Available | 1 |
| Dance Any Beat: Blending Beats with Visuals in Dance Video Generation | May 15, 2024 | Image to Video GenerationOptical Flow Estimation | —Unverified | 0 |
| ACCURATE METHOD OF TEMPORAL-SHIFT ESTIMATION FOR 3D VIDEO | Jun 3, 2018 | Video AlignmentVideo Synchronization | —Unverified | 0 |
| AV-Link: Temporally-Aligned Diffusion Features for Cross-Modal Audio-Video Generation | Dec 19, 2024 | Video GenerationVideo Synchronization | —Unverified | 0 |
| Applying Automated Machine Translation to Educational Video Courses | Jan 9, 2023 | Machine TranslationSpeech Synthesis | —Unverified | 0 |
| Bronchoscopic video synchronization for interactive multimodal inspection of bronchial lesions | Mar 20, 2023 | Computed Tomography (CT)Video Synchronization | —Unverified | 0 |
| Collaborative Video Diffusion: Consistent Multi-video Generation with Camera Control | May 27, 2024 | Scene GenerationVideo Generation | —Unverified | 0 |
| AlignDiT: Multimodal Aligned Diffusion Transformer for Synchronized Speech Generation | Apr 29, 2025 | In-Context LearningSpeech Synthesis | —Unverified | 0 |