| FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds | Jul 1, 2024 | Audio GenerationVideo Alignment | CodeCode Available | 4 |
| DRIFT open dataset: A drone-derived intelligence for traffic analysis in urban environmen | Apr 15, 2025 | object-detectionObject Detection | CodeCode Available | 1 |
| Representation Learning via Global Temporal Alignment and Cycle-Consistency | May 11, 2021 | Action ClassificationDynamic Time Warping | CodeCode Available | 1 |
| Beyond Audio and Pose: A General-Purpose Framework for Video Synchronization | Jun 19, 2025 | Pose EstimationVideo Synchronization | CodeCode Available | 0 |
| AlignDiT: Multimodal Aligned Diffusion Transformer for Synchronized Speech Generation | Apr 29, 2025 | In-Context LearningSpeech Synthesis | —Unverified | 0 |
| Multimodal Cinematic Video Synthesis Using Text-to-Image and Audio Generation Models | Apr 6, 2025 | Audio GenerationGPU | —Unverified | 0 |
| OmniTalker: Real-Time Text-Driven Talking Head Generation with In-Context Audio-Visual Style Replication | Apr 3, 2025 | Talking Head GenerationVideo Synchronization | —Unverified | 0 |
| AV-Link: Temporally-Aligned Diffusion Features for Cross-Modal Audio-Video Generation | Dec 19, 2024 | Video GenerationVideo Synchronization | —Unverified | 0 |
| Collaborative Video Diffusion: Consistent Multi-video Generation with Camera Control | May 27, 2024 | Scene GenerationVideo Generation | —Unverified | 0 |
| Dance Any Beat: Blending Beats with Visuals in Dance Video Generation | May 15, 2024 | Image to Video GenerationOptical Flow Estimation | —Unverified | 0 |