| Neural Pitch-Shifting and Time-Stretching with Controllable LPCNet | Oct 5, 2021 | Audio-Visual Synchronization | CodeCode Available | 1 |
| Kling-Foley: Multimodal Diffusion Transformer for High-Quality Video-to-Audio Generation | Jun 24, 2025 | Audio GenerationAudio-Visual Synchronization | —Unverified | 0 |
| Audio-Sync Video Generation with Multi-Stream Temporal Control | Jun 9, 2025 | Audio-Visual SynchronizationVideo Alignment | —Unverified | 0 |
| OmniResponse: Online Multimodal Conversational Response Generation in Dyadic Interactions | May 27, 2025 | Audio-Visual SynchronizationConversational Response Generation | —Unverified | 0 |
| DeepAudio-V1:Towards Multi-Modal Multi-Stage End-to-End Video to Speech and Audio Generation | Mar 28, 2025 | Audio GenerationAudio-Visual Synchronization | —Unverified | 0 |
| UniSync: A Unified Framework for Audio-Visual Synchronization | Mar 20, 2025 | Audio-Visual SynchronizationContrastive Learning | —Unverified | 0 |
| FREAK: Frequency-modulated High-fidelity and Real-time Audio-driven Talking Portrait Synthesis | Mar 6, 2025 | Audio-Visual Synchronization | —Unverified | 0 |
| Draw an Audio: Leveraging Multi-Instruction for Video-to-Audio Synthesis | Sep 10, 2024 | Audio SynthesisAudio-Visual Synchronization | —Unverified | 0 |
| A Comprehensive Review and Taxonomy of Audio-Visual Synchronization Techniques for Realistic Speech Animation | Jul 24, 2024 | Audio-Visual Synchronization | —Unverified | 0 |
| RealTalk: Real-time and Realistic Audio-driven Face Generation with 3D Facial Prior-guided Identity Alignment Network | Jun 26, 2024 | Audio-Visual SynchronizationFace Generation | —Unverified | 0 |