| UniSync: A Unified Framework for Audio-Visual Synchronization | Mar 20, 2025 | Audio-Visual Synchronization, Contrastive Learning | —Unverified | 0 |
| DeepAudio-V1: Towards Multi-Modal Multi-Stage End-to-End Video to Speech and Audio Generation | Mar 28, 2025 | Audio Generation, Audio-Visual Synchronization | —Unverified | 0 |
| Draw an Audio: Leveraging Multi-Instruction for Video-to-Audio Synthesis | Sep 10, 2024 | Audio Synthesis, Audio-Visual Synchronization | —Unverified | 0 |
| CoAVT: A Cognition-Inspired Unified Audio-Visual-Text Pre-Training Model for Multimodal Processing | Jan 22, 2024 | AudioCaps, Audio-Visual Synchronization | —Unverified | 0 |
| FaceDirector: Continuous Control of Facial Performance in Video | Dec 1, 2015 | Audio-Visual Synchronization, continuous-control | —Unverified | 0 |
| FREAK: Frequency-modulated High-fidelity and Real-time Audio-driven Talking Portrait Synthesis | Mar 6, 2025 | Audio-Visual Synchronization | —Unverified | 0 |
| Identity-Preserving Realistic Talking Face Generation | May 25, 2020 | Audio-Visual Synchronization, Face Generation | —Unverified | 0 |