| Audio-Visual Embedding for Cross-Modal MusicVideo Retrieval through Supervised Deep CCA | Aug 10, 2019 | audio-visual learningRetrieval | —Unverified | 0 |
| Deep Audio-Visual Learning: A Survey | Jan 14, 2020 | audio-visual learningRepresentation Learning | —Unverified | 0 |
| Few-Shot Audio-Visual Learning of Environment Acoustics | Jun 8, 2022 | audio-visual learningRoom Impulse Response (RIR) | —Unverified | 0 |
| Learning in Audio-visual Context: A Review, Analysis, and New Perspective | Aug 20, 2022 | audio-visual learningScene Understanding | —Unverified | 0 |
| Leveraging Pretrained Image-text Models for Improving Audio-Visual Learning | Sep 8, 2023 | audio-visual learningQuantization | —Unverified | 0 |
| Lightweight Joint Audio-Visual Deepfake Detection via Single-Stream Multi-Modal Learning Framework | Jun 9, 2025 | audio-visual learningDeepFake Detection | —Unverified | 0 |
| RealImpact: A Dataset of Impact Sound Fields for Real Objects | Jun 16, 2023 | audio-visual learning | —Unverified | 0 |
| Rethinking Audio-Visual Adversarial Vulnerability from Temporal and Modality Perspectives | Feb 17, 2025 | Adversarial Robustnessaudio-visual learning | —Unverified | 0 |
| Sequential Contrastive Audio-Visual Learning | Jul 8, 2024 | audio-visual learningContrastive Learning | —Unverified | 0 |
| Telling Left from Right: Learning Spatial Correspondence of Sight and Sound | Jun 11, 2020 | audio-visual learning | —Unverified | 0 |