| Object Segmentation with Audio Context | Jan 4, 2023 | audio-visual learningDecoder | —Unverified | 0 |
| Learning in Audio-visual Context: A Review, Analysis, and New Perspective | Aug 20, 2022 | audio-visual learningScene Understanding | —Unverified | 0 |
| UAVM: Towards Unifying Audio and Visual Models | Jul 29, 2022 | Audio Classificationaudio-visual learning | CodeCode Available | 1 |
| Modality-Aware Contrastive Instance Learning with Self-Distillation for Weakly-Supervised Audio-Visual Violence Detection | Jul 12, 2022 | Anomaly Detection In Surveillance Videosaudio-visual learning | CodeCode Available | 1 |
| Few-Shot Audio-Visual Learning of Environment Acoustics | Jun 8, 2022 | audio-visual learningRoom Impulse Response (RIR) | —Unverified | 0 |
| Learning to Answer Questions in Dynamic Audio-Visual Scenarios | Mar 26, 2022 | audio-visual learningAudio-visual Question Answering | CodeCode Available | 1 |
| Cascaded Multilingual Audio-Visual Learning from Videos | Nov 8, 2021 | audio-visual learningRetrieval | CodeCode Available | 1 |
| Distilling Audio-Visual Knowledge by Compositional Contrastive Learning | Apr 22, 2021 | Audio Taggingaudio-visual learning | CodeCode Available | 1 |
| Can audio-visual integration strengthen robustness under multimodal attacks? | Apr 5, 2021 | audio-visual learningVisual Localization | CodeCode Available | 1 |
| Adversarial-Metric Learning for Audio-Visual Cross-Modal Matching | Jan 12, 2021 | audio-visual learningMetric Learning | CodeCode Available | 0 |
| Telling Left from Right: Learning Spatial Correspondence of Sight and Sound | Jun 11, 2020 | audio-visual learning | —Unverified | 0 |
| Deep Audio-Visual Learning: A Survey | Jan 14, 2020 | audio-visual learningRepresentation Learning | —Unverified | 0 |
| Audio-Visual Embedding for Cross-Modal MusicVideo Retrieval through Supervised Deep CCA | Aug 10, 2019 | audio-visual learningRetrieval | —Unverified | 0 |