| MA-AVT: Modality Alignment for Parameter-Efficient Audio-Visual Transformers | Jun 7, 2024 | audio-visual learningContrastive Learning | CodeCode Available | 0 | 5 |
| Revisiting Pre-training in Audio-Visual Learning | Feb 7, 2023 | audio-visual learning | CodeCode Available | 0 | 5 |
| Boosting Audio-visual Zero-shot Learning with Large Language Models | Nov 21, 2023 | audio-visual learningDescriptive | CodeCode Available | 0 | 5 |
| Adversarial-Metric Learning for Audio-Visual Cross-Modal Matching | Jan 12, 2021 | audio-visual learningMetric Learning | CodeCode Available | 0 | 5 |
| Versatile audio-visual learning for emotion recognition | May 12, 2023 | Arousal EstimationAttribute | —Unverified | 0 | 0 |
| Audio-Visual Embedding for Cross-Modal MusicVideo Retrieval through Supervised Deep CCA | Aug 10, 2019 | audio-visual learningRetrieval | —Unverified | 0 | 0 |
| Deep Audio-Visual Learning: A Survey | Jan 14, 2020 | audio-visual learningRepresentation Learning | —Unverified | 0 | 0 |
| Few-Shot Audio-Visual Learning of Environment Acoustics | Jun 8, 2022 | audio-visual learningRoom Impulse Response (RIR) | —Unverified | 0 | 0 |
| Learning in Audio-visual Context: A Review, Analysis, and New Perspective | Aug 20, 2022 | audio-visual learningScene Understanding | —Unverified | 0 | 0 |
| Leveraging Pretrained Image-text Models for Improving Audio-Visual Learning | Sep 8, 2023 | audio-visual learningQuantization | —Unverified | 0 | 0 |