| Rethinking Audio-Visual Adversarial Vulnerability from Temporal and Modality Perspectives | Feb 17, 2025 | Adversarial Robustnessaudio-visual learning | —Unverified | 0 |
| Unveiling Visual Biases in Audio-Visual Localization Benchmarks | Aug 25, 2024 | audio-visual learningVisual Localization | —Unverified | 0 |
| Sequential Contrastive Audio-Visual Learning | Jul 8, 2024 | audio-visual learningContrastive Learning | —Unverified | 0 |
| MA-AVT: Modality Alignment for Parameter-Efficient Audio-Visual Transformers | Jun 7, 2024 | audio-visual learningContrastive Learning | CodeCode Available | 0 |
| Multi-Input Multi-Output Target-Speaker Voice Activity Detection For Unified, Flexible, and Robust Audio-Visual Speaker Diarization | Jan 16, 2024 | Action DetectionActivity Detection | —Unverified | 0 |
| Boosting Audio-visual Zero-shot Learning with Large Language Models | Nov 21, 2023 | audio-visual learningDescriptive | CodeCode Available | 0 |
| Deep Video Inpainting Guided by Audio-Visual Self-Supervision | Oct 11, 2023 | audio-visual learningVideo Inpainting | CodeCode Available | 0 |
| Leveraging Pretrained Image-text Models for Improving Audio-Visual Learning | Sep 8, 2023 | audio-visual learningQuantization | —Unverified | 0 |
| RealImpact: A Dataset of Impact Sound Fields for Real Objects | Jun 16, 2023 | audio-visual learning | —Unverified | 0 |
| Versatile audio-visual learning for emotion recognition | May 12, 2023 | Arousal EstimationAttribute | —Unverified | 0 |