| Lightweight Joint Audio-Visual Deepfake Detection via Single-Stream Multi-Modal Learning Framework | Jun 9, 2025 | audio-visual learning, DeepFake Detection | Unverified | 0 |
| CAV-MAE Sync: Improving Contrastive Audio-Visual Mask Autoencoders via Fine-Grained Alignment | May 2, 2025 | audio-visual learning, cross-modal alignment | Code Available | 1 |
| Rethinking Audio-Visual Adversarial Vulnerability from Temporal and Modality Perspectives | Feb 17, 2025 | Adversarial Robustness, audio-visual learning | Unverified | 0 |
| Language-Guided Audio-Visual Learning for Long-Term Sports Assessment | Jan 1, 2025 | audio-visual learning, Knowledge Graphs | Code Available | 1 |
| Dense Audio-Visual Event Localization under Cross-Modal Consistency and Multi-Temporal Granularity Collaboration | Dec 17, 2024 | audio-visual event localization, audio-visual learning | Code Available | 1 |
| Enhancing Sound Source Localization via False Negative Elimination | Aug 29, 2024 | audio-visual learning, Contrastive Learning | Code Available | 1 |
| Unveiling Visual Biases in Audio-Visual Localization Benchmarks | Aug 25, 2024 | audio-visual learning, Visual Localization | Unverified | 0 |
| Sequential Contrastive Audio-Visual Learning | Jul 8, 2024 | audio-visual learning, Contrastive Learning | Unverified | 0 |
| MA-AVT: Modality Alignment for Parameter-Efficient Audio-Visual Transformers | Jun 7, 2024 | audio-visual learning, Contrastive Learning | Code Available | 0 |
| EquiAV: Leveraging Equivariance for Audio-Visual Contrastive Learning | Mar 14, 2024 | Audio Classification, audio-visual learning | Code Available | 1 |