| IPDnet: A Universal Direct-Path IPD Estimation Network for Sound Source Localization | May 11, 2024 | Sound Source Localization | CodeCode Available | 2 |
| Hearing and Seeing Through CLIP: A Framework for Self-Supervised Sound Source Localization | May 8, 2025 | Scene UnderstandingSound Source Localization | CodeCode Available | 1 |
| Sound2Vision: Generating Diverse Visuals from Audio through Cross-Modal Latent Alignment | Dec 9, 2024 | Sound Source Localization | CodeCode Available | 1 |
| Enhancing Sound Source Localization via False Negative Elimination | Aug 29, 2024 | audio-visual learningContrastive Learning | CodeCode Available | 1 |
| wav2pos: Sound Source Localization using Masked Autoencoders | Aug 28, 2024 | Indoor LocalizationSound Source Localization | CodeCode Available | 1 |
| Aligning Sight and Sound: Advanced Sound Source Localization Through Audio-Visual Alignment | Jul 18, 2024 | cross-modal alignmentCross-Modal Retrieval | CodeCode Available | 1 |
| Speaker Distance Estimation in Enclosures from Single-Channel Audio | Mar 26, 2024 | Sound Source Localization | CodeCode Available | 1 |
| Learning to Visually Localize Sound Sources from Mixtures without Prior Source Knowledge | Mar 26, 2024 | ObjectSound Source Localization | CodeCode Available | 1 |
| Can CLIP Help Sound Source Localization? | Nov 7, 2023 | audio-visual learningContrastive Learning | CodeCode Available | 1 |
| Resilient Multiple Choice Learning: A learned scoring scheme with application to audio scene analysis | Nov 2, 2023 | Density EstimationDiversity | CodeCode Available | 1 |