| IPDnet: A Universal Direct-Path IPD Estimation Network for Sound Source Localization | May 11, 2024 | Sound Source Localization | Code Available | 2 |
| Joint Learning of Visual-Audio Saliency Prediction and Sound Source Localization on Multi-face Videos | Nov 5, 2021 | Prediction, Saliency Prediction | Code Available | 1 |
| A Closer Look at Weakly-Supervised Audio-Visual Source Localization | Aug 30, 2022 | Sound Source Localization | Code Available | 1 |
| HRTF measurement for accurate sound localization cues | Mar 7, 2022 | Sound Source Localization | Code Available | 1 |
| ODAS: Open embeddeD Audition System | Mar 5, 2021 | Sound Source Localization | Code Available | 1 |
| Audio-Visual Spatial Integration and Recursive Attention for Robust Sound Source Localization | Aug 11, 2023 | Sound Source Localization | Code Available | 1 |
| Localize to Binauralize: Audio Spatialization From Visual Sound Source Localization | Jan 1, 2021 | Audio Generation, Sound Source Localization | Code Available | 1 |
| Visual Sound Localization in the Wild by Cross-Modal Interference Erasing | Feb 13, 2022 | Sound Source Localization | Code Available | 1 |
| Chaotic World: A Large and Challenging Benchmark for Human Behavior Understanding in Chaotic Events | Jan 1, 2023 | Action Localization, Pathfinder | Code Available | 1 |
| Hearing and Seeing Through CLIP: A Framework for Self-Supervised Sound Source Localization | May 8, 2025 | Scene Understanding, Sound Source Localization | Code Available | 1 |
| Self-Supervised Predictive Learning: A Negative-Free Method for Sound Source Localization in Visual Scenes | Mar 25, 2022 | Contrastive Learning, Sound Source Localization | Code Available | 1 |
| Dual input neural networks for positional sound source localization | Aug 8, 2023 | Sound Source Localization | Code Available | 1 |
| Hear The Flow: Optical Flow-Based Self-Supervised Visual Sound Source Localization | Nov 6, 2022 | Optical Flow Estimation, Sound Source Localization | Code Available | 1 |
| Differentiable Tracking-Based Training of Deep Learning Sound Source Localizers | Oct 29, 2021 | Classification, Deep Learning | Code Available | 1 |
| wav2pos: Sound Source Localization using Masked Autoencoders | Aug 28, 2024 | Indoor Localization, Sound Source Localization | Code Available | 1 |
| Learning to Visually Localize Sound Sources from Mixtures without Prior Source Knowledge | Mar 26, 2024 | Object, Sound Source Localization | Code Available | 1 |
| Resilient Multiple Choice Learning: A learned scoring scheme with application to audio scene analysis | Nov 2, 2023 | Density Estimation, Diversity | Code Available | 1 |
| Speaker Distance Estimation in Enclosures from Single-Channel Audio | Mar 26, 2024 | Sound Source Localization | Code Available | 1 |
| Audio-Visual Grouping Network for Sound Localization from Mixtures | Mar 29, 2023 | Object Localization, Sound Source Localization | Code Available | 1 |
| FN-SSL: Full-Band and Narrow-Band Fusion for Sound Source Localization | May 31, 2023 | Sound Source Localization | Code Available | 1 |
| Can CLIP Help Sound Source Localization? | Nov 7, 2023 | audio-visual learning, Contrastive Learning | Code Available | 1 |
| Enhancing Sound Source Localization via False Negative Elimination | Aug 29, 2024 | audio-visual learning, Contrastive Learning | Code Available | 1 |
| Sound2Vision: Generating Diverse Visuals from Audio through Cross-Modal Latent Alignment | Dec 9, 2024 | Sound Source Localization | Code Available | 1 |
| Aligning Sight and Sound: Advanced Sound Source Localization Through Audio-Visual Alignment | Jul 18, 2024 | cross-modal alignment, Cross-Modal Retrieval | Code Available | 1 |
| Audio-Visual Instance Segmentation | Oct 28, 2023 | Instance Segmentation, Segmentation | Code Available | 1 |
| Novel-View Acoustic Synthesis from 3D Reconstructed Rooms | Oct 23, 2023 | 3D geometry, Sound Source Localization | Code Available | 1 |
| DiffusionRIR: Room Impulse Response Interpolation using Diffusion Models | Apr 29, 2025 | Audio Signal Processing, Data Augmentation | Unverified | 0 |
| A Proposal-Based Paradigm for Self-Supervised Sound Source Localization in Videos | Jan 1, 2022 | Multiple Instance Learning, Sound Source Localization | Unverified | 0 |
| Deep Learning Based Audio-Visual Multi-Speaker DOA Estimation Using Permutation-Free Loss Function | Oct 26, 2022 | Active Speaker Detection, Sound Source Localization | Unverified | 0 |
| AcousticFusion: Fusing Sound Source Localization to Visual SLAM in Dynamic Environments | Aug 3, 2021 | Depth Estimation, Object | Unverified | 0 |
| Improving Sound Source Localization with Joint Slot Attention on Image and Audio | Apr 21, 2025 | Contrastive Learning, Cross-Modal Retrieval | Unverified | 0 |
| Data-Efficient Framework for Real-world Multiple Sound Source 2D Localization | Dec 10, 2020 | Sound Source Localization | Unverified | 0 |
| Data-driven 3D Room Geometry Inference with a Linear Loudspeaker Array and a Single Microphone | Aug 28, 2023 | Sound Source Localization | Unverified | 0 |
| Gaussian Process Models for HRTF based Sound-Source Localization and Active-Learning | Feb 11, 2015 | Active Learning, regression | Unverified | 0 |
| Contrastive Self-Supervised Learning of Global-Local Audio-Visual Representations | Jan 1, 2021 | Classification, DeepFake Detection | Unverified | 0 |
| Where's That Voice Coming? Continual Learning for Sound Source Localization | Jul 4, 2024 | Continual Learning, Exemplar-Free | Unverified | 0 |
| Improving trajectory localization accuracy via direction-of-arrival derivative estimation | Dec 7, 2022 | Direction of Arrival Estimation, Sound Source Localization | Unverified | 0 |
| Audio-Visual Calibration with Polynomial Regression for 2-D Projection Using SVD-PHAT | Feb 12, 2020 | regression, Sound Source Localization | Unverified | 0 |
| Fast and Robust 3-D Sound Source Localization with DSVD-PHAT | Jul 29, 2019 | Sound Source Localization | Unverified | 0 |
| Advances in Online Audio-Visual Meeting Transcription | Dec 10, 2019 | Sound Source Localization, speaker-diarization | Unverified | 0 |
| Ensemble of Discriminators for Domain Adaptation in Multiple Sound Source 2D Localization | Dec 10, 2020 | Domain Adaptation, Sound Source Localization | Unverified | 0 |
| Enhanced Robot Speech Recognition Using Biomimetic Binaural Sound Source Localization | Feb 13, 2019 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| End-to-end Two-dimensional Sound Source Localization With Ad-hoc Microphone Arrays | Oct 16, 2022 | Sound Source Localization | Unverified | 0 |
| Feature Aggregation in Joint Sound Classification and Localization Neural Networks | Oct 29, 2023 | regression, Sound Classification | Unverified | 0 |
| CNN-based Robust Sound Source Localization with SRP-PHAT for the Extreme Edge | Mar 3, 2025 | Sound Source Localization | Unverified | 0 |
| Broadband MEMS Microphone Arrays with Reduced Aperture Through 3D-Printed Waveguides | Jun 11, 2024 | Sound Source Localization | Unverified | 0 |
| Audio Simulation for Sound Source Localization in Virtual Evironment | Apr 2, 2024 | Sound Source Localization | Unverified | 0 |
| Graph-Enhanced Dual-Stream Feature Fusion with Pre-Trained Model for Acoustic Traffic Monitoring | Dec 26, 2024 | Graph Attention, Sound Source Localization | Unverified | 0 |
| Emergency Vehicles Audio Detection and Localization in Autonomous Driving | Sep 30, 2021 | Autonomous Driving, Sound Source Localization | Unverified | 0 |
| Efficient and Microphone-Fault-Tolerant 3D Sound Source Localization | May 27, 2025 | Position, Sound Source Localization | Unverified | 0 |