| IPDnet: A Universal Direct-Path IPD Estimation Network for Sound Source Localization | May 11, 2024 | Sound Source Localization | CodeCode Available | 2 | 5 |
| Sound2Vision: Generating Diverse Visuals from Audio through Cross-Modal Latent Alignment | Dec 9, 2024 | Sound Source Localization | CodeCode Available | 1 | 5 |
| ODAS: Open embeddeD Audition System | Mar 5, 2021 | Sound Source Localization | CodeCode Available | 1 | 5 |
| Audio-Visual Spatial Integration and Recursive Attention for Robust Sound Source Localization | Aug 11, 2023 | Sound Source Localization | CodeCode Available | 1 | 5 |
| Speaker Distance Estimation in Enclosures from Single-Channel Audio | Mar 26, 2024 | Sound Source Localization | CodeCode Available | 1 | 5 |
| Can CLIP Help Sound Source Localization? | Nov 7, 2023 | audio-visual learningContrastive Learning | CodeCode Available | 1 | 5 |
| Resilient Multiple Choice Learning: A learned scoring scheme with application to audio scene analysis | Nov 2, 2023 | Density EstimationDiversity | CodeCode Available | 1 | 5 |
| Localize to Binauralize: Audio Spatialization From Visual Sound Source Localization | Jan 1, 2021 | Audio GenerationSound Source Localization | CodeCode Available | 1 | 5 |
| Dual input neural networks for positional sound source localization | Aug 8, 2023 | Sound Source Localization | CodeCode Available | 1 | 5 |
| A Closer Look at Weakly-Supervised Audio-Visual Source Localization | Aug 30, 2022 | Sound Source Localization | CodeCode Available | 1 | 5 |
| wav2pos: Sound Source Localization using Masked Autoencoders | Aug 28, 2024 | Indoor LocalizationSound Source Localization | CodeCode Available | 1 | 5 |
| FN-SSL: Full-Band and Narrow-Band Fusion for Sound Source Localization | May 31, 2023 | Sound Source Localization | CodeCode Available | 1 | 5 |
| Enhancing Sound Source Localization via False Negative Elimination | Aug 29, 2024 | audio-visual learningContrastive Learning | CodeCode Available | 1 | 5 |
| Chaotic World: A Large and Challenging Benchmark for Human Behavior Understanding in Chaotic Events | Jan 1, 2023 | Action LocalizationPathfinder | CodeCode Available | 1 | 5 |
| Joint Learning of Visual-Audio Saliency Prediction and Sound Source Localization on Multi-face Videos | Nov 5, 2021 | PredictionSaliency Prediction | CodeCode Available | 1 | 5 |
| HRTF measurement for accurate sound localization cues | Mar 7, 2022 | Sound Source Localization | CodeCode Available | 1 | 5 |
| Novel-View Acoustic Synthesis from 3D Reconstructed Rooms | Oct 23, 2023 | 3D geometrySound Source Localization | CodeCode Available | 1 | 5 |
| Learning to Visually Localize Sound Sources from Mixtures without Prior Source Knowledge | Mar 26, 2024 | ObjectSound Source Localization | CodeCode Available | 1 | 5 |
| Self-Supervised Predictive Learning: A Negative-Free Method for Sound Source Localization in Visual Scenes | Mar 25, 2022 | Contrastive LearningSound Source Localization | CodeCode Available | 1 | 5 |
| Differentiable Tracking-Based Training of Deep Learning Sound Source Localizers | Oct 29, 2021 | ClassificationDeep Learning | CodeCode Available | 1 | 5 |
| Hear The Flow: Optical Flow-Based Self-Supervised Visual Sound Source Localization | Nov 6, 2022 | Optical Flow EstimationSound Source Localization | CodeCode Available | 1 | 5 |
| Visual Sound Localization in the Wild by Cross-Modal Interference Erasing | Feb 13, 2022 | Sound Source Localization | CodeCode Available | 1 | 5 |
| Hearing and Seeing Through CLIP: A Framework for Self-Supervised Sound Source Localization | May 8, 2025 | Scene UnderstandingSound Source Localization | CodeCode Available | 1 | 5 |
| Aligning Sight and Sound: Advanced Sound Source Localization Through Audio-Visual Alignment | Jul 18, 2024 | cross-modal alignmentCross-Modal Retrieval | CodeCode Available | 1 | 5 |
| Audio-Visual Instance Segmentation | Oct 28, 2023 | Instance SegmentationSegmentation | CodeCode Available | 1 | 5 |
| Audio-Visual Grouping Network for Sound Localization from Mixtures | Mar 29, 2023 | Object LocalizationSound Source Localization | CodeCode Available | 1 | 5 |
| Deep Neural Networks for Multiple Speaker Detection and Localization | Nov 30, 2017 | Sound Source Localization | CodeCode Available | 0 | 5 |
| Audio-Visual Scene Analysis with Self-Supervised Multisensory Features | Apr 10, 2018 | Action RecognitionAudio Source Separation | CodeCode Available | 0 | 5 |
| FlowGrad: Using Motion for Visual Sound Source Localization | Nov 15, 2022 | Optical Flow EstimationScene Understanding | CodeCode Available | 0 | 5 |
| SemiPL: A Semi-supervised Method for Event Sound Source Localization | Apr 30, 2024 | Contrastive LearningManagement | CodeCode Available | 0 | 5 |
| The LOCATA Challenge: Acoustic Source Localization and Tracking | Sep 3, 2019 | BenchmarkingSound Source Localization | CodeCode Available | 0 | 5 |
| Eliminating Quantization Errors in Classification-Based Sound Source Localization | Nov 21, 2023 | ClassificationQuantization | CodeCode Available | 0 | 5 |
| A Critical Assessment of Visual Sound Source Localization Models Including Negative Audio | Oct 1, 2024 | Scene UnderstandingSound Source Localization | CodeCode Available | 0 | 5 |
| T-VSL: Text-Guided Visual Sound Source Localization in Mixtures | Apr 2, 2024 | Sound Source Localization | CodeCode Available | 0 | 5 |
| Learning to Localize Sound Sources in Visual Scenes: Analysis and Applications | Nov 20, 2019 | Sound Source Localization | CodeCode Available | 0 | 5 |
| Iterative Sound Source Localization for Unknown Number of Sources | Jun 24, 2022 | Sound Source Localization | CodeCode Available | 0 | 5 |
| DOANet: a deep dilated convolutional neural network approach for search and rescue with drone-embedded sound source localization | Nov 5, 2020 | Sound Source Localization | CodeCode Available | 0 | 5 |
| Direction of Arrival with One Microphone, a few LEGOs, and Non-Negative Matrix Factorization | Aug 28, 2018 | Sound Source Localization | CodeCode Available | 0 | 5 |
| Induction Network: Audio-Visual Modality Gap-Bridging for Self-Supervised Sound Source Localization | Aug 9, 2023 | Contrastive LearningSound Source Localization | CodeCode Available | 0 | 5 |
| Object-aware Sound Source Localization via Audio-Visual Scene Understanding | Jan 1, 2025 | Scene UnderstandingSound Source Localization | CodeCode Available | 0 | 5 |
| DiffusionRIR: Room Impulse Response Interpolation using Diffusion Models | Apr 29, 2025 | Audio Signal ProcessingData Augmentation | —Unverified | 0 | 0 |
| Deep Learning Based Audio-Visual Multi-Speaker DOA Estimation Using Permutation-Free Loss Function | Oct 26, 2022 | Active Speaker DetectionSound Source Localization | —Unverified | 0 | 0 |
| A Proposal-Based Paradigm for Self-Supervised Sound Source Localization in Videos | Jan 1, 2022 | Multiple Instance LearningSound Source Localization | —Unverified | 0 | 0 |
| Data-Efficient Framework for Real-world Multiple Sound Source 2D Localization | Dec 10, 2020 | Sound Source Localization | —Unverified | 0 | 0 |
| Data-driven 3D Room Geometry Inference with a Linear Loudspeaker Array and a Single Microphone | Aug 28, 2023 | Sound Source Localization | —Unverified | 0 | 0 |
| Graph-Enhanced Dual-Stream Feature Fusion with Pre-Trained Model for Acoustic Traffic Monitoring | Dec 26, 2024 | Graph AttentionSound Source Localization | —Unverified | 0 | 0 |
| Gaussian Process Models for HRTF based Sound-Source Localization and Active-Learning | Feb 11, 2015 | Active Learningregression | —Unverified | 0 | 0 |
| Contrastive Self-Supervised Learning of Global-Local Audio-Visual Representations | Jan 1, 2021 | ClassificationDeepFake Detection | —Unverified | 0 | 0 |
| Where's That Voice Coming? Continual Learning for Sound Source Localization | Jul 4, 2024 | Continual LearningExemplar-Free | —Unverified | 0 | 0 |
| AcousticFusion: Fusing Sound Source Localization to Visual SLAM in Dynamic Environments | Aug 3, 2021 | Depth EstimationObject | —Unverified | 0 | 0 |