SOTAVerified

Sound Source Localization

Papers

Showing 1-50 of 104 papers

Title | Status | Hype
IPDnet: A Universal Direct-Path IPD Estimation Network for Sound Source Localization | Code | 2
Sound2Vision: Generating Diverse Visuals from Audio through Cross-Modal Latent Alignment | Code | 1
ODAS: Open embeddeD Audition System | Code | 1
Audio-Visual Spatial Integration and Recursive Attention for Robust Sound Source Localization | Code | 1
Speaker Distance Estimation in Enclosures from Single-Channel Audio | Code | 1
Can CLIP Help Sound Source Localization? | Code | 1
Resilient Multiple Choice Learning: A learned scoring scheme with application to audio scene analysis | Code | 1
Localize to Binauralize: Audio Spatialization From Visual Sound Source Localization | Code | 1
Dual input neural networks for positional sound source localization | Code | 1
A Closer Look at Weakly-Supervised Audio-Visual Source Localization | Code | 1
wav2pos: Sound Source Localization using Masked Autoencoders | Code | 1
FN-SSL: Full-Band and Narrow-Band Fusion for Sound Source Localization | Code | 1
Enhancing Sound Source Localization via False Negative Elimination | Code | 1
Chaotic World: A Large and Challenging Benchmark for Human Behavior Understanding in Chaotic Events | Code | 1
Joint Learning of Visual-Audio Saliency Prediction and Sound Source Localization on Multi-face Videos | Code | 1
HRTF measurement for accurate sound localization cues | Code | 1
Novel-View Acoustic Synthesis from 3D Reconstructed Rooms | Code | 1
Learning to Visually Localize Sound Sources from Mixtures without Prior Source Knowledge | Code | 1
Self-Supervised Predictive Learning: A Negative-Free Method for Sound Source Localization in Visual Scenes | Code | 1
Differentiable Tracking-Based Training of Deep Learning Sound Source Localizers | Code | 1
Hear The Flow: Optical Flow-Based Self-Supervised Visual Sound Source Localization | Code | 1
Visual Sound Localization in the Wild by Cross-Modal Interference Erasing | Code | 1
Hearing and Seeing Through CLIP: A Framework for Self-Supervised Sound Source Localization | Code | 1
Aligning Sight and Sound: Advanced Sound Source Localization Through Audio-Visual Alignment | Code | 1
Audio-Visual Instance Segmentation | Code | 1
Audio-Visual Grouping Network for Sound Localization from Mixtures | Code | 1
Deep Neural Networks for Multiple Speaker Detection and Localization | Code | 0
Audio-Visual Scene Analysis with Self-Supervised Multisensory Features | Code | 0
FlowGrad: Using Motion for Visual Sound Source Localization | Code | 0
SemiPL: A Semi-supervised Method for Event Sound Source Localization | Code | 0
The LOCATA Challenge: Acoustic Source Localization and Tracking | Code | 0
Eliminating Quantization Errors in Classification-Based Sound Source Localization | Code | 0
A Critical Assessment of Visual Sound Source Localization Models Including Negative Audio | Code | 0
T-VSL: Text-Guided Visual Sound Source Localization in Mixtures | Code | 0
Learning to Localize Sound Sources in Visual Scenes: Analysis and Applications | Code | 0
Iterative Sound Source Localization for Unknown Number of Sources | Code | 0
DOANet: a deep dilated convolutional neural network approach for search and rescue with drone-embedded sound source localization | Code | 0
Direction of Arrival with One Microphone, a few LEGOs, and Non-Negative Matrix Factorization | Code | 0
Induction Network: Audio-Visual Modality Gap-Bridging for Self-Supervised Sound Source Localization | Code | 0
Object-aware Sound Source Localization via Audio-Visual Scene Understanding | Code | 0
DiffusionRIR: Room Impulse Response Interpolation using Diffusion Models |  | 0
Deep Learning Based Audio-Visual Multi-Speaker DOA Estimation Using Permutation-Free Loss Function |  | 0
A Proposal-Based Paradigm for Self-Supervised Sound Source Localization in Videos |  | 0
Data-Efficient Framework for Real-world Multiple Sound Source 2D Localization |  | 0
Data-driven 3D Room Geometry Inference with a Linear Loudspeaker Array and a Single Microphone |  | 0
Graph-Enhanced Dual-Stream Feature Fusion with Pre-Trained Model for Acoustic Traffic Monitoring |  | 0
Gaussian Process Models for HRTF based Sound-Source Localization and Active-Learning |  | 0
Contrastive Self-Supervised Learning of Global-Local Audio-Visual Representations |  | 0
Where's That Voice Coming? Continual Learning for Sound Source Localization |  | 0
AcousticFusion: Fusing Sound Source Localization to Visual SLAM in Dynamic Environments |  | 0
Page 1 of 3

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | YOLO | 0..5sec | 21 |  | Unverified