Sound Event Localization and Detection

Given multichannel audio input, a sound event localization and detection (SELD) system outputs a temporal activation track for each target sound class, along with one or more corresponding spatial trajectories whenever the track indicates activity. The result is a spatio-temporal characterization of the acoustic scene that can serve a wide range of machine cognition tasks, such as inferring the type of environment, self-localization, navigation without visual input or with occluded targets, tracking specific types of sound sources, smart-home applications, scene visualization, and audio surveillance.
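The output described above can be pictured as a per-class activity track paired with a per-frame direction. The following is a minimal sketch of that representation, not any particular system's format: the `SoundEvent` class, the `tracks_to_events` helper, the frame-based indexing, and the (azimuth, elevation) convention in degrees are all assumptions made for illustration. It also assumes at most one active source per class at a time, whereas a real system may emit several trajectories per class.

```python
from dataclasses import dataclass
from typing import List, Sequence, Tuple

Direction = Tuple[float, float]  # assumed convention: (azimuth, elevation) in degrees


@dataclass
class SoundEvent:
    """Hypothetical container for one detected event and its spatial trajectory."""
    class_index: int
    onset_frame: int    # first active frame (inclusive)
    offset_frame: int   # first inactive frame after the event (exclusive)
    trajectory: List[Direction]  # one direction per active frame


def tracks_to_events(
    activity: Sequence[Sequence[int]],
    directions: Sequence[Sequence[Direction]],
) -> List[SoundEvent]:
    """Group contiguous active frames of each class into events.

    activity[c][t] is 1 when class c is active in frame t, else 0.
    directions[c][t] is the estimated direction of class c in frame t
    (ignored for inactive frames).
    """
    events = []
    for c, track in enumerate(activity):
        onset = None
        # A trailing 0 acts as a sentinel so an event running to the end is closed.
        for t, active in enumerate(list(track) + [0]):
            if active and onset is None:
                onset = t
            elif not active and onset is not None:
                events.append(SoundEvent(c, onset, t, list(directions[c][onset:t])))
                onset = None
    return events


# Toy scene: 2 classes, 4 frames.
activity = [
    [0, 1, 1, 0],
    [1, 1, 0, 1],
]
directions = [
    [(0.0, 0.0), (30.0, 0.0), (35.0, 0.0), (0.0, 0.0)],
    [(-90.0, 10.0), (-85.0, 10.0), (0.0, 0.0), (120.0, -5.0)],
]
events = tracks_to_events(activity, directions)
for event in events:
    print(event)
```

In this toy scene, class 0 yields one event and class 1 yields two, each carrying the directions observed during its active frames.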

Papers


Showing 51–65 of 65 papers

Title	Date	Tasks	Status
Learning Spatially-Aware Language and Audio Embeddings	Sep 17, 2024	Attribute, Contrastive Learning	Unverified
Leveraging Reverberation and Visual Depth Cues for Sound Event Localization and Detection with Distance Estimation	Oct 29, 2024	Sound Event Localization and Detection	Unverified
6DoF SELD: Sound Event Localization and Detection Using Microphones and Motion Tracking Sensors on self-motioning human	Mar 4, 2024	Sound Event Localization and Detection	Unverified
META-SELD: Meta-Learning for Fast Adaptation to the new environment in Sound Event Localization and Detection	Aug 17, 2023	Meta-Learning, Sound Event Localization and Detection	Unverified
Mobile Microphone Array Speech Detection and Localization in Diverse Everyday Environments	Jun 28, 2021	Sound Event Localization and Detection	Unverified
Overview of the L3DAS23 Challenge on Audio-Visual Extended Reality	Feb 14, 2024	Audio Signal Processing, Sound Event Localization and Detection	Unverified
SELD-Mamba: Selective State-Space Model for Sound Event Localization and Detection with Source Distance Estimation	Aug 9, 2024	Computational Efficiency, Event Detection	Unverified
Sound Event Localization based on Sound Intensity Vector Refined By DNN-Based Denoising and Source Separation	Feb 14, 2020	Denoising, Sound Event Localization and Detection	Unverified
Sound source detection, localization and classification using consecutive ensemble of CRNN models	Aug 2, 2019	General Classification, Multi-Label Classification	Unverified
Spatial and Semantic Embedding Integration for Stereo Sound Event Localization and Detection in Regular Videos	Jul 7, 2025	Sound Event Localization and Detection	Unverified
Squeeze-and-Excite ResNet-Conformers for Sound Event Localization, Detection, and Distance Estimation for DCASE 2024 Challenge	Jul 12, 2024	Sound Event Localization and Detection	Unverified
Stereo sound event localization and detection based on PSELDnet pretraining and BiMamba sequence modeling	Jun 16, 2025	Decoder, Mamba	Unverified
SwG-former: A Sliding-Window Graph Convolutional Network for Simultaneous Spatial-Temporal Information Extraction in Sound Event Localization and Detection	Oct 21, 2023	Event Detection, Sound Event Detection	Unverified
TASK3 DCASE2021 Challenge: Sound event localization and detection using squeeze-excitation residual CNNs	Jul 30, 2021	Direction of Arrival Estimation, Sound Event Localization and Detection	Unverified
Text-Queried Target Sound Event Localization	Jun 23, 2024	Room Impulse Response (RIR), Sound Event Localization and Detection	Unverified


Datasets: PodcastFillers, STARSS22, L3DAS21, RWCP Sound Scene Database, TAU-NIGENS Spatial Sound Events 2021

Benchmark Results

Dataset: PodcastFillers

#	Model	Metric	Claimed	Verified	Status
1	AVC-FillerNet	event-based F1 score	92.8	—	Unverified
2	VC-FillerNet	event-based F1 score	71	—	Unverified

Dataset: STARSS22

#	Model	Metric	Claimed	Verified	Status
1	Baseline (MIC)	Class-dependent localization error	32.2	—	Unverified
2	Baseline (FOA)	Class-dependent localization error	29.3	—	Unverified

Dataset: L3DAS21

#	Model	Metric	Claimed	Verified	Status
1	DualQSELD-TCN (parallel)	SELD score	0.32	—	Unverified

Dataset: RWCP Sound Scene Database

#	Model	Metric	Claimed	Verified	Status
1	STL-SNN	accuracy	98.4	—	Unverified

Dataset: TAU-NIGENS Spatial Sound Events 2021

#	Model	Metric	Claimed	Verified	Status
1	SALSA-FOA	ER≤20°	0.38	—	Unverified
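
Two of the metrics listed above, class-dependent localization error and ER≤20°, rest on the angular distance between a reference and an estimated direction of arrival. In ER≤20°, as commonly used in DCASE-style evaluation, a detection counts as correct only when that distance is at most 20°. The snippet below is a minimal sketch of the angular part only, assuming directions are given as (azimuth, elevation) in degrees. The function names are hypothetical, and this is not the official evaluation code, which additionally handles class matching and the association of estimates to references.

```python
import math
from typing import Tuple

Direction = Tuple[float, float]  # assumed convention: (azimuth, elevation) in degrees


def unit_vector(azimuth_deg: float, elevation_deg: float) -> Tuple[float, float, float]:
    """Convert azimuth/elevation in degrees to a Cartesian unit vector."""
    az, el = math.radians(azimuth_deg), math.radians(elevation_deg)
    return (math.cos(el) * math.cos(az), math.cos(el) * math.sin(az), math.sin(el))


def angular_error_deg(reference: Direction, estimate: Direction) -> float:
    """Great-circle angle in degrees between two directions of arrival."""
    u, v = unit_vector(*reference), unit_vector(*estimate)
    dot = sum(a * b for a, b in zip(u, v))
    # Clamp to [-1, 1] to guard against floating-point rounding before acos.
    return math.degrees(math.acos(max(-1.0, min(1.0, dot))))


def within_threshold(reference: Direction, estimate: Direction,
                     threshold_deg: float = 20.0) -> bool:
    """True when the estimate lies within the spatial threshold of the reference."""
    return angular_error_deg(reference, estimate) <= threshold_deg


print(angular_error_deg((0.0, 0.0), (90.0, 0.0)))
print(within_threshold((0.0, 0.0), (10.0, 0.0)))
print(within_threshold((0.0, 0.0), (30.0, 0.0)))
```

Averaging `angular_error_deg` over matched detections of each class, then over classes, gives a quantity in the spirit of the class-dependent localization error reported in degrees.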