SOTAVerified

Sound Event Localization and Detection

Given multichannel audio input, a sound event localization and detection (SELD) system outputs a temporal activation track for each target sound class, along with one or more corresponding spatial trajectories whenever the track indicates activity. The result is a spatio-temporal characterization of the acoustic scene that can be used in a wide range of machine cognition tasks, including inference of the environment type, self-localization, navigation without visual input or with occluded targets, tracking of specific types of sound sources, smart-home applications, scene visualization systems, and audio surveillance.
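As a rough illustration of the output described above, the following minimal Python sketch stores one activation track and one direction-of-arrival trajectory per class. The names `ClassTrack`, `to_unit_vector`, and `trajectory` are hypothetical, and the frame-wise azimuth/elevation layout with a single trajectory per class is an assumption made for brevity, not the format of any particular SELD system.

```python
import math
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass
class ClassTrack:
    """Hypothetical container: per-frame activity and direction for one class."""
    label: str
    active: List[bool]                               # one flag per time frame
    doa_deg: List[Optional[Tuple[float, float]]]     # (azimuth, elevation), None if inactive


def to_unit_vector(azimuth_deg: float, elevation_deg: float) -> Tuple[float, float, float]:
    """Convert azimuth/elevation in degrees to a Cartesian unit vector (x, y, z)."""
    az = math.radians(azimuth_deg)
    el = math.radians(elevation_deg)
    return (math.cos(el) * math.cos(az), math.cos(el) * math.sin(az), math.sin(el))


def trajectory(track: ClassTrack) -> List[Tuple[int, Tuple[float, float, float]]]:
    """Return (frame_index, unit_vector) pairs for the frames where the class is active."""
    return [
        (i, to_unit_vector(*doa))
        for i, (is_active, doa) in enumerate(zip(track.active, track.doa_deg))
        if is_active and doa is not None
    ]


# Example: a class that is silent in frame 0 and active in frames 1 and 2.
example = ClassTrack("dog_bark", [False, True, True], [None, (0.0, 0.0), (90.0, 0.0)])
example_trajectory = trajectory(example)
```

A system that tracks several simultaneous sources of the same class would need a list of such trajectories per class rather than a single one.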

Papers

Showing 21-30 of 65 papers

Title | Status | Hype
ACCDOA: Activity-Coupled Cartesian Direction of Arrival Representation for Sound Event Localization and Detection | Code | 1
Overview and Evaluation of Sound Event Localization and Detection in DCASE 2019 | Code | 1
A Dataset of Reverberant Spatial Sound Scenes with Moving Sources for Sound Event Localization and Detection | Code | 1
SELD-TCN: Sound Event Localization & Detection via Temporal Convolutional Networks | Code | 1
Spatial and Semantic Embedding Integration for Stereo Sound Event Localization and Detection in Regular Videos | - | 0
Stereo sound event localization and detection based on PSELDnet pretraining and BiMamba sequence modeling | - | 0
CST-former: Multidimensional Attention-based Transformer for Sound Event Localization and Detection in Real Scenes | - | 0
Reverberation-based Features for Sound Event Localization and Detection with Distance Estimation | Code | 0
An Experimental Study on Joint Modeling for Sound Event Localization and Detection with Source Distance Estimation | - | 0
MVANet: Multi-Stage Video Attention Network for Sound Event Localization and Detection with Source Distance Estimation | Code | 0
Page 3 of 7

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | AVC-FillerNet | event-based F1 score | 92.8 | - | Unverified
2 | VC-FillerNet | event-based F1 score | 71 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Baseline (MIC) | Class-dependent localization error | 32.2 | - | Unverified
2 | Baseline (FOA) | Class-dependent localization error | 29.3 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | DualQSELD-TCN (parallel) | SELD score | 0.32 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | STL-SNN | accuracy | 98.4 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | SALSA-FOA | ER≤20° | 0.38 | - | Unverified