SOTAVerified

Sound Event Localization and Detection

Given multichannel audio input, a sound event detection and localization (SELD) system outputs a temporal activation track for each of the target sound classes, along with one or more corresponding spatial trajectories when the track indicates activity. This results in a spatio-temporal characterization of the acoustic scene that can be used in a wide range of machine cognition tasks, such as inference on the type of environment, self-localization, navigation without visual input or with occluded targets, tracking of specific types of sound sources, smart-home applications, scene visualization systems, and audio surveillance, among others.

Papers

Showing 5165 of 65 papers

TitleStatusHype
SALSA: Spatial Cue-Augmented Log-Spectrogram Features for Polyphonic Sound Event Localization and DetectionCode1
TASK3 DCASE2021 Challenge: Sound event localization and detection using squeeze-excitation residual CNNs0
What Makes Sound Event Localization and Detection Difficult? Insights from Error AnalysisCode1
DCASE 2021 Task 3: Spectrotemporally-aligned Features for Polyphonic Sound Event Localization and DetectionCode1
Mobile Microphone Array Speech Detection and Localization in Diverse Everyday Environments0
Ensemble of ACCDOA- and EINV2-based Systems with D3Nets and Impulse Response Simulation for Sound Event Localization and Detection0
A Dataset of Dynamic Reverberant Sound Scenes with Directional Interferers for Sound Event Localization and DetectionCode1
ACCDOA: Activity-Coupled Cartesian Direction of Arrival Representation for Sound Event Localization and DetectionCode1
Overview and Evaluation of Sound Event Localization and Detection in DCASE 2019Code1
A Dataset of Reverberant Spatial Sound Scenes with Moving Sources for Sound Event Localization and DetectionCode1
SELD-TCN: Sound Event Localization & Detection via Temporal Convolutional NetworksCode1
Sound Event Localization based on Sound Intensity Vector Refined By DNN-Based Denoising and Source Separation0
A Sequence Matching Network for Polyphonic Sound Event Localization and Detection0
A hybrid parametric-deep learning approach for sound event localization and detectionCode0
Sound source detection, localization and classification using consecutive ensemble of CRNN models0
Show:102550
← PrevPage 3 of 3Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1AVC-FillerNetevent-based F1 score92.8Unverified
2VC-FillerNetevent-based F1 score71Unverified
#ModelMetricClaimedVerifiedStatus
1Baseline (MIC)Class-dependent localization error32.2Unverified
2Baseline (FOA)Class-dependent localization error29.3Unverified
#ModelMetricClaimedVerifiedStatus
1DualQSELD-TCN (parallel)SELD score0.32Unverified
#ModelMetricClaimedVerifiedStatus
1STL-SNNaccuracy98.4Unverified
#ModelMetricClaimedVerifiedStatus
1SALSA-FOAER≤20°0.38Unverified