SOTAVerified

Sound Event Detection

Sound Event Detection (SED) is the task of recognizing the sound events and their respective temporal start and end time in a recording. Sound events in real life do not always occur in isolation, but tend to considerably overlap with each other. Recognizing such overlapping sound events is referred as polyphonic SED.

Source: A report on sound event detection with different binaural features

Papers

Showing 150 of 194 papers

TitleStatusHype
Frequency Dynamic Convolutions for Sound Event Detection0
Hybrid Disagreement-Diversity Active Learning for Bioacoustic Sound Event DetectionCode0
Exploring the Potential of SSL Models for Sound Event Detection0
Temporal Attention Pooling for Frequency Dynamic Convolution in Sound Event DetectionCode0
Formula-Supervised Sound Event Detection: Pre-Training Without Real Data0
Exploring Performance-Complexity Trade-Offs in Sound Event Detection ModelsCode1
Aggregation Strategies for Efficient Annotation of Bioacoustic Sound Events Using Active Learning0
Robust detection of overlapping bioacoustic sound events0
Synthetic data enables context-aware bioacoustic sound event detection0
JiTTER: Jigsaw Temporal Transformer for Event Reconstruction for Self-Supervised Sound Event DetectionCode0
Towards Understanding of Frequency Dependence on Sound Event Detection0
An Experimental Study on Joint Modeling for Sound Event Localization and Detection with Source Distance Estimation0
Pseudo Strong Labels from Frame-Level Predictions for Weakly Supervised Sound Event Detection0
Leveraging LLM and Text-Queried Separation for Noise-Robust Sound Event DetectionCode1
Prototype based Masked Audio Model for Self-Supervised Learning of Sound Event DetectionCode2
Exploring Text-Queried Sound Event Detection with Audio Source SeparationCode1
The Sounds of Home: A Speech-Removed Residential Audio Dataset for Sound Event DetectionCode0
Effective Pre-Training of Audio Transformers for Sound Event DetectionCode1
Unified Audio Event Detection0
Energy Consumption Trends in Sound Event Detection Systems0
MTDA-HSED: Mutual-Assistance Tuning and Dual-Branch Aggregating for Heterogeneous Sound Event DetectionCode0
From Computation to Consumption: Exploring the Compute-Energy Link for Training and Testing Neural Networks for SED Systems0
Impact of Noisy Labels on Sound Event Detection: Deletion Errors Are More Detrimental Than Insertion Errors0
MAT-SED: A Masked Audio Transformer with Masked-Reconstruction Based Pre-training for Sound Event DetectionCode2
SELD-Mamba: Selective State-Space Model for Sound Event Localization and Detection with Source Distance Estimation0
Improving Audio Spectrogram Transformers for Sound Event Detection Through Multi-Stage TrainingCode1
Multi-Iteration Multi-Stage Fine-Tuning of Transformers for Sound Event Detection with Heterogeneous DatasetsCode1
UCIL: An Unsupervised Class Incremental Learning Approach for Sound Event Detection0
Mixstyle based Domain Generalization for Sound Event Detection with Heterogeneous Training Data0
WildDESED: An LLM-Powered Dataset for Wild Domestic Environment Sound Event Detection SystemCode1
FMSG-JLESS Submission for DCASE 2024 Task4 on Sound Event Detection with Heterogeneous Training Dataset and Potentially Missing Labels0
Self Training and Ensembling Frequency Dependent Networks with Coarse Prediction Pooling and Sound Event Bounding BoxesCode1
Automated Bioacoustic Monitoring for South African Bird Species on Unlabeled Data0
Pushing the Limit of Sound Event Detection with Multi-Dilated Frequency Dynamic ConvolutionCode1
Sound event detection based on auxiliary decoder and maximum probability aggregation for DCASE Challenge 2024 Task 40
DCASE 2024 Task 4: Sound Event Detection with Heterogeneous Data and Missing Labels0
Diversifying and Expanding Frequency-Adaptive Convolution Kernels for Sound Event DetectionCode1
Sound Event Bounding BoxesCode1
Revisiting Deep Audio-Text Retrieval Through the Lens of TransportationCode1
UniAV: Unified Audio-Visual Perception for Multi-Task Video Event LocalizationCode1
Mind the Domain Gap: a Systematic Analysis on Bioacoustic Sound Event DetectionCode2
Sound Event Detection and Localization with Distance Estimation0
Multitask frame-level learning for few-shot sound event detection0
Fine-Grained Engine Fault Sound Event Detection Using Multimodal Signals0
Dual Knowledge Distillation for Efficient Sound Event Detection0
BAT: Learning to Reason about Spatial Sounds with Large Language Models0
Full-frequency dynamic convolution: a physical frequency-dependent convolution for sound event detectionCode1
Fusion of Audio and Visual Embeddings for Sound Event Localization and DetectionCode1
w2v-SELD: A Sound Event Localization and Detection Framework for Self-Supervised Spatial Audio Pre-TrainingCode1
tinyCLAP: Distilling Constrastive Language-Audio Pretrained Models0
Show:102550
← PrevPage 1 of 4Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ATST-SEDevent-based F1 score63.4Unverified
2SE-CRNN-16 with DualKDevent-based F1 score55.6Unverified
3FDY-CRNNevent-based F1 score54Unverified
4HTS-ATevent-based F1 score50.7Unverified
5RCTevent-based F1 score49.62Unverified
6FiltAug SEDevent-based F1 score49.6Unverified
7SED-SSep baseline dcase task 4 2020 v2event-based F1 score40.7Unverified
8Baseline dcase task 4 2020 v2event-based F1 score39Unverified
9Baselineevent-based F1 score25.8Unverified
10MAT-SEDPSDS10.59Unverified
#ModelMetricClaimedVerifiedStatus
1PHC SEDnet n=8Error Rate0.56Unverified
2Quaternion SEDnetError Rate0.52Unverified
3PHC SEDnet n=16Error Rate0.51Unverified
4PHC SEDnet n=4Error Rate0.45Unverified
5PHC SEDnet n=2Error Rate0.39Unverified
#ModelMetricClaimedVerifiedStatus
1CRNN (with BEATs + Separation)PSDS1 (-5dB)0.13Unverified
2CRNN (with BEATs)PSDS1 (-5dB)0.07Unverified
3CRNN (WildDESED + Curriculrm learning)PSDS1 (-5dB)0.05Unverified
4CRNN (WildDESED)PSDS1 (-5dB)0.05Unverified
5CRNNPSDS1 (-5dB)0.02Unverified
#ModelMetricClaimedVerifiedStatus
1DENetRank-1 Recognition Rate0.98Unverified
#ModelMetricClaimedVerifiedStatus
1DENetRank-1 Recognition Rate1Unverified