Sound Event Detection

Sound Event Detection (SED) is the task of recognizing the sound events in a recording together with their start and end times. Sound events in real life do not always occur in isolation; they tend to overlap considerably. Recognizing such overlapping sound events is referred to as polyphonic SED.

Source: A report on sound event detection with different binaural features
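
To make the task definition concrete, here is a minimal sketch of how a polyphonic SED output might be turned into events with start and end times. It assumes a model has already produced per-class, per-frame probabilities; the function name `frames_to_events`, the 0.5 threshold, and the hop size are illustrative assumptions, not taken from any paper listed here. Because each class is decoded independently, overlapping events fall out naturally.

```python
from typing import Dict, List, Tuple

Event = Tuple[str, float, float]  # (label, onset_seconds, offset_seconds)


def frames_to_events(
    probs: Dict[str, List[float]],
    hop_seconds: float = 0.02,   # assumed frame hop, hypothetical default
    threshold: float = 0.5,      # assumed decision threshold
) -> List[Event]:
    """Turn per-class frame probabilities into (label, onset, offset) events."""
    events: List[Event] = []
    for label, frame_probs in probs.items():
        start = None  # index of the first active frame in the current run
        for i, p in enumerate(frame_probs):
            active = p >= threshold
            if active and start is None:
                start = i  # a new event begins at this frame
            elif not active and start is not None:
                events.append((label, start * hop_seconds, i * hop_seconds))
                start = None
        if start is not None:  # the event runs to the end of the clip
            events.append((label, start * hop_seconds, len(frame_probs) * hop_seconds))
    # Sort by onset time, then label, so overlapping events appear in order.
    return sorted(events, key=lambda e: (e[1], e[0]))


# Two classes whose active regions overlap between 2 s and 3 s.
demo = {
    "speech": [0.1, 0.9, 0.8, 0.2, 0.1],
    "dog":    [0.0, 0.0, 0.7, 0.9, 0.6],
}
print(frames_to_events(demo, hop_seconds=1.0))
# [('speech', 1.0, 3.0), ('dog', 2.0, 5.0)]
```

Real systems usually smooth the probabilities (for example with a median filter) before thresholding; that step is left out here to keep the sketch short.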

Papers


Showing 1–50 of 194 papers

Title	Date	Tasks	Status	Hype
Frequency Dynamic Convolutions for Sound Event Detection	Jun 15, 2025	ARC, Event Detection	Unverified	0
Hybrid Disagreement-Diversity Active Learning for Bioacoustic Sound Event Detection	May 27, 2025	Active Learning, Diversity	Code Available	0
Exploring the Potential of SSL Models for Sound Event Detection	May 17, 2025	Event Detection, Model Selection	Unverified	0
Temporal Attention Pooling for Frequency Dynamic Convolution in Sound Event Detection	Apr 17, 2025	Event Detection, Sound Event Detection	Code Available	0
Formula-Supervised Sound Event Detection: Pre-Training Without Real Data	Apr 6, 2025	Event Detection, Sound Event Detection	Unverified	0
Exploring Performance-Complexity Trade-Offs in Sound Event Detection Models	Mar 14, 2025	Audio Tagging, Event Detection	Code Available	1
Aggregation Strategies for Efficient Annotation of Bioacoustic Sound Events Using Active Learning	Mar 4, 2025	Active Learning, Event Detection	Unverified	0
Robust detection of overlapping bioacoustic sound events	Mar 4, 2025	Event Detection, Graph Matching	Unverified	0
Synthetic data enables context-aware bioacoustic sound event detection	Mar 1, 2025	Event Detection, In-Context Learning	Unverified	0
JiTTER: Jigsaw Temporal Transformer for Event Reconstruction for Self-Supervised Sound Event Detection	Feb 28, 2025	Boundary Detection, Event Detection	Code Available	0
Towards Understanding of Frequency Dependence on Sound Event Detection	Feb 11, 2025	Data Augmentation, Event Detection	Unverified	0
An Experimental Study on Joint Modeling for Sound Event Localization and Detection with Source Distance Estimation	Jan 18, 2025	Event Detection, Sound Event Detection	Unverified	0
Pseudo Strong Labels from Frame-Level Predictions for Weakly Supervised Sound Event Detection	Jan 7, 2025	Event Detection, Sound Event Detection	Unverified	0
Leveraging LLM and Text-Queried Separation for Noise-Robust Sound Event Detection	Nov 2, 2024	Audio Source Separation, Event Detection	Code Available	1
Prototype based Masked Audio Model for Self-Supervised Learning of Sound Event Detection	Sep 26, 2024	Event Detection, Representation Learning	Code Available	2
Exploring Text-Queried Sound Event Detection with Audio Source Separation	Sep 20, 2024	Audio Source Separation, Event Detection	Code Available	1
The Sounds of Home: A Speech-Removed Residential Audio Dataset for Sound Event Detection	Sep 17, 2024	Benchmarking, Event Detection	Code Available	0
Effective Pre-Training of Audio Transformers for Sound Event Detection	Sep 14, 2024	Data Augmentation, Event Detection	Code Available	1
Unified Audio Event Detection	Sep 13, 2024	Event Detection, Sound Event Detection	Unverified	0
Energy Consumption Trends in Sound Event Detection Systems	Sep 13, 2024	Event Detection, Sound Event Detection	Unverified	0
MTDA-HSED: Mutual-Assistance Tuning and Dual-Branch Aggregating for Heterogeneous Sound Event Detection	Sep 10, 2024	Event Detection, Sound Event Detection	Code Available	0
From Computation to Consumption: Exploring the Compute-Energy Link for Training and Testing Neural Networks for SED Systems	Sep 8, 2024	Audio Tagging, Event Detection	Unverified	0
Impact of Noisy Labels on Sound Event Detection: Deletion Errors Are More Detrimental Than Insertion Errors	Aug 27, 2024	Event Detection, Sound Event Detection	Unverified	0
MAT-SED: A Masked Audio Transformer with Masked-Reconstruction Based Pre-training for Sound Event Detection	Aug 16, 2024	Event Detection, Sound Event Detection	Code Available	2
SELD-Mamba: Selective State-Space Model for Sound Event Localization and Detection with Source Distance Estimation	Aug 9, 2024	Computational Efficiency, Event Detection	Unverified	0
Improving Audio Spectrogram Transformers for Sound Event Detection Through Multi-Stage Training	Jul 17, 2024	Event Detection, Missing Labels	Code Available	1
Multi-Iteration Multi-Stage Fine-Tuning of Transformers for Sound Event Detection with Heterogeneous Datasets	Jul 17, 2024	Event Detection, Sound Event Detection	Code Available	1
UCIL: An Unsupervised Class Incremental Learning Approach for Sound Event Detection	Jul 4, 2024	class-incremental learning, Class Incremental Learning	Unverified	0
Mixstyle based Domain Generalization for Sound Event Detection with Heterogeneous Training Data	Jul 4, 2024	Domain Generalization, Event Detection	Unverified	0
WildDESED: An LLM-Powered Dataset for Wild Domestic Environment Sound Event Detection System	Jul 4, 2024	Event Detection, Language Modeling	Code Available	1
FMSG-JLESS Submission for DCASE 2024 Task4 on Sound Event Detection with Heterogeneous Training Dataset and Potentially Missing Labels	Jun 29, 2024	Domain Generalization, Event Detection	Unverified	0
Self Training and Ensembling Frequency Dependent Networks with Coarse Prediction Pooling and Sound Event Bounding Boxes	Jun 22, 2024	Change Detection, Data Augmentation	Code Available	1
Automated Bioacoustic Monitoring for South African Bird Species on Unlabeled Data	Jun 19, 2024	Event Detection, Sound Event Detection	Unverified	0
Pushing the Limit of Sound Event Detection with Multi-Dilated Frequency Dynamic Convolution	Jun 19, 2024	Event Detection, Sound Event Detection	Code Available	1
Sound event detection based on auxiliary decoder and maximum probability aggregation for DCASE Challenge 2024 Task 4	Jun 17, 2024	Decoder, Event Detection	Unverified	0
DCASE 2024 Task 4: Sound Event Detection with Heterogeneous Data and Missing Labels	Jun 12, 2024	Event Detection, Missing Labels	Unverified	0
Diversifying and Expanding Frequency-Adaptive Convolution Kernels for Sound Event Detection	Jun 8, 2024	Event Detection, Sound Event Detection	Code Available	1
Sound Event Bounding Boxes	Jun 6, 2024	Change Detection, Event Detection	Code Available	1
Revisiting Deep Audio-Text Retrieval Through the Lens of Transportation	May 16, 2024	AudioCaps, Event Detection	Code Available	1
UniAV: Unified Audio-Visual Perception for Multi-Task Video Event Localization	Apr 4, 2024	Action Localization, audio-visual event localization	Code Available	1
Mind the Domain Gap: a Systematic Analysis on Bioacoustic Sound Event Detection	Mar 27, 2024	Data Augmentation, Domain Adaptation	Code Available	2
Sound Event Detection and Localization with Distance Estimation	Mar 18, 2024	Event Detection, Sound Event Detection	Unverified	0
Multitask frame-level learning for few-shot sound event detection	Mar 17, 2024	Data Augmentation, Event Detection	Unverified	0
Fine-Grained Engine Fault Sound Event Detection Using Multimodal Signals	Mar 16, 2024	Event Detection, Fault Detection	Unverified	0
Dual Knowledge Distillation for Efficient Sound Event Detection	Feb 5, 2024	Event Detection, Knowledge Distillation	Unverified	0
BAT: Learning to Reason about Spatial Sounds with Large Language Models	Feb 2, 2024	Event Detection, Language Modelling	Unverified	0
Full-frequency dynamic convolution: a physical frequency-dependent convolution for sound event detection	Jan 10, 2024	Event Detection, Sound Event Detection	Code Available	1
Fusion of Audio and Visual Embeddings for Sound Event Localization and Detection	Dec 14, 2023	Data Augmentation, Event Detection	Code Available	1
w2v-SELD: A Sound Event Localization and Detection Framework for Self-Supervised Spatial Audio Pre-Training	Dec 12, 2023	Event Detection, Sound Event Detection	Code Available	1
tinyCLAP: Distilling Constrastive Language-Audio Pretrained Models	Nov 24, 2023	Audio Generation, Event Detection	Unverified	0


Datasets: DESED, L3DAS21, WildDESED, Mivia Audio Events, Mivia Road Events

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ATST-SED	event-based F1 score	63.4	—	Unverified
2	SE-CRNN-16 with DualKD	event-based F1 score	55.6	—	Unverified
3	FDY-CRNN	event-based F1 score	54	—	Unverified
4	HTS-AT	event-based F1 score	50.7	—	Unverified
5	RCT	event-based F1 score	49.62	—	Unverified
6	FiltAug SED	event-based F1 score	49.6	—	Unverified
7	SED-SSep baseline dcase task 4 2020 v2	event-based F1 score	40.7	—	Unverified
8	Baseline dcase task 4 2020 v2	event-based F1 score	39	—	Unverified
9	Baseline	event-based F1 score	25.8	—	Unverified
10	MAT-SED	PSDS1	0.59	—	Unverified
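
For readers unfamiliar with the event-based F1 score reported in the table above, the following is a simplified sketch of how such a score can be computed. A predicted event counts as a true positive when its label matches a reference event and its onset and offset fall within a tolerance of the reference. The 200 ms collar, the offset tolerance of 20% of the reference event length, the greedy one-to-one matching, and the pooling of all classes together are assumptions chosen for illustration; the listed papers may use different settings (for example class-wise averaging), and the function name `event_based_f1` is hypothetical.

```python
from typing import List, Tuple

Event = Tuple[str, float, float]  # (label, onset_seconds, offset_seconds)


def event_based_f1(
    reference: List[Event],
    predicted: List[Event],
    collar: float = 0.2,        # assumed onset/offset tolerance in seconds
    offset_ratio: float = 0.2,  # assumed offset tolerance as a fraction of event length
) -> float:
    """Simplified event-based F1 with greedy one-to-one matching (value in [0, 1])."""
    matched = set()  # indices of reference events that are already matched
    true_pos = 0
    for label, onset, offset in predicted:
        for j, (ref_label, ref_onset, ref_offset) in enumerate(reference):
            if j in matched or ref_label != label:
                continue
            # Offset tolerance grows with the length of the reference event.
            offset_tol = max(collar, offset_ratio * (ref_offset - ref_onset))
            if abs(onset - ref_onset) <= collar and abs(offset - ref_offset) <= offset_tol:
                matched.add(j)
                true_pos += 1
                break  # each prediction matches at most one reference event
    false_pos = len(predicted) - true_pos
    false_neg = len(reference) - true_pos
    denom = 2 * true_pos + false_pos + false_neg
    return 2 * true_pos / denom if denom else 0.0


reference = [("speech", 1.0, 3.0), ("dog", 2.0, 5.0)]
predicted = [("speech", 1.1, 3.2), ("dog", 3.0, 5.0)]  # the dog onset is 1 s late
print(event_based_f1(reference, predicted))
# 0.5
```

The function returns a fraction; the table values appear to be percentages, so the sketch's output would be multiplied by 100 for comparison.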

#	Model	Metric	Claimed	Verified	Status
1	PHC SEDnet n=8	Error Rate	0.56	—	Unverified
2	Quaternion SEDnet	Error Rate	0.52	—	Unverified
3	PHC SEDnet n=16	Error Rate	0.51	—	Unverified
4	PHC SEDnet n=4	Error Rate	0.45	—	Unverified
5	PHC SEDnet n=2	Error Rate	0.39	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CRNN (with BEATs + Separation)	PSDS1 (-5dB)	0.13	—	Unverified
2	CRNN (with BEATs)	PSDS1 (-5dB)	0.07	—	Unverified
3	CRNN (WildDESED + Curriculum learning)	PSDS1 (-5dB)	0.05	—	Unverified
4	CRNN (WildDESED)	PSDS1 (-5dB)	0.05	—	Unverified
5	CRNN	PSDS1 (-5dB)	0.02	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	DENet	Rank-1 Recognition Rate	0.98	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	DENet	Rank-1 Recognition Rate	1	—	Unverified