Sound Event Detection

Sound Event Detection (SED) is the task of recognizing the sound events in a recording together with their start and end times. Sound events in real life do not always occur in isolation; they often overlap considerably with each other. Recognizing such overlapping sound events is referred to as polyphonic SED.

Source: A report on sound event detection with different binaural features
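
The task definition above can be illustrated with a minimal sketch. It assumes a system that outputs one binary activity sequence per class at a fixed frame hop; the function name `frames_to_events`, the class labels, and the 0.5 s hop are hypothetical and chosen only for illustration. The sketch turns frame activity into (label, onset, offset) events and shows two events overlapping in time, which is the polyphonic case.

```python
from typing import Dict, List, Tuple

Event = Tuple[str, float, float]  # (label, onset_seconds, offset_seconds)


def frames_to_events(activity: Dict[str, List[int]], hop_seconds: float) -> List[Event]:
    """Convert per-class binary frame activity into timed events.

    Each run of consecutive active frames becomes one event whose onset is
    the start of the first active frame and whose offset is the end of the
    last active frame.
    """
    events: List[Event] = []
    for label, frames in activity.items():
        start = None
        for i, active in enumerate(frames):
            if active and start is None:
                start = i  # a new event begins at this frame
            elif not active and start is not None:
                events.append((label, start * hop_seconds, i * hop_seconds))
                start = None
        if start is not None:  # event still active at the end of the clip
            events.append((label, start * hop_seconds, len(frames) * hop_seconds))
    return sorted(events, key=lambda e: (e[1], e[0]))


# Hypothetical frame-level output for a 3-second clip (6 frames of 0.5 s).
activity = {
    "speech": [0, 1, 1, 1, 0, 0],
    "dog_bark": [0, 0, 1, 1, 1, 0],
}
events = frames_to_events(activity, hop_seconds=0.5)
print(events)  # [('speech', 0.5, 2.0), ('dog_bark', 1.0, 2.5)]
```

Here the two events are both active between 1.0 s and 2.0 s, so a monophonic detector that reports only one class per frame could not represent this output.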

Papers


Showing 51–75 of 194 papers

Title	Date	Tasks	Status	Hype
Self Training and Ensembling Frequency Dependent Networks with Coarse Prediction Pooling and Sound Event Bounding Boxes	Jun 22, 2024	Change Detection, Data Augmentation	Code Available	1
Full-frequency dynamic convolution: a physical frequency-dependent convolution for sound event detection	Jan 10, 2024	Event Detection, Sound Event Detection	Code Available	1
Multi-Task Learning for Interpretable Weakly Labelled Sound Event Detection	Aug 17, 2020	Event Detection, Multiple Instance Learning	Code Available	1
Frequency Dynamic Convolution: Frequency-Adaptive Pattern Recognition for Sound Event Detection	Mar 29, 2022	Event Detection, Sound Event Detection	Code Available	1
What Makes Sound Event Localization and Detection Difficult? Insights from Error Analysis	Jul 22, 2021	Direction of Arrival Estimation, Event Detection	Code Available	1
Exploring Performance-Complexity Trade-Offs in Sound Event Detection Models	Mar 14, 2025	Audio Tagging, Event Detection	Code Available	1
Conditioned Time-Dilated Convolutions for Sound Event Detection	Jul 10, 2020	Event Detection, Language Modelling	Unverified	0
Compact recurrent neural networks for acoustic event detection on low-energy low-complexity platforms	Jan 29, 2020	Event Detection, Quantization	Unverified	0
A Multi-Task Learning Framework for Sound Event Detection using High-level Acoustic Characteristics of Sounds	May 18, 2023	Event Detection, Multi-Task Learning	Unverified	0
Exploring the Potential of SSL Models for Sound Event Detection	May 17, 2025	Event Detection, Model Selection	Unverified	0
Channel-Spatial-Based Few-Shot Bird Sound Event Detection	Jun 18, 2023	Event Detection, Few-Shot Learning	Unverified	0
Fine-Grained Engine Fault Sound Event Detection Using Multimodal Signals	Mar 16, 2024	Event Detection, Fault Detection	Unverified	0
Adaptive Few-Shot Learning Algorithm for Rare Sound Event Detection	May 24, 2022	Event Detection, Few-Shot Learning	Unverified	0
Improving Sound Event Detection Metrics: Insights from DCASE 2020	Oct 26, 2020	Event Detection, Sound Event Detection	Unverified	0
Channel Compression: Rethinking Information Redundancy among Channels in CNN Architecture	Jul 2, 2020	Acoustic Scene Classification, Event Detection	Unverified	0
Evaluating Classification Systems Against Soft Labels with Fuzzy Precision and Recall	Sep 25, 2023	Event Detection, Sound Event Detection	Unverified	0
Aggregation Strategies for Efficient Annotation of Bioacoustic Sound Events Using Active Learning	Mar 4, 2025	Active Learning, Event Detection	Unverified	0
Energy Consumption Trends in Sound Event Detection Systems	Sep 13, 2024	Event Detection, Sound Event Detection	Unverified	0
End-to-End Polyphonic Sound Event Detection Using Convolutional Recurrent Neural Networks with Learned Time-Frequency Representation Input	May 9, 2018	Event Detection, Sound Event Detection	Unverified	0
Binaural Signal Representations for Joint Sound Event Detection and Acoustic Scene Classification	Sep 13, 2022	Acoustic Scene Classification, Event Detection	Unverified	0
BAT: Learning to Reason about Spatial Sounds with Large Language Models	Feb 2, 2024	Event Detection, Language Modelling	Unverified	0
Effect of noise suppression losses on speech distortion and ASR performance	Nov 23, 2021	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified	0
Affinity Mixup for Weakly Supervised Sound Event Detection	Jun 21, 2021	Event Detection, Sound Event Detection	Unverified	0
Dual Knowledge Distillation for Efficient Sound Event Detection	Feb 5, 2024	Event Detection, Knowledge Distillation	Unverified	0
Active Learning for Sound Event Detection	Feb 12, 2020	Active Learning, Change Point Detection	Unverified	0


Datasets: DESED, L3DAS21, WildDESED, Mivia Audio Events, Mivia Road Events

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ATST-SED	event-based F1 score	63.4	—	Unverified
2	SE-CRNN-16 with DualKD	event-based F1 score	55.6	—	Unverified
3	FDY-CRNN	event-based F1 score	54	—	Unverified
4	HTS-AT	event-based F1 score	50.7	—	Unverified
5	RCT	event-based F1 score	49.62	—	Unverified
6	FiltAug SED	event-based F1 score	49.6	—	Unverified
7	SED-SSep baseline dcase task 4 2020 v2	event-based F1 score	40.7	—	Unverified
8	Baseline dcase task 4 2020 v2	event-based F1 score	39	—	Unverified
9	Baseline	event-based F1 score	25.8	—	Unverified
10	MAT-SED	PSDS1	0.59	—	Unverified
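
As a rough illustration of what an event-based F1 score measures, the sketch below counts an estimated event as correct when its label matches a reference event and its onset and offset fall within a tolerance. The 200 ms collar and the offset tolerance of 20% of the reference event length are assumptions borrowed from common SED evaluation practice; this page does not state which settings the table above uses. The function name `event_based_f1` and the greedy one-to-one matching are also simplifications for illustration, so published results should be reproduced with an established evaluation toolkit instead.

```python
from typing import List, Tuple

Event = Tuple[str, float, float]  # (label, onset_seconds, offset_seconds)


def event_based_f1(reference: List[Event], estimated: List[Event],
                   collar: float = 0.2, offset_ratio: float = 0.2) -> float:
    """Simplified event-based F1 with greedy one-to-one matching.

    collar and offset_ratio are assumed defaults, not values from the source.
    """
    matched = set()  # indices of reference events already paired
    tp = 0
    for label, onset, offset in estimated:
        for idx, (r_label, r_onset, r_offset) in enumerate(reference):
            if idx in matched or r_label != label:
                continue
            # Offset tolerance grows with the length of the reference event.
            offset_tol = max(collar, offset_ratio * (r_offset - r_onset))
            if abs(onset - r_onset) <= collar and abs(offset - r_offset) <= offset_tol:
                matched.add(idx)
                tp += 1
                break
    fp = len(estimated) - tp   # estimated events with no reference match
    fn = len(reference) - tp   # reference events that were missed
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


reference = [("speech", 0.5, 2.0), ("dog_bark", 1.0, 2.5)]
estimated = [("speech", 0.6, 2.1), ("dog_bark", 1.5, 2.5)]
f1 = event_based_f1(reference, estimated)
print(f1)  # 0.5
```

In this example the speech event is matched, while the dog bark onset is 0.5 s late and falls outside the collar, giving one true positive, one false positive, and one false negative.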

#	Model	Metric	Claimed	Verified	Status
1	PHC SEDnet n=8	Error Rate	0.56	—	Unverified
2	Quaternion SEDnet	Error Rate	0.52	—	Unverified
3	PHC SEDnet n=16	Error Rate	0.51	—	Unverified
4	PHC SEDnet n=4	Error Rate	0.45	—	Unverified
5	PHC SEDnet n=2	Error Rate	0.39	—	Unverified
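
For readers unfamiliar with error rate as an SED metric, the sketch below assumes the segment-based definition that is common in SED evaluation: in each fixed-length segment, missed and spurious classes are paired off as substitutions, the remainder are counted as deletions or insertions, and the total is divided by the number of active reference events. This page does not say which definition the table above uses, so the formula and the name `segment_error_rate` are assumptions for illustration only.

```python
from typing import List, Set


def segment_error_rate(reference: List[Set[str]], estimated: List[Set[str]]) -> float:
    """Segment-based error rate: (substitutions + deletions + insertions) / N.

    Each list element is the set of class labels active in one segment.
    """
    if len(reference) != len(estimated):
        raise ValueError("reference and estimated must cover the same segments")
    subs = dels = ins = n_ref = 0
    for ref, est in zip(reference, estimated):
        fn = len(ref - est)   # active in reference, missing from estimate
        fp = len(est - ref)   # active in estimate, absent from reference
        s = min(fn, fp)       # a miss paired with a false alarm is a substitution
        subs += s
        dels += fn - s
        ins += fp - s
        n_ref += len(ref)
    if n_ref == 0:
        raise ValueError("error rate is undefined without reference events")
    return (subs + dels + ins) / n_ref


# Hypothetical four-segment clip.
reference = [{"speech"}, {"speech", "dog_bark"}, {"dog_bark"}, set()]
estimated = [{"speech"}, {"speech"}, {"car"}, {"car"}]
er = segment_error_rate(reference, estimated)
print(er)  # 0.75
```

Under this definition lower values are better, and the score can exceed 1 when a system produces many insertions.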

#	Model	Metric	Claimed	Verified	Status
1	CRNN (with BEATs + Separation)	PSDS1 (-5dB)	0.13	—	Unverified
2	CRNN (with BEATs)	PSDS1 (-5dB)	0.07	—	Unverified
3	CRNN (WildDESED + Curriculum learning)	PSDS1 (-5dB)	0.05	—	Unverified
4	CRNN (WildDESED)	PSDS1 (-5dB)	0.05	—	Unverified
5	CRNN	PSDS1 (-5dB)	0.02	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	DENet	Rank-1 Recognition Rate	0.98	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	DENet	Rank-1 Recognition Rate	1	—	Unverified