Sound Event Detection

Sound Event Detection (SED) is the task of recognizing the sound events and their respective temporal start and end time in a recording. Sound events in real life do not always occur in isolation, but tend to considerably overlap with each other. Recognizing such overlapping sound events is referred as polyphonic SED.

Source: A report on sound event detection with different binaural features

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 126–150 of 194 papers

Title	Date	Tasks	Status
DCASE 2024 Task 4: Sound Event Detection with Heterogeneous Data and Missing Labels	Jun 12, 2024	Event DetectionMissing Labels	—Unverified
Deep Convolutional and Recurrent Networks for Polyphonic Instrument Classification from Monophonic Raw Audio Waveforms	Feb 13, 2021	Audio ClassificationClassification	—Unverified
DESED-FL and URBAN-FL: Federated Learning Datasets for Sound Event Detection	Feb 17, 2021	Event DetectionFederated Learning	—Unverified
DiffSED: Sound Event Detection with Denoising Diffusion	Aug 14, 2023	DecoderDenoising	—Unverified
Divided spectro-temporal attention for sound event localization and detection in real scenes for DCASE2023 challenge	Jun 5, 2023	Event DetectionSound Event Detection	—Unverified
Do sound event representations generalize to other audio tasks? A case study in audio transfer learning	Jun 21, 2021	Event DetectionSound Event Detection	—Unverified
Dual Knowledge Distillation for Efficient Sound Event Detection	Feb 5, 2024	Event DetectionKnowledge Distillation	—Unverified
Effect of noise suppression losses on speech distortion and ASR performance	Nov 23, 2021	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
End-to-End Polyphonic Sound Event Detection Using Convolutional Recurrent Neural Networks with Learned Time-Frequency Representation Input	May 9, 2018	Event DetectionSound Event Detection	—Unverified
Energy Consumption Trends in Sound Event Detection Systems	Sep 13, 2024	Event DetectionSound Event Detection	—Unverified
Evaluating Classification Systems Against Soft Labels with Fuzzy Precision and Recall	Sep 25, 2023	Event DetectionSound Event Detection	—Unverified
Exploring the Potential of SSL Models for Sound Event Detection	May 17, 2025	Event DetectionModel Selection	—Unverified
Fine-Grained Engine Fault Sound Event Detection Using Multimodal Signals	Mar 16, 2024	Event DetectionFault Detection	—Unverified
FMSG-JLESS Submission for DCASE 2024 Task4 on Sound Event Detection with Heterogeneous Training Dataset and Potentially Missing Labels	Jun 29, 2024	Domain GeneralizationEvent Detection	—Unverified
Formula-Supervised Sound Event Detection: Pre-Training Without Real Data	Apr 6, 2025	Event DetectionSound Event Detection	—Unverified
Framework for evaluation of sound event detection in web videos	Nov 2, 2017	Event DetectionSound Event Detection	—Unverified
Frequency Dynamic Convolutions for Sound Event Detection	Jun 15, 2025	ARCEvent Detection	—Unverified
From Computation to Consumption: Exploring the Compute-Energy Link for Training and Testing Neural Networks for SED Systems	Sep 8, 2024	Audio TaggingEvent Detection	—Unverified
Leveraging Language Model Capabilities for Sound Event Detection	Aug 22, 2023	DecoderEvent Detection	—Unverified
HiSSNet: Sound Event Detection and Speaker Identification via Hierarchical Prototypical Networks for Low-Resource Headphones	Mar 13, 2023	Event DetectionSound Event Detection	—Unverified
Impact of Noisy Labels on Sound Event Detection: Deletion Errors Are More Detrimental Than Insertion Errors	Aug 27, 2024	Event DetectionSound Event Detection	—Unverified
Impact of temporal resolution on convolutional recurrent networks for audio tagging and sound event detection	Sep 26, 2022	Audio TaggingEvent Detection	—Unverified
Impact of visual assistance for automated audio captioning	Nov 18, 2022	Audio captioningEvent Detection	—Unverified
Improving Polyphonic Sound Event Detection on Multichannel Recordings with the Sørensen-Dice Coefficient Loss and Transfer Learning	Jul 22, 2021	Data AugmentationEvent Detection	—Unverified
Improving Sound Event Detection Metrics: Insights from DCASE 2020	Oct 26, 2020	Event DetectionSound Event Detection	—Unverified

Show:10 25 50

← PrevPage 6 of 8Next →

All datasets DESED L3DAS21 WildDESED Mivia Audio Events Mivia Road Events

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ATST-SED	event-based F1 score	63.4	—	Unverified
2	SE-CRNN-16 with DualKD	event-based F1 score	55.6	—	Unverified
3	FDY-CRNN	event-based F1 score	54	—	Unverified
4	HTS-AT	event-based F1 score	50.7	—	Unverified
5	RCT	event-based F1 score	49.62	—	Unverified
6	FiltAug SED	event-based F1 score	49.6	—	Unverified
7	SED-SSep baseline dcase task 4 2020 v2	event-based F1 score	40.7	—	Unverified
8	Baseline dcase task 4 2020 v2	event-based F1 score	39	—	Unverified
9	Baseline	event-based F1 score	25.8	—	Unverified
10	MAT-SED	PSDS1	0.59	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	PHC SEDnet n=8	Error Rate	0.56	—	Unverified
2	Quaternion SEDnet	Error Rate	0.52	—	Unverified
3	PHC SEDnet n=16	Error Rate	0.51	—	Unverified
4	PHC SEDnet n=4	Error Rate	0.45	—	Unverified
5	PHC SEDnet n=2	Error Rate	0.39	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CRNN (with BEATs + Separation)	PSDS1 (-5dB)	0.13	—	Unverified
2	CRNN (with BEATs)	PSDS1 (-5dB)	0.07	—	Unverified
3	CRNN (WildDESED + Curriculrm learning)	PSDS1 (-5dB)	0.05	—	Unverified
4	CRNN (WildDESED)	PSDS1 (-5dB)	0.05	—	Unverified
5	CRNN	PSDS1 (-5dB)	0.02	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	DENet	Rank-1 Recognition Rate	0.98	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	DENet	Rank-1 Recognition Rate	1	—	Unverified