Sound Event Detection

Sound Event Detection (SED) is the task of recognizing the sound events in a recording together with their respective start and end times. Sound events in real life do not always occur in isolation, but tend to overlap considerably with each other. Recognizing such overlapping sound events is referred to as polyphonic SED.

Source: A report on sound event detection with different binaural features
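
To make the task definition concrete, the sketch below shows one common way of turning frame-level class scores into sound events with start and end times. It is an illustration only, not the method of any paper listed here: the function name `frames_to_events`, the fixed 0.5 threshold, the hop size, and the example scores are all assumptions made for this example. Each class is thresholded independently, so overlapping (polyphonic) events fall out naturally.

```python
from typing import Dict, List, Tuple

Event = Tuple[str, float, float]  # (label, onset in seconds, offset in seconds)


def frames_to_events(
    activity: Dict[str, List[float]],
    hop_seconds: float = 0.02,
    threshold: float = 0.5,
) -> List[Event]:
    """Turn per-class frame probabilities into (label, onset, offset) events.

    Hypothetical helper: a frame counts as active when its score reaches
    `threshold`; consecutive active frames are merged into one event.
    """
    events: List[Event] = []
    for label, probs in activity.items():
        start = None  # index of the first frame of the currently open event
        for i, p in enumerate(probs):
            active = p >= threshold
            if active and start is None:
                start = i  # event begins at this frame
            elif not active and start is not None:
                events.append((label, start * hop_seconds, i * hop_seconds))
                start = None
        if start is not None:  # event still active at the end of the clip
            events.append((label, start * hop_seconds, len(probs) * hop_seconds))
    # Sort by onset time, then label, for a stable listing.
    return sorted(events, key=lambda e: (e[1], e[0]))


# Made-up scores for two classes that overlap in time (the polyphonic case).
scores = {
    "speech":   [0.1, 0.9, 0.8, 0.7, 0.2, 0.1],
    "dog_bark": [0.0, 0.1, 0.9, 0.9, 0.9, 0.1],
}
events = frames_to_events(scores, hop_seconds=0.5)
for label, onset, offset in events:
    print(f"{label}: {onset:.1f}s to {offset:.1f}s")
# speech: 0.5s to 2.0s
# dog_bark: 1.0s to 2.5s
```

In this toy example the two events overlap between 1.0 s and 2.0 s, which is exactly the situation polyphonic SED has to handle. Real systems usually add post-processing such as median filtering or class-specific thresholds, which this sketch omits.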

Papers

Showing 101–125 of 194 papers

Title	Date	Tasks	Status
Fine-Grained Engine Fault Sound Event Detection Using Multimodal Signals	Mar 16, 2024	Event Detection, Fault Detection	Unverified
FMSG-JLESS Submission for DCASE 2024 Task4 on Sound Event Detection with Heterogeneous Training Dataset and Potentially Missing Labels	Jun 29, 2024	Domain Generalization, Event Detection	Unverified
Formula-Supervised Sound Event Detection: Pre-Training Without Real Data	Apr 6, 2025	Event Detection, Sound Event Detection	Unverified
Framework for evaluation of sound event detection in web videos	Nov 2, 2017	Event Detection, Sound Event Detection	Unverified
Frequency Dynamic Convolutions for Sound Event Detection	Jun 15, 2025	ARC, Event Detection	Unverified
From Computation to Consumption: Exploring the Compute-Energy Link for Training and Testing Neural Networks for SED Systems	Sep 8, 2024	Audio Tagging, Event Detection	Unverified
Leveraging Language Model Capabilities for Sound Event Detection	Aug 22, 2023	Decoder, Event Detection	Unverified
HiSSNet: Sound Event Detection and Speaker Identification via Hierarchical Prototypical Networks for Low-Resource Headphones	Mar 13, 2023	Event Detection, Sound Event Detection	Unverified
Impact of Noisy Labels on Sound Event Detection: Deletion Errors Are More Detrimental Than Insertion Errors	Aug 27, 2024	Event Detection, Sound Event Detection	Unverified
Impact of temporal resolution on convolutional recurrent networks for audio tagging and sound event detection	Sep 26, 2022	Audio Tagging, Event Detection	Unverified
Impact of visual assistance for automated audio captioning	Nov 18, 2022	Audio captioning, Event Detection	Unverified
Improving Polyphonic Sound Event Detection on Multichannel Recordings with the Sørensen-Dice Coefficient Loss and Transfer Learning	Jul 22, 2021	Data Augmentation, Event Detection	Unverified
Improving Sound Event Detection Metrics: Insights from DCASE 2020	Oct 26, 2020	Event Detection, Sound Event Detection	Unverified
Improving Weakly Supervised Sound Event Detection with Causal Intervention	Mar 10, 2023	Event Detection, Sound Event Detection	Unverified
Incremental Learning Algorithm for Sound Event Detection	Mar 26, 2020	Event Detection, Incremental Learning	Unverified
Interactive Dual-Conformer with Scene-Inspired Mask for Soft Sound Event Detection	Nov 23, 2023	Event Detection, Sound Event Detection	Unverified
Learning to Detect Novel and Fine-Grained Acoustic Sequences Using Pretrained Audio Representations	May 3, 2023	Event Detection, Few-Shot Learning	Unverified
Leveraging Audio-Tagging Assisted Sound Event Detection using Weakified Strong Labels and Frequency Dynamic Convolutions	Apr 25, 2023	Audio Tagging, Event Detection	Unverified
Lightweight Sound Event Detection Model with RepVGG Architecture	Nov 1, 2022	Event Detection, Knowledge Distillation	Unverified
Mixstyle based Domain Generalization for Sound Event Detection with Heterogeneous Training Data	Jul 4, 2024	Domain Generalization, Event Detection	Unverified
Multi-Branch Learning for Weakly-Labeled Sound Event Detection	Feb 22, 2020	Audio Tagging, Boundary Detection	Unverified
Proposal-based Few-shot Sound Event Detection for Speech and Environmental Sounds with Perceivers	Jul 28, 2021	Event Detection, Keyword Spotting	Unverified
Pseudo Strong Labels from Frame-Level Predictions for Weakly Supervised Sound Event Detection	Jan 7, 2025	Event Detection, Sound Event Detection	Unverified
Quaternion Convolutional Neural Networks for Detection and Localization of 3D Sound Events	Dec 17, 2018	Event Detection, Sound Event Detection	Unverified
RCRNN-based Sound Event Detection System with Specific Speech Resolution	Oct 1, 2021	Data Augmentation, Event Detection	Unverified

Datasets: DESED, L3DAS21, WildDESED, Mivia Audio Events, Mivia Road Events

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ATST-SED	event-based F1 score	63.4	—	Unverified
2	SE-CRNN-16 with DualKD	event-based F1 score	55.6	—	Unverified
3	FDY-CRNN	event-based F1 score	54	—	Unverified
4	HTS-AT	event-based F1 score	50.7	—	Unverified
5	RCT	event-based F1 score	49.62	—	Unverified
6	FiltAug SED	event-based F1 score	49.6	—	Unverified
7	SED-SSep baseline dcase task 4 2020 v2	event-based F1 score	40.7	—	Unverified
8	Baseline dcase task 4 2020 v2	event-based F1 score	39	—	Unverified
9	Baseline	event-based F1 score	25.8	—	Unverified
10	MAT-SED	PSDS1	0.59	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	PHC SEDnet n=8	Error Rate	0.56	—	Unverified
2	Quaternion SEDnet	Error Rate	0.52	—	Unverified
3	PHC SEDnet n=16	Error Rate	0.51	—	Unverified
4	PHC SEDnet n=4	Error Rate	0.45	—	Unverified
5	PHC SEDnet n=2	Error Rate	0.39	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CRNN (with BEATs + Separation)	PSDS1 (-5dB)	0.13	—	Unverified
2	CRNN (with BEATs)	PSDS1 (-5dB)	0.07	—	Unverified
3	CRNN (WildDESED + Curriculum learning)	PSDS1 (-5dB)	0.05	—	Unverified
4	CRNN (WildDESED)	PSDS1 (-5dB)	0.05	—	Unverified
5	CRNN	PSDS1 (-5dB)	0.02	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	DENet	Rank-1 Recognition Rate	0.98	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	DENet	Rank-1 Recognition Rate	1	—	Unverified