Sound Event Detection

Sound Event Detection (SED) is the task of recognizing the sound events and their respective temporal start and end time in a recording. Sound events in real life do not always occur in isolation, but tend to considerably overlap with each other. Recognizing such overlapping sound events is referred as polyphonic SED.

Source: A report on sound event detection with different binaural features

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 101–150 of 194 papers

Title	Date	Tasks	Status
Weakly Labeled Sound Event Detection Using Tri-training and Adversarial Learning	Oct 14, 2019	Event DetectionSound Event Detection	—Unverified
A benchmark of state-of-the-art sound event detection systems evaluated on synthetic soundscapes	Feb 3, 2022	Data AugmentationEvent Detection	—Unverified
Zero-shot Audio Source Separation through Query-based Learningfrom Weakly-labeled Data	Dec 15, 2021	Audio Source SeparationEvent Detection	—Unverified
A Capsule based Approach for Polyphonic Sound Event Detection	Jul 23, 2018	Event DetectionSound Event Detection	—Unverified
A Comparative Study of Western and Chinese Classical Music based on Soundscape Models	Feb 20, 2020	Emotion RecognitionEvent Detection	—Unverified
Active Learning for Sound Event Detection	Feb 12, 2020	Active LearningChange Point Detection	—Unverified
Adaptive Few-Shot Learning Algorithm for Rare Sound Event Detection	May 24, 2022	Event DetectionFew-Shot Learning	—Unverified
Affinity Mixup for Weakly Supervised Sound Event Detection	Jun 21, 2021	Event DetectionSound Event Detection	—Unverified
Aggregation Strategies for Efficient Annotation of Bioacoustic Sound Events Using Active Learning	Mar 4, 2025	Active LearningEvent Detection	—Unverified
A Multi-Task Learning Framework for Sound Event Detection using High-level Acoustic Characteristics of Sounds	May 18, 2023	Event DetectionMulti-Task Learning	—Unverified
An Experimental Study on Joint Modeling for Sound Event Localization and Detection with Source Distance Estimation	Jan 18, 2025	Event DetectionSound Event Detection	—Unverified
A Sequence Matching Network for Polyphonic Sound Event Localization and Detection	Feb 14, 2020	Direction of Arrival EstimationEvent Detection	—Unverified
AST-SED: An Effective Sound Event Detection Method Based on Audio Spectrogram Transformer	Mar 7, 2023	Audio TaggingDecoder	—Unverified
Audio-Based Epileptic Seizure Detection	Sep 6, 2019	Event DetectionSeizure Detection	—Unverified
Audiovisual transfer learning for audio tagging and sound event detection	Jun 9, 2021	Audio TaggingEvent Detection	—Unverified
Auditory Neural Response Inspired Sound Event Detection Based on Spectro-temporal Receptive Field	Jun 20, 2023	Event DetectionSound Event Detection	—Unverified
Automated Bioacoustic Monitoring for South African Bird Species on Unlabeled Data	Jun 19, 2024	Event DetectionSound Event Detection	—Unverified
BAT: Learning to Reason about Spatial Sounds with Large Language Models	Feb 2, 2024	Event DetectionLanguage Modelling	—Unverified
Binaural Signal Representations for Joint Sound Event Detection and Acoustic Scene Classification	Sep 13, 2022	Acoustic Scene ClassificationEvent Detection	—Unverified
Channel Compression: Rethinking Information Redundancy among Channels in CNN Architecture	Jul 2, 2020	Acoustic Scene ClassificationEvent Detection	—Unverified
Channel-Spatial-Based Few-Shot Bird Sound Event Detection	Jun 18, 2023	Event DetectionFew-Shot Learning	—Unverified
Compact recurrent neural networks for acoustic event detection on low-energy low-complexity platforms	Jan 29, 2020	Event DetectionQuantization	—Unverified
Conditioned Time-Dilated Convolutions for Sound Event Detection	Jul 10, 2020	Event DetectionLanguage Modelling	—Unverified
Crowdsourcing strong labels for sound event detection	Jul 26, 2021	Event DetectionSound Event Detection	—Unverified
DASED: A Multi-Domain Dataset for Sound Event Detection Domain Adaptation	Oct 19, 2018	Domain AdaptationEvent Detection	—Unverified
DCASE 2024 Task 4: Sound Event Detection with Heterogeneous Data and Missing Labels	Jun 12, 2024	Event DetectionMissing Labels	—Unverified
Deep Convolutional and Recurrent Networks for Polyphonic Instrument Classification from Monophonic Raw Audio Waveforms	Feb 13, 2021	Audio ClassificationClassification	—Unverified
DESED-FL and URBAN-FL: Federated Learning Datasets for Sound Event Detection	Feb 17, 2021	Event DetectionFederated Learning	—Unverified
DiffSED: Sound Event Detection with Denoising Diffusion	Aug 14, 2023	DecoderDenoising	—Unverified
Divided spectro-temporal attention for sound event localization and detection in real scenes for DCASE2023 challenge	Jun 5, 2023	Event DetectionSound Event Detection	—Unverified
Do sound event representations generalize to other audio tasks? A case study in audio transfer learning	Jun 21, 2021	Event DetectionSound Event Detection	—Unverified
Dual Knowledge Distillation for Efficient Sound Event Detection	Feb 5, 2024	Event DetectionKnowledge Distillation	—Unverified
Effect of noise suppression losses on speech distortion and ASR performance	Nov 23, 2021	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
End-to-End Polyphonic Sound Event Detection Using Convolutional Recurrent Neural Networks with Learned Time-Frequency Representation Input	May 9, 2018	Event DetectionSound Event Detection	—Unverified
Energy Consumption Trends in Sound Event Detection Systems	Sep 13, 2024	Event DetectionSound Event Detection	—Unverified
Evaluating Classification Systems Against Soft Labels with Fuzzy Precision and Recall	Sep 25, 2023	Event DetectionSound Event Detection	—Unverified
Exploring the Potential of SSL Models for Sound Event Detection	May 17, 2025	Event DetectionModel Selection	—Unverified
Fine-Grained Engine Fault Sound Event Detection Using Multimodal Signals	Mar 16, 2024	Event DetectionFault Detection	—Unverified
FMSG-JLESS Submission for DCASE 2024 Task4 on Sound Event Detection with Heterogeneous Training Dataset and Potentially Missing Labels	Jun 29, 2024	Domain GeneralizationEvent Detection	—Unverified
Formula-Supervised Sound Event Detection: Pre-Training Without Real Data	Apr 6, 2025	Event DetectionSound Event Detection	—Unverified
Framework for evaluation of sound event detection in web videos	Nov 2, 2017	Event DetectionSound Event Detection	—Unverified
Frequency Dynamic Convolutions for Sound Event Detection	Jun 15, 2025	ARCEvent Detection	—Unverified
From Computation to Consumption: Exploring the Compute-Energy Link for Training and Testing Neural Networks for SED Systems	Sep 8, 2024	Audio TaggingEvent Detection	—Unverified
Leveraging Language Model Capabilities for Sound Event Detection	Aug 22, 2023	DecoderEvent Detection	—Unverified
HiSSNet: Sound Event Detection and Speaker Identification via Hierarchical Prototypical Networks for Low-Resource Headphones	Mar 13, 2023	Event DetectionSound Event Detection	—Unverified
Impact of Noisy Labels on Sound Event Detection: Deletion Errors Are More Detrimental Than Insertion Errors	Aug 27, 2024	Event DetectionSound Event Detection	—Unverified
Impact of temporal resolution on convolutional recurrent networks for audio tagging and sound event detection	Sep 26, 2022	Audio TaggingEvent Detection	—Unverified
Impact of visual assistance for automated audio captioning	Nov 18, 2022	Audio captioningEvent Detection	—Unverified
Improving Polyphonic Sound Event Detection on Multichannel Recordings with the Sørensen-Dice Coefficient Loss and Transfer Learning	Jul 22, 2021	Data AugmentationEvent Detection	—Unverified
Improving Sound Event Detection Metrics: Insights from DCASE 2020	Oct 26, 2020	Event DetectionSound Event Detection	—Unverified

Show:10 25 50

← PrevPage 3 of 4Next →

All datasets DESED L3DAS21 WildDESED Mivia Audio Events Mivia Road Events

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ATST-SED	event-based F1 score	63.4	—	Unverified
2	SE-CRNN-16 with DualKD	event-based F1 score	55.6	—	Unverified
3	FDY-CRNN	event-based F1 score	54	—	Unverified
4	HTS-AT	event-based F1 score	50.7	—	Unverified
5	RCT	event-based F1 score	49.62	—	Unverified
6	FiltAug SED	event-based F1 score	49.6	—	Unverified
7	SED-SSep baseline dcase task 4 2020 v2	event-based F1 score	40.7	—	Unverified
8	Baseline dcase task 4 2020 v2	event-based F1 score	39	—	Unverified
9	Baseline	event-based F1 score	25.8	—	Unverified
10	MAT-SED	PSDS1	0.59	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	PHC SEDnet n=8	Error Rate	0.56	—	Unverified
2	Quaternion SEDnet	Error Rate	0.52	—	Unverified
3	PHC SEDnet n=16	Error Rate	0.51	—	Unverified
4	PHC SEDnet n=4	Error Rate	0.45	—	Unverified
5	PHC SEDnet n=2	Error Rate	0.39	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CRNN (with BEATs + Separation)	PSDS1 (-5dB)	0.13	—	Unverified
2	CRNN (with BEATs)	PSDS1 (-5dB)	0.07	—	Unverified
3	CRNN (WildDESED + Curriculrm learning)	PSDS1 (-5dB)	0.05	—	Unverified
4	CRNN (WildDESED)	PSDS1 (-5dB)	0.05	—	Unverified
5	CRNN	PSDS1 (-5dB)	0.02	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	DENet	Rank-1 Recognition Rate	0.98	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	DENet	Rank-1 Recognition Rate	1	—	Unverified