Sound Event Detection

Sound Event Detection (SED) is the task of recognizing the sound events in a recording together with their start and end times. Sound events in real life do not always occur in isolation; they often overlap considerably with each other. Recognizing such overlapping sound events is referred to as polyphonic SED.

Source: A report on sound event detection with different binaural features
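
To make the definition above concrete, the following is a minimal sketch of how SED output might be represented and checked for polyphony. The `SoundEvent` class, the helper functions, and the example labels are hypothetical names chosen for illustration; they are not taken from any of the papers listed on this page.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SoundEvent:
    """One detected event: a class label plus its temporal boundaries."""
    label: str     # hypothetical class name, e.g. "speech"
    onset: float   # start time in seconds
    offset: float  # end time in seconds


def overlapping_pairs(events):
    """Return every pair of events whose time intervals overlap."""
    ordered = sorted(events, key=lambda e: e.onset)
    pairs = []
    for i, first in enumerate(ordered):
        for second in ordered[i + 1:]:
            if second.onset >= first.offset:
                break  # sorted by onset, so no later event overlaps `first`
            pairs.append((first, second))
    return pairs


def max_polyphony(events):
    """Return the largest number of events active at the same instant."""
    # Sweep over boundaries: +1 at each onset, -1 at each offset.
    # At equal times the -1 sorts first, so touching events do not count
    # as overlapping.
    boundaries = sorted(
        [(e.onset, 1) for e in events] + [(e.offset, -1) for e in events]
    )
    active = peak = 0
    for _, delta in boundaries:
        active += delta
        peak = max(peak, active)
    return peak


# Illustrative annotation of a short recording (made-up values).
events = [
    SoundEvent("speech", 0.0, 4.0),
    SoundEvent("dog_bark", 1.5, 2.0),
    SoundEvent("dishes", 3.5, 6.0),
]

if __name__ == "__main__":
    for a, b in overlapping_pairs(events):
        print(f"{a.label} overlaps {b.label}")
    print("maximum polyphony:", max_polyphony(events))
```

A recording whose maximum polyphony is greater than 1 contains overlapping events, which is the polyphonic case described above.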

Papers

Showing 51–75 of 194 papers

Title	Date	Tasks	Status	Hype	Score
Pushing the Limit of Sound Event Detection with Multi-Dilated Frequency Dynamic Convolution	Jun 19, 2024	Event Detection, Sound Event Detection	Code Available	1	5
RCT: Random Consistency Training for Semi-supervised Sound Event Detection	Oct 21, 2021	Data Augmentation, Event Detection	Code Available	1	5
Revisiting Deep Audio-Text Retrieval Through the Lens of Transportation	May 16, 2024	AudioCaps, Event Detection	Code Available	1	5
Couple Learning for semi-supervised sound event detection	Oct 12, 2021	Event Detection, Sound Event Detection	Code Available	1	5
A Hybrid System of Sound Event Detection Transformer and Frame-wise Model for DCASE 2022 Task 4	Oct 18, 2022	Event Detection, Metric Learning	Code Available	1	5
Exploring Performance-Complexity Trade-Offs in Sound Event Detection Models	Mar 14, 2025	Audio Tagging, Event Detection	Code Available	1	5
Sound Event Detection: A Tutorial	Jul 12, 2021	BIG-bench Machine Learning, Event Detection	Code Available	0	5
Sound event detection in domestic environments with weakly labeled data and soundscape synthesis	Oct 26, 2019	Event Detection, Sound Event Detection	Code Available	0	5
City classification from multiple real-world sound scenes	Jul 29, 2019	Acoustic Scene Classification, Classification	Code Available	0	5
Evaluation of post-processing algorithms for polyphonic sound event detection	Jun 17, 2019	Audio Tagging, Event Detection	Code Available	0	5
Evaluating robustness of You Only Hear Once (YOHO) Algorithm on noisy audios in the VOICe Dataset	Nov 1, 2021	Event Detection, Retrieval	Code Available	0	5
Regularized Contrastive Pre-training for Few-shot Bioacoustic Sound Detection	Sep 16, 2023	Event Detection, Few-Shot Learning	Code Available	0	5
Recurrent Neural Networks for Polyphonic Sound Event Detection in Real Life Recordings	Apr 4, 2016	Data Augmentation, Event Detection	Code Available	0	5
Robust sound event detection in bioacoustic sensor networks	May 20, 2019	Data Augmentation, Event Detection	Code Available	0	5
Empirical Study of Drone Sound Detection in Real-Life Environment with Deep Neural Networks	Jan 20, 2017	Binary Classification, Event Detection	Code Available	0	5
Automatic Sound Event Detection and Classification of Great Ape Calls Using Neural Networks	Jan 5, 2023	Event Detection, Sound Event Detection	Code Available	0	5
Sound Event Detection Using Spatial Features and Convolutional Recurrent Neural Network	Jun 7, 2017	Event Detection, Sound Event Detection	Code Available	0	5
Musical Instrument Playing Technique Detection Based on FCN: Using Chinese Bowed-Stringed Instrument as an Example	Oct 20, 2019	Event Detection, Instrument Playing Technique Detection	Code Available	0	5
Specialized Decision Surface and Disentangled Feature for Weakly-Supervised Polyphonic Sound Event Detection	May 24, 2019	Event Detection, Multi-Label Classification	Code Available	0	5
On Out-of-Distribution Detection for Audio with Deep Nearest Neighbors	Oct 27, 2022	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Code Available	0	5
Memory Controlled Sequential Self Attention for Sound Recognition	May 13, 2020	Event Detection, Sound Event Detection	Code Available	0	5
Adversarial Representation Learning for Robust Privacy Preservation in Audio	Apr 29, 2023	Event Detection, Representation Learning	Code Available	0	5
AudioLog: LLMs-Powered Long Audio Logging with Hybrid Token-Semantic Contrastive Learning	Nov 21, 2023	Acoustic Scene Classification, Audio captioning	Code Available	0	5
MTDA-HSED: Mutual-Assistance Tuning and Dual-Branch Aggregating for Heterogeneous Sound Event Detection	Sep 10, 2024	Event Detection, Sound Event Detection	Code Available	0	5
Performance and energy balance: a comprehensive study of state-of-the-art sound event detection systems	Oct 5, 2023	Event Detection, Sound Event Detection	Code Available	0	5

Datasets: DESED, L3DAS21, WildDESED, Mivia Audio Events, Mivia Road Events

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ATST-SED	event-based F1 score	63.4	—	Unverified
2	SE-CRNN-16 with DualKD	event-based F1 score	55.6	—	Unverified
3	FDY-CRNN	event-based F1 score	54	—	Unverified
4	HTS-AT	event-based F1 score	50.7	—	Unverified
5	RCT	event-based F1 score	49.62	—	Unverified
6	FiltAug SED	event-based F1 score	49.6	—	Unverified
7	SED-SSep baseline dcase task 4 2020 v2	event-based F1 score	40.7	—	Unverified
8	Baseline dcase task 4 2020 v2	event-based F1 score	39	—	Unverified
9	Baseline	event-based F1 score	25.8	—	Unverified
10	MAT-SED	PSDS1	0.59	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	PHC SEDnet n=8	Error Rate	0.56	—	Unverified
2	Quaternion SEDnet	Error Rate	0.52	—	Unverified
3	PHC SEDnet n=16	Error Rate	0.51	—	Unverified
4	PHC SEDnet n=4	Error Rate	0.45	—	Unverified
5	PHC SEDnet n=2	Error Rate	0.39	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CRNN (with BEATs + Separation)	PSDS1 (-5dB)	0.13	—	Unverified
2	CRNN (with BEATs)	PSDS1 (-5dB)	0.07	—	Unverified
3	CRNN (WildDESED + Curriculum learning)	PSDS1 (-5dB)	0.05	—	Unverified
4	CRNN (WildDESED)	PSDS1 (-5dB)	0.05	—	Unverified
5	CRNN	PSDS1 (-5dB)	0.02	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	DENet	Rank-1 Recognition Rate	0.98	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	DENet	Rank-1 Recognition Rate	1	—	Unverified