Sound Event Detection

Sound Event Detection (SED) is the task of recognizing the sound events in a recording together with their respective start and end times. Sound events in real life do not always occur in isolation; they often overlap considerably with each other. Recognizing such overlapping sound events is referred to as polyphonic SED.

Source: A report on sound event detection with different binaural features
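
To make the definition above concrete, here is a minimal sketch of how a polyphonic SED output might be represented. It assumes a hypothetical setup in which a detector has already produced binary per-class activity for fixed-length frames; the class names, the frame hop of 0.5 s, and the helper names `frames_to_events` and `overlaps` are illustrative assumptions, not taken from any paper listed here.

```python
from typing import Dict, List, Tuple

# An event is (label, onset_seconds, offset_seconds).
Event = Tuple[str, float, float]


def frames_to_events(activity: Dict[str, List[int]], hop_seconds: float) -> List[Event]:
    """Turn per-class binary frame activity into (label, onset, offset) events."""
    events: List[Event] = []
    for label, frames in activity.items():
        onset = None  # index of the frame where the current event started
        for i, active in enumerate(frames):
            if active and onset is None:
                onset = i
            elif not active and onset is not None:
                events.append((label, onset * hop_seconds, i * hop_seconds))
                onset = None
        if onset is not None:
            # The event is still active when the recording ends.
            events.append((label, onset * hop_seconds, len(frames) * hop_seconds))
    # Sort by onset time, then by label, for a stable listing.
    return sorted(events, key=lambda e: (e[1], e[0]))


def overlaps(a: Event, b: Event) -> bool:
    """True if two events share any stretch of time (the polyphonic case)."""
    return a[1] < b[2] and b[1] < a[2]


# Hypothetical frame-level activity for two classes (1 = active, 0 = inactive).
activity = {
    "speech":   [0, 1, 1, 1, 0, 0, 0, 0],
    "dog_bark": [0, 0, 1, 1, 1, 0, 1, 0],
}
events = frames_to_events(activity, hop_seconds=0.5)
for label, onset, offset in events:
    print(f"{label}: {onset:.1f}s to {offset:.1f}s")
```

In this toy input the speech event and the first dog bark share the interval from 1.0 s to 2.0 s, which is the overlapping situation that polyphonic SED has to handle; a monophonic system would report at most one active class per frame.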

Papers

Showing 76–100 of 194 papers

Title	Date	Tasks	Status	Hype
AD-YOLO: You Look Only Once in Training Multiple Sound Event Localization and Detection	Mar 28, 2023	Direction of Arrival Estimation, Sound Event Detection	Code Available	1
HiSSNet: Sound Event Detection and Speaker Identification via Hierarchical Prototypical Networks for Low-Resource Headphones	Mar 13, 2023	Event Detection, Sound Event Detection	Unverified	0
Improving Weakly Supervised Sound Event Detection with Causal Intervention	Mar 10, 2023	Event Detection, Sound Event Detection	Unverified	0
AST-SED: An Effective Sound Event Detection Method Based on Audio Spectrogram Transformer	Mar 7, 2023	Audio Tagging, Decoder	Unverified	0
Training sound event detection with soft labels from crowdsourced annotations	Feb 28, 2023	Event Detection, Sound Event Detection	Unverified	0
LOCUS: LOcalization with Channel Uncertainty and Sporadic Energy	Feb 18, 2023	Event Detection, Missing Elements	Unverified	0
Multi-dimensional frequency dynamic convolution with confident mean teacher for sound event detection	Feb 18, 2023	Event Detection, Sound Event Detection	Unverified	0
A dataset for Audio-Visual Sound Event Detection in Movies	Feb 14, 2023	Event Detection, Self-Driving Cars	Code Available	1
Automatic Sound Event Detection and Classification of Great Ape Calls Using Neural Networks	Jan 5, 2023	Event Detection, Sound Event Detection	Code Available	0
Impact of visual assistance for automated audio captioning	Nov 18, 2022	Audio captioning, Event Detection	Unverified	0
Lightweight Sound Event Detection Model with RepVGG Architecture	Nov 1, 2022	Event Detection, Knowledge Distillation	Unverified	0
On Out-of-Distribution Detection for Audio with Deep Nearest Neighbors	Oct 27, 2022	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Code Available	0
A Hybrid System of Sound Event Detection Transformer and Frame-wise Model for DCASE 2022 Task 4	Oct 18, 2022	Event Detection, Metric Learning	Code Available	1
Optimizing Temporal Resolution Of Convolutional Recurrent Neural Networks For Sound Event Detection	Oct 18, 2022	Event Detection, Sound Event Detection	Unverified	0
Multi-encoder attention-based architectures for sound recognition with partial visual assistance	Sep 26, 2022	Audio Tagging, Event Detection	Unverified	0
Impact of temporal resolution on convolutional recurrent networks for audio tagging and sound event detection	Sep 26, 2022	Audio Tagging, Event Detection	Unverified	0
Binaural Signal Representations for Joint Sound Event Detection and Acoustic Scene Classification	Sep 13, 2022	Acoustic Scene Classification, Event Detection	Unverified	0
Sound Event Localization and Detection for Real Spatial Sound Scenes: Event-Independent Network and Data Augmentation Chains	Sep 5, 2022	Data Augmentation, Direction of Arrival Estimation	Code Available	1
Few-shot bioacoustic event detection at the DCASE 2022 challenge	Jul 14, 2022	Event Detection, Sound Event Detection	Code Available	1
Multitask vocal burst modeling with ResNets and pre-trained paralinguistic Conformers	Jun 24, 2022	Emotion Recognition, Event Detection	Unverified	0
Frequency Dependent Sound Event Detection for DCASE 2022 Challenge Task 4	Jun 23, 2022	Event Detection, Sound Event Detection	Code Available	1
Adaptive Few-Shot Learning Algorithm for Rare Sound Event Detection	May 24, 2022	Event Detection, Few-Shot Learning	Unverified	0
Frequency Dynamic Convolution: Frequency-Adaptive Pattern Recognition for Sound Event Detection	Mar 29, 2022	Event Detection, Sound Event Detection	Code Available	1
Selective Pseudo-labeling and Class-wise Discriminative Fusion for Sound Event Detection	Mar 4, 2022	Event Detection, Sound Event Detection	Unverified	0
Nonverbal Sound Detection for Disordered Speech	Feb 15, 2022	Event Detection, Sound Event Detection	Unverified	0

Datasets: DESED, L3DAS21, WildDESED, Mivia Audio Events, Mivia Road Events

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ATST-SED	event-based F1 score	63.4	—	Unverified
2	SE-CRNN-16 with DualKD	event-based F1 score	55.6	—	Unverified
3	FDY-CRNN	event-based F1 score	54	—	Unverified
4	HTS-AT	event-based F1 score	50.7	—	Unverified
5	RCT	event-based F1 score	49.62	—	Unverified
6	FiltAug SED	event-based F1 score	49.6	—	Unverified
7	SED-SSep baseline dcase task 4 2020 v2	event-based F1 score	40.7	—	Unverified
8	Baseline dcase task 4 2020 v2	event-based F1 score	39	—	Unverified
9	Baseline	event-based F1 score	25.8	—	Unverified
10	MAT-SED	PSDS1	0.59	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	PHC SEDnet n=8	Error Rate	0.56	—	Unverified
2	Quaternion SEDnet	Error Rate	0.52	—	Unverified
3	PHC SEDnet n=16	Error Rate	0.51	—	Unverified
4	PHC SEDnet n=4	Error Rate	0.45	—	Unverified
5	PHC SEDnet n=2	Error Rate	0.39	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CRNN (with BEATs + Separation)	PSDS1 (-5dB)	0.13	—	Unverified
2	CRNN (with BEATs)	PSDS1 (-5dB)	0.07	—	Unverified
3	CRNN (WildDESED + Curriculum learning)	PSDS1 (-5dB)	0.05	—	Unverified
4	CRNN (WildDESED)	PSDS1 (-5dB)	0.05	—	Unverified
5	CRNN	PSDS1 (-5dB)	0.02	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	DENet	Rank-1 Recognition Rate	0.98	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	DENet	Rank-1 Recognition Rate	1	—	Unverified