Sound Event Detection

Sound Event Detection (SED) is the task of recognizing the sound events and their respective temporal start and end time in a recording. Sound events in real life do not always occur in isolation, but tend to considerably overlap with each other. Recognizing such overlapping sound events is referred as polyphonic SED.

Source: A report on sound event detection with different binaural features

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 101–125 of 194 papers

Title	Date	Tasks	Status
Semi-supervsied Learning-based Sound Event Detection using Freuqency Dynamic Convolution with Large Kernel Attention for DCASE Challenge 2023 Task 4	Jun 10, 2023	Event DetectionPseudo Label	—Unverified
Divided spectro-temporal attention for sound event localization and detection in real scenes for DCASE2023 challenge	Jun 5, 2023	Event DetectionSound Event Detection	—Unverified
A Multi-Task Learning Framework for Sound Event Detection using High-level Acoustic Characteristics of Sounds	May 18, 2023	Event DetectionMulti-Task Learning	—Unverified
Learning to Detect Novel and Fine-Grained Acoustic Sequences Using Pretrained Audio Representations	May 3, 2023	Event DetectionFew-Shot Learning	—Unverified
Adversarial Representation Learning for Robust Privacy Preservation in Audio	Apr 29, 2023	Event DetectionRepresentation Learning	CodeCode Available
Leveraging Audio-Tagging Assisted Sound Event Detection using Weakified Strong Labels and Frequency Dynamic Convolutions	Apr 25, 2023	Audio TaggingEvent Detection	—Unverified
HiSSNet: Sound Event Detection and Speaker Identification via Hierarchical Prototypical Networks for Low-Resource Headphones	Mar 13, 2023	Event DetectionSound Event Detection	—Unverified
Improving Weakly Supervised Sound Event Detection with Causal Intervention	Mar 10, 2023	Event DetectionSound Event Detection	—Unverified
AST-SED: An Effective Sound Event Detection Method Based on Audio Spectrogram Transformer	Mar 7, 2023	Audio TaggingDecoder	—Unverified
Training sound event detection with soft labels from crowdsourced annotations	Feb 28, 2023	Event DetectionSound Event Detection	—Unverified
LOCUS: LOcalization with Channel Uncertainty and Sporadic Energy	Feb 18, 2023	Event DetectionMissing Elements	—Unverified
Multi-dimensional frequency dynamic convolution with confident mean teacher for sound event detection	Feb 18, 2023	Event DetectionSound Event Detection	—Unverified
Automatic Sound Event Detection and Classification of Great Ape Calls Using Neural Networks	Jan 5, 2023	Event DetectionSound Event Detection	CodeCode Available
Impact of visual assistance for automated audio captioning	Nov 18, 2022	Audio captioningEvent Detection	—Unverified
Lightweight Sound Event Detection Model with RepVGG Architecture	Nov 1, 2022	Event DetectionKnowledge Distillation	—Unverified
On Out-of-Distribution Detection for Audio with Deep Nearest Neighbors	Oct 27, 2022	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	CodeCode Available
Optimizing Temporal Resolution Of Convolutional Recurrent Neural Networks For Sound Event Detection	Oct 18, 2022	Event DetectionSound Event Detection	—Unverified
Impact of temporal resolution on convolutional recurrent networks for audio tagging and sound event detection	Sep 26, 2022	Audio TaggingEvent Detection	—Unverified
Multi-encoder attention-based architectures for sound recognition with partial visual assistance	Sep 26, 2022	Audio TaggingEvent Detection	—Unverified
Binaural Signal Representations for Joint Sound Event Detection and Acoustic Scene Classification	Sep 13, 2022	Acoustic Scene ClassificationEvent Detection	—Unverified
Multitask vocal burst modeling with ResNets and pre-trained paralinguistic Conformers	Jun 24, 2022	Emotion RecognitionEvent Detection	—Unverified
Adaptive Few-Shot Learning Algorithm for Rare Sound Event Detection	May 24, 2022	Event DetectionFew-Shot Learning	—Unverified
Selective Pseudo-labeling and Class-wise Discriminative Fusion for Sound Event Detection	Mar 4, 2022	Event DetectionSound Event Detection	—Unverified
Nonverbal Sound Detection for Disordered Speech	Feb 15, 2022	Event DetectionSound Event Detection	—Unverified
SEED: Sound Event Early Detection via Evidential Uncertainty	Feb 5, 2022	Event DetectionSound Event Detection	—Unverified

Show:10 25 50

← PrevPage 5 of 8Next →

All datasets DESED L3DAS21 WildDESED Mivia Audio Events Mivia Road Events

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ATST-SED	event-based F1 score	63.4	—	Unverified
2	SE-CRNN-16 with DualKD	event-based F1 score	55.6	—	Unverified
3	FDY-CRNN	event-based F1 score	54	—	Unverified
4	HTS-AT	event-based F1 score	50.7	—	Unverified
5	RCT	event-based F1 score	49.62	—	Unverified
6	FiltAug SED	event-based F1 score	49.6	—	Unverified
7	SED-SSep baseline dcase task 4 2020 v2	event-based F1 score	40.7	—	Unverified
8	Baseline dcase task 4 2020 v2	event-based F1 score	39	—	Unverified
9	Baseline	event-based F1 score	25.8	—	Unverified
10	MAT-SED	PSDS1	0.59	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	PHC SEDnet n=8	Error Rate	0.56	—	Unverified
2	Quaternion SEDnet	Error Rate	0.52	—	Unverified
3	PHC SEDnet n=16	Error Rate	0.51	—	Unverified
4	PHC SEDnet n=4	Error Rate	0.45	—	Unverified
5	PHC SEDnet n=2	Error Rate	0.39	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CRNN (with BEATs + Separation)	PSDS1 (-5dB)	0.13	—	Unverified
2	CRNN (with BEATs)	PSDS1 (-5dB)	0.07	—	Unverified
3	CRNN (WildDESED + Curriculrm learning)	PSDS1 (-5dB)	0.05	—	Unverified
4	CRNN (WildDESED)	PSDS1 (-5dB)	0.05	—	Unverified
5	CRNN	PSDS1 (-5dB)	0.02	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	DENet	Rank-1 Recognition Rate	0.98	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	DENet	Rank-1 Recognition Rate	1	—	Unverified