Sound Event Detection

Sound Event Detection (SED) is the task of recognizing the sound events and their respective temporal start and end time in a recording. Sound events in real life do not always occur in isolation, but tend to considerably overlap with each other. Recognizing such overlapping sound events is referred as polyphonic SED.

Source: A report on sound event detection with different binaural features

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 51–100 of 194 papers

Title	Date	Tasks	Status	Hype
Interactive Dual-Conformer with Scene-Inspired Mask for Soft Sound Event Detection	Nov 23, 2023	Event DetectionSound Event Detection	—Unverified	0
AudioLog: LLMs-Powered Long Audio Logging with Hybrid Token-Semantic Contrastive Learning	Nov 21, 2023	Acoustic Scene ClassificationAudio captioning	CodeCode Available	0
SwG-former: A Sliding-Window Graph Convolutional Network for Simultaneous Spatial-Temporal Information Extraction in Sound Event Localization and Detection	Oct 21, 2023	Event DetectionSound Event Detection	—Unverified	0
Performance and energy balance: a comprehensive study of state-of-the-art sound event detection systems	Oct 5, 2023	Event DetectionSound Event Detection	CodeCode Available	0
Online Active Learning For Sound Event Detection	Sep 25, 2023	Active LearningEvent Detection	—Unverified	0
Evaluating Classification Systems Against Soft Labels with Fuzzy Precision and Recall	Sep 25, 2023	Event DetectionSound Event Detection	—Unverified	0
Regularized Contrastive Pre-training for Few-shot Bioacoustic Sound Detection	Sep 16, 2023	Event DetectionFew-Shot Learning	CodeCode Available	0
Semi-supervised Sound Event Detection with Local and Global Consistency Regularization	Sep 15, 2023	Event DetectionSound Event Detection	—Unverified	0
Fine-tune the pretrained ATST model for sound event detection	Sep 15, 2023	Event DetectionSelf-Supervised Learning	CodeCode Available	1
Leveraging Geometrical Acoustic Simulations of Spatial Room Impulse Responses for Improved Sound Event Detection and Localization	Sep 6, 2023	Event DetectionSound Event Detection	CodeCode Available	0
Pretraining Representations for Bioacoustic Few-shot Detection using Supervised Contrastive Learning	Sep 2, 2023	Contrastive LearningData Augmentation	CodeCode Available	1
Leveraging Language Model Capabilities for Sound Event Detection	Aug 22, 2023	DecoderEvent Detection	—Unverified	0
DiffSED: Sound Event Detection with Denoising Diffusion	Aug 14, 2023	DecoderDenoising	—Unverified	0
Post-Processing Independent Evaluation of Sound Event Detection Systems	Jun 27, 2023	Event DetectionSound Event Detection	CodeCode Available	1
Auditory Neural Response Inspired Sound Event Detection Based on Spectro-temporal Receptive Field	Jun 20, 2023	Event DetectionSound Event Detection	—Unverified	0
Channel-Spatial-Based Few-Shot Bird Sound Event Detection	Jun 18, 2023	Event DetectionFew-Shot Learning	—Unverified	0
Few-shot bioacoustic event detection at the DCASE 2023 challenge	Jun 15, 2023	Event DetectionFew-Shot Learning	CodeCode Available	1
Semi-supervsied Learning-based Sound Event Detection using Freuqency Dynamic Convolution with Large Kernel Attention for DCASE Challenge 2023 Task 4	Jun 10, 2023	Event DetectionPseudo Label	—Unverified	0
Self-supervised Audio Teacher-Student Transformer for Both Clip-level and Frame-level Tasks	Jun 7, 2023	Audio ClassificationAudio Tagging	CodeCode Available	1
Divided spectro-temporal attention for sound event localization and detection in real scenes for DCASE2023 challenge	Jun 5, 2023	Event DetectionSound Event Detection	—Unverified	0
A Multi-Task Learning Framework for Sound Event Detection using High-level Acoustic Characteristics of Sounds	May 18, 2023	Event DetectionMulti-Task Learning	—Unverified	0
Learning to Detect Novel and Fine-Grained Acoustic Sequences Using Pretrained Audio Representations	May 3, 2023	Event DetectionFew-Shot Learning	—Unverified	0
Adversarial Representation Learning for Robust Privacy Preservation in Audio	Apr 29, 2023	Event DetectionRepresentation Learning	CodeCode Available	0
Leveraging Audio-Tagging Assisted Sound Event Detection using Weakified Strong Labels and Frequency Dynamic Convolutions	Apr 25, 2023	Audio TaggingEvent Detection	—Unverified	0
WavCaps: A ChatGPT-Assisted Weakly-Labelled Audio Captioning Dataset for Audio-Language Multimodal Research	Mar 30, 2023	Audio captioningEvent Detection	CodeCode Available	2
AD-YOLO: You Look Only Once in Training Multiple Sound Event Localization and Detection	Mar 28, 2023	Direction of Arrival EstimationSound Event Detection	CodeCode Available	1
HiSSNet: Sound Event Detection and Speaker Identification via Hierarchical Prototypical Networks for Low-Resource Headphones	Mar 13, 2023	Event DetectionSound Event Detection	—Unverified	0
Improving Weakly Supervised Sound Event Detection with Causal Intervention	Mar 10, 2023	Event DetectionSound Event Detection	—Unverified	0
AST-SED: An Effective Sound Event Detection Method Based on Audio Spectrogram Transformer	Mar 7, 2023	Audio TaggingDecoder	—Unverified	0
Training sound event detection with soft labels from crowdsourced annotations	Feb 28, 2023	Event DetectionSound Event Detection	—Unverified	0
LOCUS: LOcalization with Channel Uncertainty and Sporadic Energy	Feb 18, 2023	Event DetectionMissing Elements	—Unverified	0
Multi-dimensional frequency dynamic convolution with confident mean teacher for sound event detection	Feb 18, 2023	Event DetectionSound Event Detection	—Unverified	0
A dataset for Audio-Visual Sound Event Detection in Movies	Feb 14, 2023	Event DetectionSelf-Driving Cars	CodeCode Available	1
Automatic Sound Event Detection and Classification of Great Ape Calls Using Neural Networks	Jan 5, 2023	Event DetectionSound Event Detection	CodeCode Available	0
Impact of visual assistance for automated audio captioning	Nov 18, 2022	Audio captioningEvent Detection	—Unverified	0
Lightweight Sound Event Detection Model with RepVGG Architecture	Nov 1, 2022	Event DetectionKnowledge Distillation	—Unverified	0
On Out-of-Distribution Detection for Audio with Deep Nearest Neighbors	Oct 27, 2022	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	CodeCode Available	0
A Hybrid System of Sound Event Detection Transformer and Frame-wise Model for DCASE 2022 Task 4	Oct 18, 2022	Event DetectionMetric Learning	CodeCode Available	1
Optimizing Temporal Resolution Of Convolutional Recurrent Neural Networks For Sound Event Detection	Oct 18, 2022	Event DetectionSound Event Detection	—Unverified	0
Multi-encoder attention-based architectures for sound recognition with partial visual assistance	Sep 26, 2022	Audio TaggingEvent Detection	—Unverified	0
Impact of temporal resolution on convolutional recurrent networks for audio tagging and sound event detection	Sep 26, 2022	Audio TaggingEvent Detection	—Unverified	0
Binaural Signal Representations for Joint Sound Event Detection and Acoustic Scene Classification	Sep 13, 2022	Acoustic Scene ClassificationEvent Detection	—Unverified	0
Sound Event Localization and Detection for Real Spatial Sound Scenes: Event-Independent Network and Data Augmentation Chains	Sep 5, 2022	Data AugmentationDirection of Arrival Estimation	CodeCode Available	1
Few-shot bioacoustic event detection at the DCASE 2022 challenge	Jul 14, 2022	Event DetectionSound Event Detection	CodeCode Available	1
Multitask vocal burst modeling with ResNets and pre-trained paralinguistic Conformers	Jun 24, 2022	Emotion RecognitionEvent Detection	—Unverified	0
Frequency Dependent Sound Event Detection for DCASE 2022 Challenge Task 4	Jun 23, 2022	Event DetectionSound Event Detection	CodeCode Available	1
Adaptive Few-Shot Learning Algorithm for Rare Sound Event Detection	May 24, 2022	Event DetectionFew-Shot Learning	—Unverified	0
Frequency Dynamic Convolution: Frequency-Adaptive Pattern Recognition for Sound Event Detection	Mar 29, 2022	Event DetectionSound Event Detection	CodeCode Available	1
Selective Pseudo-labeling and Class-wise Discriminative Fusion for Sound Event Detection	Mar 4, 2022	Event DetectionSound Event Detection	—Unverified	0
Nonverbal Sound Detection for Disordered Speech	Feb 15, 2022	Event DetectionSound Event Detection	—Unverified	0

Show:10 25 50

← PrevPage 2 of 4Next →

All datasets DESED L3DAS21 WildDESED Mivia Audio Events Mivia Road Events

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ATST-SED	event-based F1 score	63.4	—	Unverified
2	SE-CRNN-16 with DualKD	event-based F1 score	55.6	—	Unverified
3	FDY-CRNN	event-based F1 score	54	—	Unverified
4	HTS-AT	event-based F1 score	50.7	—	Unverified
5	RCT	event-based F1 score	49.62	—	Unverified
6	FiltAug SED	event-based F1 score	49.6	—	Unverified
7	SED-SSep baseline dcase task 4 2020 v2	event-based F1 score	40.7	—	Unverified
8	Baseline dcase task 4 2020 v2	event-based F1 score	39	—	Unverified
9	Baseline	event-based F1 score	25.8	—	Unverified
10	MAT-SED	PSDS1	0.59	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	PHC SEDnet n=8	Error Rate	0.56	—	Unverified
2	Quaternion SEDnet	Error Rate	0.52	—	Unverified
3	PHC SEDnet n=16	Error Rate	0.51	—	Unverified
4	PHC SEDnet n=4	Error Rate	0.45	—	Unverified
5	PHC SEDnet n=2	Error Rate	0.39	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CRNN (with BEATs + Separation)	PSDS1 (-5dB)	0.13	—	Unverified
2	CRNN (with BEATs)	PSDS1 (-5dB)	0.07	—	Unverified
3	CRNN (WildDESED + Curriculrm learning)	PSDS1 (-5dB)	0.05	—	Unverified
4	CRNN (WildDESED)	PSDS1 (-5dB)	0.05	—	Unverified
5	CRNN	PSDS1 (-5dB)	0.02	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	DENet	Rank-1 Recognition Rate	0.98	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	DENet	Rank-1 Recognition Rate	1	—	Unverified