SOTAVerified

Sound Event Detection

Sound Event Detection (SED) is the task of recognizing the sound events and their respective temporal start and end time in a recording. Sound events in real life do not always occur in isolation, but tend to considerably overlap with each other. Recognizing such overlapping sound events is referred as polyphonic SED.

Source: A report on sound event detection with different binaural features

Papers

Showing 101150 of 194 papers

TitleStatusHype
SEED: Sound Event Early Detection via Evidential Uncertainty0
A benchmark of state-of-the-art sound event detection systems evaluated on synthetic soundscapes0
HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and DetectionCode2
Threshold Independent Evaluation of Sound Event Detection ScoresCode1
Zero-shot Audio Source Separation through Query-based Learningfrom Weakly-labeled Data0
Zero-shot Audio Source Separation through Query-based Learning from Weakly-labeled DataCode1
Effect of noise suppression losses on speech distortion and ASR performance0
Evaluating robustness of You Only Hear Once(YOHO) Algorithm on noisy audios in the VOICe DatasetCode0
RCT: Random Consistency Training for Semi-supervised Sound Event DetectionCode1
Couple Learning for semi-supervised sound event detectionCode1
PHNNs: Lightweight Neural Networks via Parameterized Hypercomplex ConvolutionsCode1
Peer Collaborative Learning for Polyphonic Sound Event Detection0
FilterAugment: An Acoustic Environmental Data Augmentation MethodCode1
Unsupervised Audio-Caption Aligning Learns Correspondences between Individual Sound Events and Textual PhrasesCode0
Sound Event Detection Transformer: An Event-based End-to-End Model for Sound Event DetectionCode1
RCRNN-based Sound Event Detection System with Specific Speech Resolution0
SALSA: Spatial Cue-Augmented Log-Spectrogram Features for Polyphonic Sound Event Localization and DetectionCode1
The impact of non-target events in synthetic soundscapes for sound event detectionCode1
You Only Hear Once: A YOLO-like Algorithm for Audio Segmentation and Sound Event DetectionCode1
Uncertainty quantification for multiclass data description0
Proposal-based Few-shot Sound Event Detection for Speech and Environmental Sounds with Perceivers0
Crowdsourcing strong labels for sound event detection0
Improving Polyphonic Sound Event Detection on Multichannel Recordings with the Sørensen-Dice Coefficient Loss and Transfer Learning0
What Makes Sound Event Localization and Detection Difficult? Insights from Error AnalysisCode1
Sound Event Detection: A TutorialCode0
Weakly-Supervised Classification and Detection of Bird Sounds in the Wild.Code1
Heavily Augmented Sound Event Detection utilizing Weak PredictionsCode1
Self-training with noisy student model and semi-supervised loss function for dcase 2021 challenge task 40
Robust and Interpretable Temporal Convolution Network for Event Detection in Lung Sound Recordings0
DCASE 2021 Task 3: Spectrotemporally-aligned Features for Polyphonic Sound Event Localization and DetectionCode1
Do sound event representations generalize to other audio tasks? A case study in audio transfer learning0
Affinity Mixup for Weakly Supervised Sound Event Detection0
SoundDet: Polyphonic Moving Sound Event Detection and Localization from Raw Waveform0
Improving weakly supervised sound event detection with self-supervised auxiliary tasksCode1
Audiovisual transfer learning for audio tagging and sound event detection0
Cross-Referencing Self-Training Network for Sound Event Detection in Audio MixturesCode0
USM-SED - A Dataset for Polyphonic Sound Event Detection in Urban Sound Monitoring Scenarios0
Forward-Backward Convolutional Recurrent Neural Networks and Tag-Conditioned Convolutional Neural Networks for Weakly Labeled Semi-supervised Sound Event DetectionCode1
DESED-FL and URBAN-FL: Federated Learning Datasets for Sound Event Detection0
Deep Convolutional and Recurrent Networks for Polyphonic Instrument Classification from Monophonic Raw Audio Waveforms0
Sound Event Detection in Urban Audio With Single and Multi-Rate PCEN0
Sound Event Detection with Binary Neural Networks on Tightly Power-Constrained IoT Devices0
DENet: a deep architecture for audio surveillance applicationsCode1
Soft-Median Choice: An Automatic Feature Smoothing Method for Sound Event Detection0
Sound Event Detection in Domestic Environments using Dense Recurrent Neural Network0
ACCDOA: Activity-Coupled Cartesian Direction of Arrival Representation for Sound Event Localization and DetectionCode1
Improving Sound Event Detection Metrics: Insights from DCASE 20200
Power pooling: An adaptive pooling function for weakly labelled sound event detection0
DCASENET: A joint pre-trained deep neural network for detecting and classifying acoustic scenes and eventsCode1
Multi-Task Learning for Interpretable Weakly Labelled Sound Event DetectionCode1
Show:102550
← PrevPage 3 of 4Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ATST-SEDevent-based F1 score63.4Unverified
2SE-CRNN-16 with DualKDevent-based F1 score55.6Unverified
3FDY-CRNNevent-based F1 score54Unverified
4HTS-ATevent-based F1 score50.7Unverified
5RCTevent-based F1 score49.62Unverified
6FiltAug SEDevent-based F1 score49.6Unverified
7SED-SSep baseline dcase task 4 2020 v2event-based F1 score40.7Unverified
8Baseline dcase task 4 2020 v2event-based F1 score39Unverified
9Baselineevent-based F1 score25.8Unverified
10MAT-SEDPSDS10.59Unverified
#ModelMetricClaimedVerifiedStatus
1PHC SEDnet n=8Error Rate0.56Unverified
2Quaternion SEDnetError Rate0.52Unverified
3PHC SEDnet n=16Error Rate0.51Unverified
4PHC SEDnet n=4Error Rate0.45Unverified
5PHC SEDnet n=2Error Rate0.39Unverified
#ModelMetricClaimedVerifiedStatus
1CRNN (with BEATs + Separation)PSDS1 (-5dB)0.13Unverified
2CRNN (with BEATs)PSDS1 (-5dB)0.07Unverified
3CRNN (WildDESED + Curriculrm learning)PSDS1 (-5dB)0.05Unverified
4CRNN (WildDESED)PSDS1 (-5dB)0.05Unverified
5CRNNPSDS1 (-5dB)0.02Unverified
#ModelMetricClaimedVerifiedStatus
1DENetRank-1 Recognition Rate0.98Unverified
#ModelMetricClaimedVerifiedStatus
1DENetRank-1 Recognition Rate1Unverified