SOTAVerified

Sound Event Detection

Sound Event Detection (SED) is the task of recognizing the sound events and their respective temporal start and end time in a recording. Sound events in real life do not always occur in isolation, but tend to considerably overlap with each other. Recognizing such overlapping sound events is referred as polyphonic SED.

Source: A report on sound event detection with different binaural features

Papers

Showing 5175 of 194 papers

TitleStatusHype
Pushing the Limit of Sound Event Detection with Multi-Dilated Frequency Dynamic ConvolutionCode1
RCT: Random Consistency Training for Semi-supervised Sound Event DetectionCode1
Revisiting Deep Audio-Text Retrieval Through the Lens of TransportationCode1
Couple Learning for semi-supervised sound event detectionCode1
A Hybrid System of Sound Event Detection Transformer and Frame-wise Model for DCASE 2022 Task 4Code1
Exploring Performance-Complexity Trade-Offs in Sound Event Detection ModelsCode1
Sound Event Detection: A TutorialCode0
Sound event detection in domestic environments withweakly labeled data and soundscape synthesisCode0
City classification from multiple real-world sound scenesCode0
Evaluation of post-processing algorithms for polyphonic sound event detectionCode0
Evaluating robustness of You Only Hear Once(YOHO) Algorithm on noisy audios in the VOICe DatasetCode0
Regularized Contrastive Pre-training for Few-shot Bioacoustic Sound DetectionCode0
Recurrent Neural Networks for Polyphonic Sound Event Detection in Real Life RecordingsCode0
Robust sound event detection in bioacoustic sensor networksCode0
Empirical Study of Drone Sound Detection in Real-Life Environment with Deep Neural NetworksCode0
Automatic Sound Event Detection and Classification of Great Ape Calls Using Neural NetworksCode0
Sound Event Detection Using Spatial Features and Convolutional Recurrent Neural NetworkCode0
Musical Instrument Playing Technique Detection Based on FCN: Using Chinese Bowed-Stringed Instrument as an ExampleCode0
Specialized Decision Surface and Disentangled Feature for Weakly-Supervised Polyphonic Sound Event DetectionCode0
On Out-of-Distribution Detection for Audio with Deep Nearest NeighborsCode0
Memory Controlled Sequential Self Attention for Sound RecognitionCode0
Adversarial Representation Learning for Robust Privacy Preservation in AudioCode0
AudioLog: LLMs-Powered Long Audio Logging with Hybrid Token-Semantic Contrastive LearningCode0
MTDA-HSED: Mutual-Assistance Tuning and Dual-Branch Aggregating for Heterogeneous Sound Event DetectionCode0
Performance and energy balance: a comprehensive study of state-of-the-art sound event detection systemsCode0
Show:102550
← PrevPage 3 of 8Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ATST-SEDevent-based F1 score63.4Unverified
2SE-CRNN-16 with DualKDevent-based F1 score55.6Unverified
3FDY-CRNNevent-based F1 score54Unverified
4HTS-ATevent-based F1 score50.7Unverified
5RCTevent-based F1 score49.62Unverified
6FiltAug SEDevent-based F1 score49.6Unverified
7SED-SSep baseline dcase task 4 2020 v2event-based F1 score40.7Unverified
8Baseline dcase task 4 2020 v2event-based F1 score39Unverified
9Baselineevent-based F1 score25.8Unverified
10MAT-SEDPSDS10.59Unverified
#ModelMetricClaimedVerifiedStatus
1PHC SEDnet n=8Error Rate0.56Unverified
2Quaternion SEDnetError Rate0.52Unverified
3PHC SEDnet n=16Error Rate0.51Unverified
4PHC SEDnet n=4Error Rate0.45Unverified
5PHC SEDnet n=2Error Rate0.39Unverified
#ModelMetricClaimedVerifiedStatus
1CRNN (with BEATs + Separation)PSDS1 (-5dB)0.13Unverified
2CRNN (with BEATs)PSDS1 (-5dB)0.07Unverified
3CRNN (WildDESED + Curriculrm learning)PSDS1 (-5dB)0.05Unverified
4CRNN (WildDESED)PSDS1 (-5dB)0.05Unverified
5CRNNPSDS1 (-5dB)0.02Unverified
#ModelMetricClaimedVerifiedStatus
1DENetRank-1 Recognition Rate0.98Unverified
#ModelMetricClaimedVerifiedStatus
1DENetRank-1 Recognition Rate1Unverified