Sound Event Detection
Sound Event Detection (SED) is the task of recognizing the sound events and their respective temporal start and end time in a recording. Sound events in real life do not always occur in isolation, but tend to considerably overlap with each other. Recognizing such overlapping sound events is referred as polyphonic SED.
Source: A report on sound event detection with different binaural features
Papers
Showing 1–10 of 194 papers
Benchmark Results
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | CRNN (with BEATs + Separation) | PSDS1 (-5dB) | 0.13 | — | Unverified |
| 2 | CRNN (with BEATs) | PSDS1 (-5dB) | 0.07 | — | Unverified |
| 3 | CRNN (WildDESED + Curriculrm learning) | PSDS1 (-5dB) | 0.05 | — | Unverified |
| 4 | CRNN (WildDESED) | PSDS1 (-5dB) | 0.05 | — | Unverified |
| 5 | CRNN | PSDS1 (-5dB) | 0.02 | — | Unverified |