Audio Classification
Audio Classification is a machine learning task that involves identifying and tagging audio signals into different classes or categories. The goal of audio classification is to enable machines to automatically recognize and distinguish between different types of audio, such as music, speech, and environmental sounds.
Papers
Showing 1–10 of 361 papers
All datasetsAudioSetESC-50ICBHI Respiratory Sound DatabaseVGGSoundSHDFSD50KBalanced Audio SetSpeech CommandsSSCBirdCLEF 2021DCASEEPIC-KITCHENS-100
Benchmark Results
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | CrissCross (AudioSet) | Top-1 Accuracy | 97 | — | Unverified |
| 2 | CrissCross (Kinetics-400) | Top-1 Accuracy | 96 | — | Unverified |
| 3 | XDC | Top-1 Accuracy | 95 | — | Unverified |
| 4 | CrissCross (Kinetics-Sound) | Top-1 Accuracy | 93 | — | Unverified |