Audio Tagging
Audio tagging is the task of predicting the tags of an audio clip. It includes music tagging, acoustic scene classification, and audio event classification, among others.
Benchmark Results
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | CAV-MAE (Audio-Visual) | mean average precision | 0.51 | — | Unverified |
| 2 | mn40_as (Ensemble) | mean average precision | 0.5 | — | Unverified |
| 3 | PaSST | mean average precision | 0.5 | — | Unverified |
| 4 | DyMN-L (Audio-Only, Single) | mean average precision | 0.49 | — | Unverified |
| 5 | Audio Spectrogram Transformer | mean average precision | 0.49 | — | Unverified |
| 6 | mn40_as (Single) | mean average precision | 0.48 | — | Unverified |
| 7 | PSLA | mean average precision | 0.47 | — | Unverified |
| 8 | ST-SED | mean average precision | 0.47 | — | Unverified |
| 9 | CAV-MAE (Audio-Only) | mean average precision | 0.47 | — | Unverified |
| 10 | ERANN-1-6 | mean average precision | 0.45 | — | Unverified |
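
Every row above is scored with mean average precision (mAP). The page does not say how the metric is computed, so the following is only a minimal sketch under common assumptions: tagging is treated as a multi-label problem, average precision is computed per tag from the ranking of clips by score, and the per-tag values are macro-averaged. The function names and the toy data are hypothetical and are not taken from any of the listed models.

```python
def average_precision(labels, scores):
    """AP for one tag: mean of the precision at the rank of each positive clip."""
    # Rank clips from highest to lowest predicted score.
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    hits = 0
    precisions = []
    for rank, idx in enumerate(order, start=1):
        if labels[idx] == 1:
            hits += 1
            precisions.append(hits / rank)
    # AP is undefined for a tag with no positive clips; signal that with None.
    return sum(precisions) / len(precisions) if precisions else None


def mean_average_precision(label_matrix, score_matrix):
    """Macro-average AP over tags. Rows are clips, columns are tags."""
    n_tags = len(label_matrix[0])
    aps = []
    for t in range(n_tags):
        tag_labels = [row[t] for row in label_matrix]
        tag_scores = [row[t] for row in score_matrix]
        ap = average_precision(tag_labels, tag_scores)
        if ap is not None:  # skip tags that never occur (assumed convention)
            aps.append(ap)
    return sum(aps) / len(aps)


# Toy example (made-up values): 4 clips, 3 tags.
y_true = [
    [1, 0, 1],
    [0, 1, 0],
    [1, 1, 0],
    [0, 0, 1],
]
y_score = [
    [0.9, 0.2, 0.4],
    [0.1, 0.8, 0.3],
    [0.4, 0.6, 0.7],
    [0.5, 0.1, 0.9],
]
map_score = mean_average_precision(y_true, y_score)
print(f"mAP = {map_score:.4f}")
```

Because mAP depends only on how clips are ranked for each tag, it needs no decision threshold, which is one reason it suits multi-label benchmarks. Individual benchmarks may handle tied scores or absent tags differently from this sketch.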