SOTAVerified

Audio Classification

Audio Classification is a machine learning task that involves identifying and tagging audio signals into different classes or categories. The goal of audio classification is to enable machines to automatically recognize and distinguish between different types of audio, such as music, speech, and environmental sounds.

Papers

Showing 110 of 361 papers

TitleStatusHype
MUPAX: Multidimensional Problem Agnostic eXplainable AI0
Task-Specific Audio Coding for Machines: Machine-Learned Latent Features Are Codes for That Machine0
Neuromorphic Wireless Split Computing with Resonate-and-Fire Neurons0
Fully Few-shot Class-incremental Audio Classification Using Multi-level Embedding Extractor and Ridge Regression ClassifierCode0
Adaptive Differential Denoising for Respiratory Sounds ClassificationCode1
Spectrotemporal Modulation: Efficient and Interpretable Feature Representation for Classifying Speech, Music, and Environmental SoundsCode0
Patient-Aware Feature Alignment for Robust Lung Sound Classification:Cohesion-Separation and Global Alignment LossesCode0
15,500 Seconds: Lean UAV Classification Leveraging PEFT and Pre-Trained NetworksCode0
4,500 Seconds: Small Data Training Approaches for Deep UAV Audio ClassificationCode0
Large Language Models Implicitly Learn to See and Hear Just By Reading0
Show:102550
← PrevPage 1 of 37Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1MBT (AV)Top 5 Accuracy85.6Unverified
2Mirasol3BTop 1 Accuracy69.8Unverified
3CA2ST(B/16)Top 1 Accuracy68.3Unverified
4CAVA(B/16)Top 1 Accuracy68.2Unverified
5ONE-PEACE (Audio-Visual)Top 1 Accuracy68.2Unverified
6MAViLTop 1 Accuracy67.1Unverified
7EquiAVTop 1 Accuracy67.1Unverified
8MMT (Audio-Visual)Top 1 Accuracy66.2Unverified
9CAV-MAE (Audio-Visual)Top 1 Accuracy65.9Unverified
10UAVM (Audio + Video)Top 1 Accuracy65.8Unverified