SOTAVerified

Audio Classification

Audio Classification is a machine learning task that involves identifying and tagging audio signals into different classes or categories. The goal of audio classification is to enable machines to automatically recognize and distinguish between different types of audio, such as music, speech, and environmental sounds.

Papers

Showing 110 of 361 papers

TitleStatusHype
MUPAX: Multidimensional Problem Agnostic eXplainable AI0
Task-Specific Audio Coding for Machines: Machine-Learned Latent Features Are Codes for That Machine0
Neuromorphic Wireless Split Computing with Resonate-and-Fire Neurons0
Fully Few-shot Class-incremental Audio Classification Using Multi-level Embedding Extractor and Ridge Regression ClassifierCode0
Adaptive Differential Denoising for Respiratory Sounds ClassificationCode1
Spectrotemporal Modulation: Efficient and Interpretable Feature Representation for Classifying Speech, Music, and Environmental SoundsCode0
Patient-Aware Feature Alignment for Robust Lung Sound Classification:Cohesion-Separation and Global Alignment LossesCode0
15,500 Seconds: Lean UAV Classification Leveraging PEFT and Pre-Trained NetworksCode0
4,500 Seconds: Small Data Training Approaches for Deep UAV Audio ClassificationCode0
Large Language Models Implicitly Learn to See and Hear Just By Reading0
Show:102550
← PrevPage 1 of 37Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1EquiAVMean AP42.4Unverified
2SSLAMMean AP40.9Unverified
3EATMean AP40.3Unverified
4BEATsMean AP38.9Unverified
5Base (ours)Mean AP37.4Unverified
6SSAST-PATCHMean AP31Unverified
7SSAST-FRAMEMean AP29.2Unverified
8ConformerMean AP27.6Unverified