SOTAVerified

Sound Classification

Papers

Showing 1–25 of 148 papers

Title | Status | Hype
EAT: Self-Supervised Pre-Training with Efficient Audio Transformer | Code | 3
HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection | Code | 2
AudioCLIP: Extending CLIP to Image, Text and Audio | Code | 2
Adaptive Differential Denoising for Respiratory Sounds Classification | Code | 1
CycleGuardian: A Framework for Automatic RespiratorySound classification Based on Improved Deep clustering and Contrastive Learning | Code | 1
Manikin-Recorded Cardiopulmonary Sounds Dataset Using Digital Stethoscope | Code | 1
BTS: Bridging Text and Sound Modalities for Metadata-Aided Respiratory Sound Classification | Code | 1
Self-Supervised Learning for Few-Shot Bird Sound Classification | Code | 1
Stethoscope-guided Supervised Contrastive Learning for Cross-domain Adaptation on Respiratory Sound Classification | Code | 1
Multi-View Spectrogram Transformer for Respiratory Sound Classification | Code | 1
Adversarial Fine-tuning using Generated Respiratory Sound to Address Class Imbalance | Code | 1
Patch-Mix Contrastive Learning with Audio Spectrogram Transformer on Respiratory Sound Classification | Code | 1
Unsupervised classification to improve the quality of a bird song recording dataset | Code | 1
A dataset for Audio-Visual Sound Event Detection in Movies | Code | 1
Epic-Sounds: A Large-scale Dataset of Actions That Sound | Code | 1
XKD: Cross-modal Knowledge Distillation with Domain Alignment for Video Representation Learning | Code | 1
Effective Audio Classification Network Based on Paired Inverse Pyramid Structure and Dense MLP Block | Code | 1
Pretraining Respiratory Sound Representations using Metadata and Contrastive Learning | Code | 1
Enemy Spotted: in-game gun sound dataset for gunshot classification and localization | Code | 1
Enemy Spotted: in-game gun sound dataset for gunshot classification and localization | Code | 1
HouseX: A Fine-grained House Music Dataset and its Potential in the Music Industry | Code | 1
Continual Learning For On-Device Environmental Sound Classification | Code | 1
End-to-End Audio Strikes Back: Boosting Augmentations Towards An Efficient Audio Classification Network | Code | 1
AUCO ResNet: an end-to-end network for Covid-19 pre-screening from cough and breath | Code | 1
Self-Supervised Audio-Visual Representation Learning with Relaxed Cross-Modal Synchronicity | Code | 1

No leaderboard results yet.