| EAT: Self-Supervised Pre-Training with Efficient Audio Transformer | Jan 7, 2024 | Audio ClassificationSelf-Supervised Learning | CodeCode Available | 3 |
| HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection | Feb 2, 2022 | Audio ClassificationEvent Detection | CodeCode Available | 2 |
| AudioCLIP: Extending CLIP to Image, Text and Audio | Jun 24, 2021 | ClassificationEnvironmental Sound Classification | CodeCode Available | 2 |
| Adaptive Differential Denoising for Respiratory Sounds Classification | Jun 3, 2025 | Audio ClassificationClassification | CodeCode Available | 1 |
| CycleGuardian: A Framework for Automatic RespiratorySound classification Based on Improved Deep clustering and Contrastive Learning | Feb 2, 2025 | Audio ClassificationClustering | CodeCode Available | 1 |
| Manikin-Recorded Cardiopulmonary Sounds Dataset Using Digital Stethoscope | Oct 4, 2024 | Audio Signal ProcessingSound Classification | CodeCode Available | 1 |
| BTS: Bridging Text and Sound Modalities for Metadata-Aided Respiratory Sound Classification | Jun 10, 2024 | Audio ClassificationSound Classification | CodeCode Available | 1 |
| Self-Supervised Learning for Few-Shot Bird Sound Classification | Dec 25, 2023 | ClassificationFew-Shot Learning | CodeCode Available | 1 |
| Stethoscope-guided Supervised Contrastive Learning for Cross-domain Adaptation on Respiratory Sound Classification | Dec 15, 2023 | Audio ClassificationContrastive Learning | CodeCode Available | 1 |
| Multi-View Spectrogram Transformer for Respiratory Sound Classification | Nov 16, 2023 | ClassificationSound Classification | CodeCode Available | 1 |
| Adversarial Fine-tuning using Generated Respiratory Sound to Address Class Imbalance | Nov 11, 2023 | Audio ClassificationSound Classification | CodeCode Available | 1 |
| Patch-Mix Contrastive Learning with Audio Spectrogram Transformer on Respiratory Sound Classification | May 23, 2023 | Audio ClassificationContrastive Learning | CodeCode Available | 1 |
| Unsupervised classification to improve the quality of a bird song recording dataset | Feb 15, 2023 | Sound ClassificationTemporal Localization | CodeCode Available | 1 |
| A dataset for Audio-Visual Sound Event Detection in Movies | Feb 14, 2023 | Event DetectionSelf-Driving Cars | CodeCode Available | 1 |
| Epic-Sounds: A Large-scale Dataset of Actions That Sound | Feb 1, 2023 | Action RecognitionSound Classification | CodeCode Available | 1 |
| XKD: Cross-modal Knowledge Distillation with Domain Alignment for Video Representation Learning | Nov 25, 2022 | Action ClassificationClassification | CodeCode Available | 1 |
| Effective Audio Classification Network Based on Paired Inverse Pyramid Structure and Dense MLP Block | Nov 5, 2022 | Audio ClassificationClassification | CodeCode Available | 1 |
| Pretraining Respiratory Sound Representations using Metadata and Contrastive Learning | Oct 27, 2022 | Audio ClassificationClassification | CodeCode Available | 1 |
| Enemy Spotted: in-game gun sound dataset for gunshot classification and localization | Oct 12, 2022 | ClassificationSound Classification | CodeCode Available | 1 |
| Enemy Spotted: in-game gun sound dataset for gunshot classification and localization | Aug 22, 2022 | ClassificationSound Classification | CodeCode Available | 1 |
| HouseX: A Fine-grained House Music Dataset and its Potential in the Music Industry | Jul 24, 2022 | ClassificationSound Classification | CodeCode Available | 1 |
| Continual Learning For On-Device Environmental Sound Classification | Jul 15, 2022 | ClassificationComputational Efficiency | CodeCode Available | 1 |
| End-to-End Audio Strikes Back: Boosting Augmentations Towards An Efficient Audio Classification Network | Apr 25, 2022 | Audio ClassificationClassification | CodeCode Available | 1 |
| AUCO ResNet: an end-to-end network for Covid-19 pre-screening from cough and breath | Mar 15, 2022 | 8kAudio Classification | CodeCode Available | 1 |
| Self-Supervised Audio-Visual Representation Learning with Relaxed Cross-Modal Synchronicity | Nov 9, 2021 | Audio ClassificationRetrieval | CodeCode Available | 1 |
| Differentiable Tracking-Based Training of Deep Learning Sound Source Localizers | Oct 29, 2021 | ClassificationDeep Learning | CodeCode Available | 1 |
| Contrastive Learning of Global-Local Video Representations | Apr 7, 2021 | ClassificationContrastive Learning | CodeCode Available | 1 |
| Tiny Transformers for Environmental Sound Classification at the Edge | Mar 22, 2021 | ClassificationEnvironmental Sound Classification | CodeCode Available | 1 |
| Learning spectro-temporal representations of complex sounds with parameterized neural networks | Mar 12, 2021 | Action DetectionActivity Detection | CodeCode Available | 1 |
| Environmental Sound Classification on the Edge: A Pipeline for Deep Acoustic Networks on Extremely Resource-Constrained Devices | Mar 5, 2021 | Audio ClassificationEnvironmental Sound Classification | CodeCode Available | 1 |
| SoundCLR: Contrastive Learning of Representations For Improved Environmental Sound Classification | Mar 2, 2021 | Contrastive LearningData Augmentation | CodeCode Available | 1 |
| Comparison of semi-supervised deep learning algorithms for audio classification | Feb 16, 2021 | Audio ClassificationAudio Tagging | CodeCode Available | 1 |
| Urban Sound Classification : striving towards a fair comparison | Oct 22, 2020 | Audio ClassificationAudio Tagging | CodeCode Available | 1 |
| ESResNet: Environmental Sound Classification Based on Visual Domain Models | Apr 15, 2020 | ClassificationEnvironmental Sound Classification | CodeCode Available | 1 |
| USAD: Universal Speech and Audio Representation via Distillation | Jun 23, 2025 | Audio TaggingRepresentation Learning | —Unverified | 0 |
| Acoustic scattering AI for non-invasive object classifications: A case study on hair assessment | Jun 17, 2025 | ClassificationDeep Learning | —Unverified | 0 |
| Disentangling Dual-Encoder Masked Autoencoder for Respiratory Sound Classification | Jun 12, 2025 | DisentanglementSound Classification | —Unverified | 0 |
| MUDAS: Mote-scale Unsupervised Domain Adaptation in Multi-label Sound Classification | Jun 12, 2025 | ClassificationDomain Adaptation | —Unverified | 0 |
| Domain Adaptation Method and Modality Gap Impact in Audio-Text Models for Prototypical Sound Classification | Jun 4, 2025 | ClassificationDomain Adaptation | CodeCode Available | 0 |
| General-purpose audio representation learning for real-world sound scenes | Jun 1, 2025 | MambaRepresentation Learning | —Unverified | 0 |
| Patient Domain Supervised Contrastive Learning for Lung Sound Classification Using Mobile Phone | May 29, 2025 | Contrastive LearningDiagnostic | —Unverified | 0 |
| Improving Respiratory Sound Classification with Architecture-Agnostic Knowledge Distillation from Ensembles | May 28, 2025 | Knowledge DistillationSound Classification | CodeCode Available | 0 |
| Patient-Aware Feature Alignment for Robust Lung Sound Classification:Cohesion-Separation and Global Alignment Losses | May 28, 2025 | Audio ClassificationLung Sound Classification | CodeCode Available | 0 |
| The Computation of Generalized Embeddings for Underwater Acoustic Target Recognition using Contrastive Learning | May 19, 2025 | Contrastive LearningSound Classification | CodeCode Available | 0 |
| Tri-MTL: A Triple Multitask Learning Approach for Respiratory Disease Diagnosis | May 6, 2025 | DiagnosticLung Sound Classification | —Unverified | 0 |
| Can Masked Autoencoders Also Listen to Birds? | Apr 17, 2025 | Audio ClassificationMulti-Label Classification | —Unverified | 0 |
| Respiratory Inhaler Sound Event Classification Using Self-Supervised Learning | Apr 15, 2025 | ClassificationSelf-Supervised Learning | —Unverified | 0 |
| Mind the Prompt: Prompting Strategies in Audio Generations for Improving Sound Classification | Apr 4, 2025 | ClassificationData Augmentation | —Unverified | 0 |
| Symbolic Audio Classification via Modal Decision Tree Learning | Mar 21, 2025 | Audio ClassificationClassification | —Unverified | 0 |
| Weakly Supervised Convolutional Dictionary Learning for Multi-Label Classification | Mar 11, 2025 | ClassificationDictionary Learning | CodeCode Available | 0 |