| EAT: Self-Supervised Pre-Training with Efficient Audio Transformer | Jan 7, 2024 | Audio ClassificationSelf-Supervised Learning | CodeCode Available | 3 |
| HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection | Feb 2, 2022 | Audio ClassificationEvent Detection | CodeCode Available | 2 |
| AudioCLIP: Extending CLIP to Image, Text and Audio | Jun 24, 2021 | ClassificationEnvironmental Sound Classification | CodeCode Available | 2 |
| Contrastive Learning of Global-Local Video Representations | Apr 7, 2021 | ClassificationContrastive Learning | CodeCode Available | 1 |
| SoundCLR: Contrastive Learning of Representations For Improved Environmental Sound Classification | Mar 2, 2021 | Contrastive LearningData Augmentation | CodeCode Available | 1 |
| Self-Supervised Audio-Visual Representation Learning with Relaxed Cross-Modal Synchronicity | Nov 9, 2021 | Audio ClassificationRetrieval | CodeCode Available | 1 |
| Adversarial Fine-tuning using Generated Respiratory Sound to Address Class Imbalance | Nov 11, 2023 | Audio ClassificationSound Classification | CodeCode Available | 1 |
| Multi-View Spectrogram Transformer for Respiratory Sound Classification | Nov 16, 2023 | ClassificationSound Classification | CodeCode Available | 1 |
| Pretraining Respiratory Sound Representations using Metadata and Contrastive Learning | Oct 27, 2022 | Audio ClassificationClassification | CodeCode Available | 1 |
| Differentiable Tracking-Based Training of Deep Learning Sound Source Localizers | Oct 29, 2021 | ClassificationDeep Learning | CodeCode Available | 1 |
| Urban Sound Classification : striving towards a fair comparison | Oct 22, 2020 | Audio ClassificationAudio Tagging | CodeCode Available | 1 |
| Effective Audio Classification Network Based on Paired Inverse Pyramid Structure and Dense MLP Block | Nov 5, 2022 | Audio ClassificationClassification | CodeCode Available | 1 |
| Adaptive Differential Denoising for Respiratory Sounds Classification | Jun 3, 2025 | Audio ClassificationClassification | CodeCode Available | 1 |
| AUCO ResNet: an end-to-end network for Covid-19 pre-screening from cough and breath | Mar 15, 2022 | 8kAudio Classification | CodeCode Available | 1 |
| ESResNet: Environmental Sound Classification Based on Visual Domain Models | Apr 15, 2020 | ClassificationEnvironmental Sound Classification | CodeCode Available | 1 |
| Environmental Sound Classification on the Edge: A Pipeline for Deep Acoustic Networks on Extremely Resource-Constrained Devices | Mar 5, 2021 | Audio ClassificationEnvironmental Sound Classification | CodeCode Available | 1 |
| Comparison of semi-supervised deep learning algorithms for audio classification | Feb 16, 2021 | Audio ClassificationAudio Tagging | CodeCode Available | 1 |
| Manikin-Recorded Cardiopulmonary Sounds Dataset Using Digital Stethoscope | Oct 4, 2024 | Audio Signal ProcessingSound Classification | CodeCode Available | 1 |
| Self-Supervised Learning for Few-Shot Bird Sound Classification | Dec 25, 2023 | ClassificationFew-Shot Learning | CodeCode Available | 1 |
| Stethoscope-guided Supervised Contrastive Learning for Cross-domain Adaptation on Respiratory Sound Classification | Dec 15, 2023 | Audio ClassificationContrastive Learning | CodeCode Available | 1 |
| Unsupervised classification to improve the quality of a bird song recording dataset | Feb 15, 2023 | Sound ClassificationTemporal Localization | CodeCode Available | 1 |
| BTS: Bridging Text and Sound Modalities for Metadata-Aided Respiratory Sound Classification | Jun 10, 2024 | Audio ClassificationSound Classification | CodeCode Available | 1 |
| End-to-End Audio Strikes Back: Boosting Augmentations Towards An Efficient Audio Classification Network | Apr 25, 2022 | Audio ClassificationClassification | CodeCode Available | 1 |
| Enemy Spotted: in-game gun sound dataset for gunshot classification and localization | Aug 22, 2022 | ClassificationSound Classification | CodeCode Available | 1 |
| HouseX: A Fine-grained House Music Dataset and its Potential in the Music Industry | Jul 24, 2022 | ClassificationSound Classification | CodeCode Available | 1 |
| Tiny Transformers for Environmental Sound Classification at the Edge | Mar 22, 2021 | ClassificationEnvironmental Sound Classification | CodeCode Available | 1 |
| Patch-Mix Contrastive Learning with Audio Spectrogram Transformer on Respiratory Sound Classification | May 23, 2023 | Audio ClassificationContrastive Learning | CodeCode Available | 1 |
| CycleGuardian: A Framework for Automatic RespiratorySound classification Based on Improved Deep clustering and Contrastive Learning | Feb 2, 2025 | Audio ClassificationClustering | CodeCode Available | 1 |
| XKD: Cross-modal Knowledge Distillation with Domain Alignment for Video Representation Learning | Nov 25, 2022 | Action ClassificationClassification | CodeCode Available | 1 |
| Learning spectro-temporal representations of complex sounds with parameterized neural networks | Mar 12, 2021 | Action DetectionActivity Detection | CodeCode Available | 1 |
| A dataset for Audio-Visual Sound Event Detection in Movies | Feb 14, 2023 | Event DetectionSelf-Driving Cars | CodeCode Available | 1 |
| Enemy Spotted: in-game gun sound dataset for gunshot classification and localization | Oct 12, 2022 | ClassificationSound Classification | CodeCode Available | 1 |
| Epic-Sounds: A Large-scale Dataset of Actions That Sound | Feb 1, 2023 | Action RecognitionSound Classification | CodeCode Available | 1 |
| Continual Learning For On-Device Environmental Sound Classification | Jul 15, 2022 | ClassificationComputational Efficiency | CodeCode Available | 1 |
| Classification of Short Segment Pediatric Heart Sounds Based on a Transformer-Based Convolutional Neural Network | Mar 30, 2024 | Sound Classification | —Unverified | 0 |
| Classification of Adventitious Sounds Combining Cochleogram and Vision Transformers | Nov 8, 2024 | Audio ClassificationClassification | —Unverified | 0 |
| A Semi-Supervised Algorithm for Improving the Consistency of Crowdsourced Datasets: The COVID-19 Case Study on Respiratory Disorder Classification | Sep 9, 2022 | Cough ClassificationDiagnostic | —Unverified | 0 |
| Classification for Big Dataset of Bioacoustic Signals Based on Human Scoring System and Artificial Neural Network | May 15, 2013 | ClassificationGeneral Classification | —Unverified | 0 |
| Channel-Spatial-Based Few-Shot Bird Sound Event Detection | Jun 18, 2023 | Event DetectionFew-Shot Learning | —Unverified | 0 |
| ARSC-Net: Adventitious Respiratory Sound Classification Network Using Parallel Paths with Channel-Spatial Attention | Jan 14, 2022 | Audio ClassificationClassification | —Unverified | 0 |
| Can Masked Autoencoders Also Listen to Birds? | Apr 17, 2025 | Audio ClassificationMulti-Label Classification | —Unverified | 0 |
| A Preliminary Study on Environmental Sound Classification Leveraging Large-Scale Pretrained Model and Semi-Supervised Learning | Oct 1, 2021 | ClassificationData Augmentation | —Unverified | 0 |
| Disentangling Dual-Encoder Masked Autoencoder for Respiratory Sound Classification | Jun 12, 2025 | DisentanglementSound Classification | —Unverified | 0 |
| Detection of Adversarial Attacks and Characterization of Adversarial Subspace | Oct 26, 2019 | BenchmarkingEnvironmental Sound Classification | —Unverified | 0 |
| AI for Earth: Rainforest Conservation by Acoustic Surveillance | Aug 20, 2019 | Audio ClassificationBIG-bench Machine Learning | —Unverified | 0 |
| Automatic Environmental Sound Recognition: Performance versus Computational Cost | Jul 15, 2016 | General ClassificationSound Classification | —Unverified | 0 |
| Deep Neural Network Based Respiratory Pathology Classification Using Cough Sounds | Jun 23, 2021 | ClassificationSound Classification | —Unverified | 0 |
| Dynamic Backdoors with Global Average Pooling | Mar 4, 2022 | Classificationimage-classification | —Unverified | 0 |
| An open access database for the evaluation of respiratory sound classification algorithms | May 22, 2019 | Sound Classification | —Unverified | 0 |
| Automatic COVID-19 disease diagnosis using 1D convolutional neural network and augmentation with human respiratory sound based on parameters: cough, breath, and voice | Dec 14, 2021 | Sound Classification | —Unverified | 0 |