| EAT: Self-Supervised Pre-Training with Efficient Audio Transformer | Jan 7, 2024 | Audio ClassificationSelf-Supervised Learning | CodeCode Available | 3 | 5 |
| AudioCLIP: Extending CLIP to Image, Text and Audio | Jun 24, 2021 | ClassificationEnvironmental Sound Classification | CodeCode Available | 2 | 5 |
| HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection | Feb 2, 2022 | Audio ClassificationEvent Detection | CodeCode Available | 2 | 5 |
| Contrastive Learning of Global-Local Video Representations | Apr 7, 2021 | ClassificationContrastive Learning | CodeCode Available | 1 | 5 |
| Enemy Spotted: in-game gun sound dataset for gunshot classification and localization | Aug 22, 2022 | ClassificationSound Classification | CodeCode Available | 1 | 5 |
| End-to-End Audio Strikes Back: Boosting Augmentations Towards An Efficient Audio Classification Network | Apr 25, 2022 | Audio ClassificationClassification | CodeCode Available | 1 | 5 |
| Differentiable Tracking-Based Training of Deep Learning Sound Source Localizers | Oct 29, 2021 | ClassificationDeep Learning | CodeCode Available | 1 | 5 |
| Tiny Transformers for Environmental Sound Classification at the Edge | Mar 22, 2021 | ClassificationEnvironmental Sound Classification | CodeCode Available | 1 | 5 |
| BTS: Bridging Text and Sound Modalities for Metadata-Aided Respiratory Sound Classification | Jun 10, 2024 | Audio ClassificationSound Classification | CodeCode Available | 1 | 5 |
| Adversarial Fine-tuning using Generated Respiratory Sound to Address Class Imbalance | Nov 11, 2023 | Audio ClassificationSound Classification | CodeCode Available | 1 | 5 |
| Urban Sound Classification : striving towards a fair comparison | Oct 22, 2020 | Audio ClassificationAudio Tagging | CodeCode Available | 1 | 5 |
| CycleGuardian: A Framework for Automatic RespiratorySound classification Based on Improved Deep clustering and Contrastive Learning | Feb 2, 2025 | Audio ClassificationClustering | CodeCode Available | 1 | 5 |
| Self-Supervised Audio-Visual Representation Learning with Relaxed Cross-Modal Synchronicity | Nov 9, 2021 | Audio ClassificationRetrieval | CodeCode Available | 1 | 5 |
| AUCO ResNet: an end-to-end network for Covid-19 pre-screening from cough and breath | Mar 15, 2022 | 8kAudio Classification | CodeCode Available | 1 | 5 |
| SoundCLR: Contrastive Learning of Representations For Improved Environmental Sound Classification | Mar 2, 2021 | Contrastive LearningData Augmentation | CodeCode Available | 1 | 5 |
| A dataset for Audio-Visual Sound Event Detection in Movies | Feb 14, 2023 | Event DetectionSelf-Driving Cars | CodeCode Available | 1 | 5 |
| Pretraining Respiratory Sound Representations using Metadata and Contrastive Learning | Oct 27, 2022 | Audio ClassificationClassification | CodeCode Available | 1 | 5 |
| Enemy Spotted: in-game gun sound dataset for gunshot classification and localization | Oct 12, 2022 | ClassificationSound Classification | CodeCode Available | 1 | 5 |
| ESResNet: Environmental Sound Classification Based on Visual Domain Models | Apr 15, 2020 | ClassificationEnvironmental Sound Classification | CodeCode Available | 1 | 5 |
| XKD: Cross-modal Knowledge Distillation with Domain Alignment for Video Representation Learning | Nov 25, 2022 | Action ClassificationClassification | CodeCode Available | 1 | 5 |
| Comparison of semi-supervised deep learning algorithms for audio classification | Feb 16, 2021 | Audio ClassificationAudio Tagging | CodeCode Available | 1 | 5 |
| HouseX: A Fine-grained House Music Dataset and its Potential in the Music Industry | Jul 24, 2022 | ClassificationSound Classification | CodeCode Available | 1 | 5 |
| Self-Supervised Learning for Few-Shot Bird Sound Classification | Dec 25, 2023 | ClassificationFew-Shot Learning | CodeCode Available | 1 | 5 |
| Learning spectro-temporal representations of complex sounds with parameterized neural networks | Mar 12, 2021 | Action DetectionActivity Detection | CodeCode Available | 1 | 5 |
| Environmental Sound Classification on the Edge: A Pipeline for Deep Acoustic Networks on Extremely Resource-Constrained Devices | Mar 5, 2021 | Audio ClassificationEnvironmental Sound Classification | CodeCode Available | 1 | 5 |
| Epic-Sounds: A Large-scale Dataset of Actions That Sound | Feb 1, 2023 | Action RecognitionSound Classification | CodeCode Available | 1 | 5 |
| Adaptive Differential Denoising for Respiratory Sounds Classification | Jun 3, 2025 | Audio ClassificationClassification | CodeCode Available | 1 | 5 |
| Manikin-Recorded Cardiopulmonary Sounds Dataset Using Digital Stethoscope | Oct 4, 2024 | Audio Signal ProcessingSound Classification | CodeCode Available | 1 | 5 |
| Multi-View Spectrogram Transformer for Respiratory Sound Classification | Nov 16, 2023 | ClassificationSound Classification | CodeCode Available | 1 | 5 |
| Patch-Mix Contrastive Learning with Audio Spectrogram Transformer on Respiratory Sound Classification | May 23, 2023 | Audio ClassificationContrastive Learning | CodeCode Available | 1 | 5 |
| Stethoscope-guided Supervised Contrastive Learning for Cross-domain Adaptation on Respiratory Sound Classification | Dec 15, 2023 | Audio ClassificationContrastive Learning | CodeCode Available | 1 | 5 |
| Unsupervised classification to improve the quality of a bird song recording dataset | Feb 15, 2023 | Sound ClassificationTemporal Localization | CodeCode Available | 1 | 5 |
| Effective Audio Classification Network Based on Paired Inverse Pyramid Structure and Dense MLP Block | Nov 5, 2022 | Audio ClassificationClassification | CodeCode Available | 1 | 5 |
| Continual Learning For On-Device Environmental Sound Classification | Jul 15, 2022 | ClassificationComputational Efficiency | CodeCode Available | 1 | 5 |
| Patient-Aware Feature Alignment for Robust Lung Sound Classification:Cohesion-Separation and Global Alignment Losses | May 28, 2025 | Audio ClassificationLung Sound Classification | CodeCode Available | 0 | 5 |
| Masked Conditional Neural Networks for Environmental Sound Classification | May 25, 2018 | ClassificationEnvironmental Sound Classification | CodeCode Available | 0 | 5 |
| A ResNet attention model for classifying mosquitoes from wing‑beating sounds | Jun 20, 2022 | ClassificationData Augmentation | CodeCode Available | 0 | 5 |
| BUET Multi-disease Heart Sound Dataset: A Comprehensive Auscultation Dataset for Developing Computer-Aided Diagnostic Systems | Sep 1, 2024 | DiagnosticManagement | CodeCode Available | 0 | 5 |
| LungAttn: advanced lung sound classification using attention mechanism with dual TQWT and triple STFT spectrogram | Oct 29, 2021 | Audio ClassificationLung Sound Classification | CodeCode Available | 0 | 5 |
| A Visual Domain Transfer Learning Approach for Heartbeat Sound Classification | Jul 28, 2021 | Classificationdomain classification | CodeCode Available | 0 | 5 |
| Look, Listen and Learn | May 23, 2017 | Audio ClassificationGeneral Classification | CodeCode Available | 0 | 5 |
| LungBRN: A Smart Digital Stethoscope for Detecting Respiratory Disease Using bi-ResNet Deep Learning Algorithm | Dec 5, 2019 | Audio ClassificationDiagnostic | CodeCode Available | 0 | 5 |
| Deep Neural Network for Respiratory Sound Classification in Wearable Devices Enabled by Patient Specific Model Tuning | Apr 16, 2020 | Anomaly DetectionGeneral Classification | CodeCode Available | 0 | 5 |
| Deep Convolutional Neural Networks and Data Augmentation for Environmental Sound Classification | Jan 23, 2017 | ClassificationData Augmentation | CodeCode Available | 0 | 5 |
| Deep Convolutional Neural Networks and Data Augmentation for Environmental Sound Classification | Aug 15, 2016 | Data AugmentationDictionary Learning | CodeCode Available | 0 | 5 |
| Heterogeneous sound classification with the Broad Sound Taxonomy and Dataset | Oct 1, 2024 | ClassificationSound Classification | CodeCode Available | 0 | 5 |
| An Ensemble of Transfer, Semi-supervised and Supervised Learning Methods for Pathological Heart Sound Classification | Jun 18, 2018 | General ClassificationRepresentation Learning | CodeCode Available | 0 | 5 |
| Example-based Explanations with Adversarial Attacks for Respiratory Sound Analysis | Mar 30, 2022 | Sound Classification | CodeCode Available | 0 | 5 |
| Face: Fast, Accurate and Context-Aware Audio Annotation and Classification | Mar 7, 2023 | Active LearningAudio Classification | CodeCode Available | 0 | 5 |
| Improving Respiratory Sound Classification with Architecture-Agnostic Knowledge Distillation from Ensembles | May 28, 2025 | Knowledge DistillationSound Classification | CodeCode Available | 0 | 5 |