MAST: Multiscale Audio Spectrogram Transformers Nov 2, 2022 Audio Classification Keyword Spotting
MAX-AST: COMBINING CONVOLUTION, LOCAL AND GLOBAL SELF-ATTENTIONS FOR AUDIO EVENT CLASSIFICATION Apr 14, 2024 Audio Classification
Adaptive Differential Denoising for Respiratory Sounds Classification Jun 3, 2025 Audio Classification Classification
Continual Transformers: Redundancy-Free Attention for Online Inference Jan 17, 2022 Action Detection Audio Classification
Exploiting Foundation Models and Speech Enhancement for Parkinson's Disease Detection from Speech in Real-World Operative Conditions Jun 23, 2024 Audio Classification Parkinson Detection from Speech
EquiAV: Leveraging Equivariance for Audio-Visual Contrastive Learning Mar 14, 2024 Audio Classification audio-visual learning
Π-nets: Deep Polynomial Neural Networks Mar 8, 2020 Audio Classification Graph Representation Learning
Audio-Visual Generalized Zero-Shot Learning using Pre-Trained Large Multi-Modal Models Apr 9, 2024 Audio Classification Generalized Zero-Shot Learning
LEAF: A Learnable Frontend for Audio Classification Jan 21, 2021 Audio Classification Classification
MAE-AST: Masked Autoencoding Audio Spectrogram Transformer Mar 30, 2022 Audio Classification Decoder
PSLA: Improving Audio Tagging with Pretraining, Sampling, Labeling, and Aggregation Feb 2, 2021 Audio Classification Audio Tagging
CRNNs for Urban Sound Tagging with spatiotemporal context Aug 24, 2020 Audio Classification Audio Tagging
AdamP: Slowing Down the Slowdown for Momentum Optimizers on Scale-invariant Weights Jun 15, 2020 Audio Classification Image Classification
Ultra-light deep MIR by trimming lottery tickets Jul 31, 2020 Audio Classification CPU
CycleGuardian: A Framework for Automatic Respiratory Sound classification Based on Improved Deep clustering and Contrastive Learning Feb 2, 2025 Audio Classification Clustering
Piano Skills Assessment Jan 13, 2021 Action Quality Assessment Audio Classification
Rene: A Pre-trained Multi-modal Architecture for Auscultation of Respiratory Diseases May 13, 2024 Audio Classification Diagnostic
Deep Feature Learning for Medical Acoustics Aug 5, 2022 Audio Classification
Rank-based loss for learning hierarchical representations Oct 11, 2021 Audio Classification Triplet
Augmenting Deep Classifiers with Polynomial Neural Networks Apr 16, 2021 Audio Classification General Classification
Play It Back: Iterative Attention for Audio Recognition Oct 20, 2022 Audio Classification
Pruning vs XNOR-Net: A Comprehensive Study of Deep Learning for Audio Classification on Edge-devices Aug 13, 2021 Audio Classification Classification
Cross-modal supervised learning for better acoustic representations Nov 15, 2019 Audio Classification General Classification
Audiogmenter: a MATLAB Toolbox for Audio Data Augmentation Dec 11, 2019 Audio Classification Data Augmentation
AudioFormer: Audio Transformer learns audio feature representations from discrete acoustic codes Aug 14, 2023 Audio Classification Classification
Patient-Aware Feature Alignment for Robust Lung Sound Classification: Cohesion-Separation and Global Alignment Losses May 28, 2025 Audio Classification Lung Sound Classification
Performance Analysis of Hybrid Quantum-Classical Convolutional Neural Networks for Audio Classification Nov 4, 2024 Audio Classification Quantum Machine Learning
Self-Supervised Learning by Cross-Modal Audio-Video Clustering Nov 28, 2019 Action Recognition Audio Classification
On-device Online Learning and Semantic Management of TinyML Systems May 13, 2024 Audio Classification image-classification
Convolutional RNN: an Enhanced Model for Extracting Features from Sequential Data Feb 18, 2016 Audio Classification
nEMO: Dataset of Emotional Speech in Polish Apr 9, 2024 Audio Classification Emotion Recognition
On the Veracity of Local, Model-agnostic Explanations in Audio Classification: Targeted Investigations with Adversarial Examples Jul 19, 2021 Audio Classification
Multi-level Attention Model for Weakly Supervised Audio Classification Mar 6, 2018 Audio Classification
Masked Modeling Duo: Learning Representations by Encouraging Both Networks to Model the Input Oct 26, 2022 Audio Classification Audio Tagging
Masked Modeling Duo: Towards a Universal Audio Pre-training Framework Apr 9, 2024 Audio Classification
Specifying Weight Priors in Bayesian Deep Neural Networks with Empirical Bayes Jun 12, 2019 Activity Recognition Audio Classification
4,500 Seconds: Small Data Training Approaches for Deep UAV Audio Classification May 21, 2025 Audio Classification parameter-efficient fine-tuning
Masked Conditional Neural Networks for Audio Classification Mar 6, 2018 Audio Classification Classification
Compact Global Descriptor for Neural Networks Jul 23, 2019 Audio Classification Deep Attention
MAP-SNN: Mapping Spike Activities with Multiplicity, Adaptability, and Plasticity into Bio-Plausible Spiking Neural Networks Apr 21, 2022 Audio Classification Diversity
Multi-label audio classification with a noisy zero-shot teacher Jul 20, 2024 Audio Classification Data Augmentation
Self-Supervised MultiModal Versatile Networks Jun 29, 2020 Action Recognition In Videos Audio Classification
LungAttn: advanced lung sound classification using attention mechanism with dual TQWT and triple STFT spectrogram Oct 29, 2021 Audio Classification Lung Sound Classification
Attention Bottlenecks for Multimodal Fusion Jun 30, 2021 Action Classification Action Recognition
Look, Listen and Learn May 23, 2017 Audio Classification General Classification
LungBRN: A Smart Digital Stethoscope for Detecting Respiratory Disease Using bi-ResNet Deep Learning Algorithm Dec 5, 2019 Audio Classification Diagnostic
A Training Framework for Optimal and Stable Training of Polynomial Neural Networks May 16, 2025 Audio Classification Homomorphic Encryption for Deep Learning
15,500 Seconds: Lean UAV Classification Leveraging PEFT and Pre-Trained Networks May 21, 2025 Audio Classification Data Augmentation
M2D2: Exploring General-purpose Audio-Language Representations Beyond CLAP Mar 28, 2025 Audio captioning Audio Classification
A Study on Broadcast Networks for Music Genre Classification Aug 25, 2022 Audio Classification Classification