Advancing Continual Learning for Robust Deepfake Audio Classification Jul 14, 2024 Audio Classification Classification
— Unverified 0ElasticAST: An Audio Spectrogram Transformer for All Length and Resolutions Jul 11, 2024 All Audio Classification
Code Code Available 1DASS: Distilled Audio State Space Models Are Stronger and More Duration-Scalable Learners Jul 4, 2024 Audio Classification Audio Tagging
Code Code Available 1Exploiting Foundation Models and Speech Enhancement for Parkinson's Disease Detection from Speech in Real-World Operative Conditions Jun 23, 2024 Audio Classification Parkinson Detection from Speech
Code Code Available 1Fully Few-shot Class-incremental Audio Classification Using Expandable Dual-embedding Extractor Jun 12, 2024 Audio Classification
Code Code Available 0FastAST: Accelerating Audio Spectrogram Transformer via Token Merging and Cross-Model Knowledge Distillation Jun 11, 2024 Audio Classification Knowledge Distillation
Code Code Available 0BTS: Bridging Text and Sound Modalities for Metadata-Aided Respiratory Sound Classification Jun 10, 2024 Audio Classification Sound Classification
Code Code Available 1Contrastive Learning from Synthetic Audio Doppelgängers Jun 9, 2024 Audio Classification Contrastive Learning
— Unverified 0Audio Mamba: Bidirectional State Space Model for Audio Representation Learning Jun 5, 2024 Audio Classification Classification
Code Code Available 2M2D-CLAP: Masked Modeling Duo Meets CLAP for Learning General-purpose Audio-Language Representation Jun 4, 2024 Audio Classification Linear evaluation
Code Code Available 0animal2vec and MeerKAT: A self-supervised transformer for rare-event raw audio input and a large-scale reference dataset for bioacoustics Jun 3, 2024 Audio Classification Benchmarking
Code Code Available 1SSAMBA: Self-Supervised Audio Representation Learning with Mamba State Space Model May 20, 2024 Audio Classification GPU
Code Code Available 2Attention Feature Fusion Network via Knowledge Propagation for Automated Respiratory Sound Classification May 16, 2024 Audio Classification Computed Tomography (CT)
— Unverified 0Investigating Design Choices in Joint-Embedding Predictive Architectures for General Audio Representation Learning May 14, 2024 Audio Classification Representation Learning
Code Code Available 1On-device Online Learning and Semantic Management of TinyML Systems May 13, 2024 Audio Classification image-classification
Code Code Available 0Rene: A Pre-trained Multi-modal Architecture for Auscultation of Respiratory Diseases May 13, 2024 Audio Classification Diagnostic
Code Code Available 0AFEN: Respiratory Disease Classification using Ensemble Learning May 8, 2024 Audio Classification Classification
— Unverified 0Benchmarking Representations for Speech, Music, and Acoustic Events May 2, 2024 Audio Classification Benchmarking
Code Code Available 2Scalable Event-by-event Processing of Neuromorphic Sensory Signals With Deep State-Space Models Apr 29, 2024 Audio Classification Gesture Recognition
Code Code Available 1Leveraging tropical reef, bird and unrelated sounds for superior transfer learning in marine bioacoustics Apr 25, 2024 Audio Classification Transfer Learning
Code Code Available 3AudioProtoPNet: An interpretable deep learning model for bird sound classification Apr 16, 2024 Audio Classification Classification
— Unverified 0MAX-AST: COMBINING CONVOLUTION, LOCAL AND GLOBAL SELF-ATTENTIONS FOR AUDIO EVENT CLASSIFICATION Apr 14, 2024 Audio Classification
Code Code Available 1nEMO: Dataset of Emotional Speech in Polish Apr 9, 2024 Audio Classification Emotion Recognition
Code Code Available 0Masked Modeling Duo: Towards a Universal Audio Pre-training Framework Apr 9, 2024 Audio Classification
Code Code Available 0Audio-Visual Generalized Zero-Shot Learning using Pre-Trained Large Multi-Modal Models Apr 9, 2024 Audio Classification Generalized Zero-Shot Learning
Code Code Available 1DTF-AT: Decoupled Time-Frequency Audio Transformer for Event Classification Mar 24, 2024 Audio Classification Information Retrieval
Code Code Available 1InternVideo2: Scaling Foundation Models for Multimodal Video Understanding Mar 22, 2024 Action Classification Action Recognition
Code Code Available 7BirdSet: A Large-Scale Dataset for Audio Classification in Avian Bioacoustics Mar 15, 2024 Audio Classification Classification
Code Code Available 2EquiAV: Leveraging Equivariance for Audio-Visual Contrastive Learning Mar 14, 2024 Audio Classification audio-visual learning
Code Code Available 1Mixer is more than just a model Feb 28, 2024 Audio Classification Environmental Sound Classification
— Unverified 0Leveraging Pre-Trained Autoencoders for Interpretable Prototype Learning of Music Audio Feb 14, 2024 Audio Classification Decoder
Code Code Available 2Tuning In: Analysis of Audio Classifier Performance in Clinical Settings with Limited Data Feb 7, 2024 Audio Classification Classification
— Unverified 0On the Transferability of Large-Scale Self-Supervision to Few-Shot Audio Classification Feb 2, 2024 Audio Classification Few-Shot Audio Classification
Code Code Available 1From Coarse to Fine: Efficient Training for Audio Spectrogram Transformers Jan 16, 2024 Audio Classification
— Unverified 0Learning Audio Concepts from Counterfactual Natural Language Jan 10, 2024 Audio captioning Audio Classification
Code Code Available 0Class-Incremental Learning for Multi-Label Audio Classification Jan 9, 2024 Audio Classification Classification
— Unverified 0EAT: Self-Supervised Pre-Training with Efficient Audio Transformer Jan 7, 2024 Audio Classification Self-Supervised Learning
Code Code Available 3Oceanship: A Large-Scale Dataset for Underwater Audio Target Recognition Jan 4, 2024 Attribute Audio Classification
Code Code Available 2OmniVec2 - A Novel Transformer based Network for Large Scale Multimodal and Multitask Learning Jan 1, 2024 3D Point Cloud Classification Action Classification
— Unverified 0On the choice of the optimal temporal support for audio classification with Pre-trained embeddings Dec 21, 2023 Audio Classification
— Unverified 0Stethoscope-guided Supervised Contrastive Learning for Cross-domain Adaptation on Respiratory Sound Classification Dec 15, 2023 Audio Classification Contrastive Learning
Code Code Available 1Parameter-Efficient Transfer Learning of Audio Spectrogram Transformers Dec 6, 2023 Audio Classification Few-Shot Learning
Code Code Available 1Acoustic Prompt Tuning: Empowering Large Language Models with Audition Capabilities Nov 30, 2023 Audio Classification Few-Shot Audio Classification
Code Code Available 1Formal Verification of Long Short-Term Memory based Audio Classifiers: A Star based Approach Nov 16, 2023 Audio Classification Classification
Code Code Available 0Investigating the Emergent Audio Classification Ability of ASR Foundation Models Nov 15, 2023 Audio Classification Decoder
Code Code Available 0Qwen-Audio: Advancing Universal Audio Understanding via Unified Large-Scale Audio-Language Models Nov 14, 2023 Acoustic Scene Classification Audio captioning
Code Code Available 3Pruning random resistive memory for optimizing analogue AI Nov 13, 2023 Audio Classification Image Segmentation
— Unverified 0Adversarial Fine-tuning using Generated Respiratory Sound to Address Class Imbalance Nov 11, 2023 Audio Classification Sound Classification
Code Code Available 1Mirasol3B: A Multimodal Autoregressive model for time-aligned and contextual modalities Nov 9, 2023 Action Classification Audio Classification
— Unverified 0Auto deep learning for bioacoustic signals Nov 8, 2023 Audio Classification Classification
Code Code Available 0