Patch-Mix Contrastive Learning with Audio Spectrogram Transformer on Respiratory Sound Classification

2023-05-23Code Available1· sign in to hype

Sangmin Bae, June-Woo Kim, Won-Yang Cho, Hyerim Baek, Soyoun Son, Byungjo Lee, Changwan Ha, Kyongpil Tae, Sungnyun Kim, Se-Young Yun

arXiv PDF

Code Available — Be the first to reproduce this paper.

Reproduce

Code

github.com/raymin0223/patch-mix_contrastive_learning
OfficialIn paperpytorch★ 73

Abstract

Respiratory sound contains crucial information for the early diagnosis of fatal lung diseases. Since the COVID-19 pandemic, there has been a growing interest in contact-free medical care based on electronic stethoscopes. To this end, cutting-edge deep learning models have been developed to diagnose lung diseases; however, it is still challenging due to the scarcity of medical data. In this study, we demonstrate that the pretrained model on large-scale visual and audio datasets can be generalized to the respiratory sound classification task. In addition, we introduce a straightforward Patch-Mix augmentation, which randomly mixes patches between different samples, with Audio Spectrogram Transformer (AST). We further propose a novel and effective Patch-Mix Contrastive Learning to distinguish the mixed representations in the latent space. Our method achieves state-of-the-art performance on the ICBHI dataset, outperforming the prior leading score by an improvement of 4.08%.

Tasks

Audio Classification Contrastive Learning Sound Classification

Benchmark Results

Dataset	Model	Metric	Claimed	Verified	Status
ICBHI Respiratory Sound Database	AST (Patch-Mix CL)	ICBHI Score	62.37	—	Unverified
ICBHI Respiratory Sound Database	AST (fine-tuning)	ICBHI Score	59.55	—	Unverified
ICBHI Respiratory Sound Database	AST (fine-tuning)	Sensitivity	41.97	—	Unverified

Patch-Mix Contrastive Learning with Audio Spectrogram Transformer on Respiratory Sound Classification

Code

Abstract

Tasks

Benchmark Results

Reproductions