SOTAVerified

AdaKWS: Towards Robust Keyword Spotting with Test-Time Adaptation

2025-05-20Unverified0· sign in to hype

Yang Xiao, Tianyi Peng, Yanghao Zhou, Rohan Kumar Das

Unverified — Be the first to reproduce this paper.

Reproduce

Abstract

Spoken keyword spotting (KWS) aims to identify keywords in audio for wide applications, especially on edge devices. Current small-footprint KWS systems focus on efficient model designs. However, their inference performance can decline in unseen environments or noisy backgrounds. Test-time adaptation (TTA) helps models adapt to test samples without needing the original training data. In this study, we present AdaKWS, the first TTA method for robust KWS to the best of our knowledge. Specifically, 1) We initially optimize the model's confidence by selecting reliable samples based on prediction entropy minimization and adjusting the normalization statistics in each batch. 2) We introduce pseudo-keyword consistency (PKC) to identify critical, reliable features without overfitting to noise. Our experiments show that AdaKWS outperforms other methods across various conditions, including Gaussian noise and real-scenario noises. The code will be released in due course.

Tasks

Reproductions