SOTAVerified

Keyword Spotting

In speech processing, keyword spotting deals with the identification of keywords in utterances.

( Image credit: Simon Grest )

Papers

Showing 150 of 407 papers

TitleStatusHype
Low-resource keyword spotting using contrastively trained transformer acoustic word embeddings0
Enhancing Few-shot Keyword Spotting Performance through Pre-Trained Self-supervised Speech Models0
ASAP-FE: Energy-Efficient Feature Extraction Enabling Multi-Channel Keyword Spotting on Edge Processors0
Advances in Small-Footprint Keyword Spotting: A Comprehensive Review of Efficient Models and AlgorithmsCode0
GLAP: General contrastive audio-text pretraining across domains and languagesCode2
Implementing Keyword Spotting on the MCUX947 Microcontroller with Integrated NPU0
SPBA: Utilizing Speech Large Language Model for Backdoor Attacks on Speech Classification Models0
Assessing the Impact of Anisotropy in Neural Representations of Speech: A Case Study on Keyword Spotting0
WCTC-Biasing: Retraining-free Contextual Biasing ASR with Wildcard CTC-based Keyword Spotting and Inter-layer Biasing0
Speech Unlearning0
Chameleon: A MatMul-Free Temporal Convolutional Network Accelerator for End-to-End Few-Shot and Continual Learning from Sequential DataCode1
LLM-Synth4KWS: Scalable Automatic Generation and Synthesis of Confusable Data for Custom Keyword Spotting0
MFA-KWS: Effective Keyword Spotting with Multi-head Frame-asynchronous DecodingCode2
Adversarial Deep Metric Learning for Cross-Modal Audio-Text Alignment in Open-Vocabulary Keyword Spotting0
GraphemeAug: A Systematic Approach to Synthesized Hard Negative Keyword Spotting Examples0
AdaKWS: Towards Robust Keyword Spotting with Test-Time Adaptation0
AnalyticKWS: Towards Exemplar-Free Analytic Class Incremental Learning for Small-footprint Keyword Spotting0
Adaptive Noise Resilient Keyword Spotting Using One-Shot Learning0
Efficient Continual Learning in Keyword Spotting using Binary Neural Networks0
Hardware/Software Co-Design of RISC-V Extensions for Accelerating Sparse DNNs on FPGAs0
AI-Powered Agile Analog Circuit Design and Optimization0
Towards efficient keyword spotting using spike-based time difference encoders0
Eventprop training for efficient neuromorphic applications0
Toward noise-robust whisper keyword spotting on headphones with in-earcup microphone and curriculum learning0
Let SSMs be ConvNets: State-space Modeling with Optimal Tensor ContractionsCode0
Noise-Agnostic Multitask Whisper Training for Reducing False Alarm Errors in Call-for-Help Detection0
AfriHate: A Multilingual Collection of Hate Speech and Abusive Language Datasets for African LanguagesCode1
Vocal Tract Length Warped Features for Spoken Keyword Spotting0
Phoneme-Level Contrastive Learning for User-Defined Keyword Spotting with Flexible Enrollment0
Text-Aware Adapter for Few-Shot Keyword Spotting0
NTC-KWS: Noise-aware CTC for Robust Keyword Spotting0
Streaming Keyword Spotting Boosted by Cross-layer Discrimination ConsistencyCode2
GhostRNN: Reducing State Redundancy in RNN with Cheap Operations0
Zero-Shot Temporal Resolution Domain Adaptation for Spiking Neural Networks0
Noise-Robust Hearing Aid Voice Control0
GE2E-KWS: Generalized End-to-End Training and Evaluation for Zero-shot Keyword Spotting0
Audio Explanation Synthesis with Generative Foundation ModelsCode0
A Literature Review of Keyword Spotting Technologies for Urdu0
Effective Integration of KAN for Keyword Spotting0
Dark Experience for Incremental Keyword Spotting0
SLiCK: Exploiting Subsequences for Length-Constrained Keyword Spotting0
Multimodal Laryngoscopic Video Analysis for Assisted Diagnosis of Vocal Fold Paralysis0
Contrastive Augmentation: An Unsupervised Learning Approach for Keyword Spotting in Speech Technology0
EmoAttack: Utilizing Emotional Voice Conversion for Speech Backdoor Attacks on Deep Speech Classification Models0
Query-by-Example Keyword Spotting Using Spectral-Temporal Graph Attentive Pooling and Multi-Task Learning0
Disentangled Training with Adversarial Examples For Robust Small-footprint Keyword Spotting0
Self-Learning for Personalized Keyword Spotting on Ultra-Low-Power Audio SensorsCode1
Adversarial training of Keyword Spotting to Minimize TTS Data Overfitting0
Convexity-based Pruning of Speech Representation Models0
Neuromorphic Keyword Spotting with Pulse Density Modulation MEMS MicrophonesCode0
Show:102550
← PrevPage 1 of 9Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1NNI non-filtered(for the development set)Cnxe6.09Unverified
2NNI Choi(for the development set)Cnxe5.89Unverified
3NTU rnn (eval)Cnxe2.01Unverified
4NTU dtw (eval)Cnxe2.01Unverified
5NTU dtw (dev)Cnxe2.01Unverified
6NTU rnn (dev)Cnxe2.01Unverified
7ELiRF SDTW (eval)Cnxe1.19Unverified
8ELiRF SDTW-avg (eval)Cnxe1.07Unverified
9ELiRF SDTW (dev)Cnxe1.07Unverified
10CUNY [Subseq+MFCC] (eval)Cnxe1.07Unverified
#ModelMetricClaimedVerifiedStatus
1WaveFormerGoogle Speech Commands V2 1298.8Unverified
2QNNGoogle Speech Commands V2 3598.6Unverified
3TripletLoss-res15Google Speech Commands V1 1298.56Unverified
4M2DGoogle Speech Commands V2 3598.5Unverified
5EAT-SGoogle Speech Commands V2 3598.15Unverified
6Audio Spectrogram TransformerGoogle Speech Commands V2 3598.11Unverified
7EdgeCRNN 2.0×Google Speech Commands V2 1298.05Unverified
8BC-ResNet-8Google Speech Commands V1 1298Unverified
9HTS-ATGoogle Speech Commands V2 3598Unverified
10Wav2KWSGoogle Speech Commands V1 1297.9Unverified
#ModelMetricClaimedVerifiedStatus
1Stacked 1D CNNError Rate1.99Unverified
2End-to-end DNN-HMMError Rate1.7Unverified
3HEiMDaLError Rate0.45Unverified
#ModelMetricClaimedVerifiedStatus
1Res26Accuracy95.88Unverified
2EfficientNet-A0 + SA + TLAccuracy95.83Unverified
#ModelMetricClaimedVerifiedStatus
1QuaternionNeuralNetworkAccuracy (10-fold)98.53Unverified
2SSAMBAAccuracy (10-fold)97.4Unverified
#ModelMetricClaimedVerifiedStatus
1TensorFlow's model version 2TFMA89.7Unverified
2TensorFlow's model version 1TFMA85.4Unverified
#ModelMetricClaimedVerifiedStatus
12D-ConvNetAccuracy (%)95.4Unverified
21D-ConvNetAccuracy (%)93.7Unverified
#ModelMetricClaimedVerifiedStatus
1Quaternion Neural NetworksAccuracy(10-fold)98.53Unverified
#ModelMetricClaimedVerifiedStatus
1MicroNet-KWS-LAccuracy95.3Unverified