Keyword Spotting

In speech processing, keyword spotting deals with the identification of keywords in utterances.

(Image credit: Simon Grest)

Papers

Showing 226–250 of 407 papers

Title	Date	Tasks	Status
The SPL-IT-UC Query by Example Search on Speech system for MediaEval 2015	Sep 14, 2015	Dynamic Time Warping, Keyword Spotting	Unverified
TinySV: Speaker Verification in TinyML with On-device Learning	Jun 3, 2024	Keyword Spotting, Speaker Verification	Unverified
To Wake-up or Not to Wake-up: Reducing Keyword False Alarm by Successive Refinement	Apr 6, 2023	Keyword Spotting	Unverified
Toward noise-robust whisper keyword spotting on headphones with in-earcup microphone and curriculum learning	Feb 1, 2025	Keyword Spotting	Unverified
Towards Contactless Elevators with TinyML using CNN-based Person Detection and Keyword Spotting	May 19, 2024	Human Detection, Keyword Spotting	Unverified
Towards efficient keyword spotting using spike-based time difference encoders	Mar 19, 2025	Keyword Spotting	Unverified
Towards hate speech detection in low-resource languages: Comparing ASR to acoustic word embeddings on Wolof and Swahili	Jun 1, 2023	Automatic Speech Recognition (ASR)	Unverified
Towards Robust Domain Generalization in 2D Neural Audio Processing	Sep 29, 2021	Acoustic Scene Classification, Domain Generalization	Unverified
Training Keyword Spotting Models on Non-IID Data with Federated Learning	May 21, 2020	Data Augmentation, Federated Learning	Unverified
Training Wake Word Detection with Synthesized Speech Data on Confusion Words	Nov 3, 2020	Data Augmentation, Keyword Spotting	Unverified
Transfer Learning for a Letter-Ngrams to Word Decoder in the Context of Historical Handwriting Recognition with Scarce Resources	Aug 1, 2018	Decoder, Handwriting Recognition	Unverified
T-RECX: Tiny-Resource Efficient Convolutional neural networks with early-eXit	Jul 14, 2022	Image Classification	Unverified
TUKE at MediaEval 2015 QUESST	Sep 14, 2015	Dynamic Time Warping, Keyword Spotting	Unverified
TUKE System for MediaEval 2014 QUESST	Oct 16, 2014	Clustering, Keyword Spotting	Unverified
U2-KWS: Unified Two-pass Open-vocabulary Keyword Spotting with Keyword Bias	Dec 15, 2023	Decoder, Keyword Spotting	Unverified
Ultra-Low Power Keyword Spotting at the Edge	Nov 9, 2021	Keyword Spotting, Model Optimization	Unverified
Understanding Self-Supervised Learning of Speech Representation via Invariance and Redundancy Reduction	Sep 7, 2023	Keyword Spotting, Self-Supervised Learning	Unverified
Utilizing TTS Synthesized Data for Efficient Development of Keyword Spotting Model	Jul 26, 2024	2k, Diversity	Unverified
VIC-KD: Variance-Invariance-Covariance Knowledge Distillation to Make Keyword Spotting More Robust Against Adversarial Attacks	Sep 22, 2023	Adversarial Robustness, Keyword Spotting	Unverified
Visually grounded cross-lingual keyword spotting in speech	Jun 13, 2018	Keyword Spotting, Visual Grounding	Unverified
Vocal Tract Length Warped Features for Spoken Keyword Spotting	Jan 7, 2025	Keyword Spotting	Unverified
VSVC: Backdoor attack against Keyword Spotting based on Voiceprint Selection and Voice Conversion	Dec 20, 2022	Backdoor Attack, Keyword Spotting	Unverified
Wakeword Detection under Distribution Shifts	Jul 13, 2022	Keyword Spotting	Unverified
WaveSense: Efficient Temporal Convolutions with Spiking Neural Networks for Keyword Spotting	Nov 2, 2021	Keyword Spotting	Unverified
WCTC-Biasing: Retraining-free Contextual Biasing ASR with Wildcard CTC-based Keyword Spotting and Inter-layer Biasing	Jun 2, 2025	Keyword Spotting, Speech Recognition	Unverified

Datasets: QUESST, Google Speech Commands, hey Siri, FKD, Google Speech Commands V2 35, TensorFlow, VoxForge, Google Speech Commands (v2), Google Speech Commands V2 12

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	NNI non-filtered (for the development set)	Cnxe	6.09	—	Unverified
2	NNI Choi (for the development set)	Cnxe	5.89	—	Unverified
3	NTU rnn (eval)	Cnxe	2.01	—	Unverified
4	NTU dtw (eval)	Cnxe	2.01	—	Unverified
5	NTU dtw (dev)	Cnxe	2.01	—	Unverified
6	NTU rnn (dev)	Cnxe	2.01	—	Unverified
7	ELiRF SDTW (eval)	Cnxe	1.19	—	Unverified
8	ELiRF SDTW-avg (eval)	Cnxe	1.07	—	Unverified
9	ELiRF SDTW (dev)	Cnxe	1.07	—	Unverified
10	CUNY [Subseq+MFCC] (eval)	Cnxe	1.07	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	WaveFormer	Google Speech Commands V2 12	98.8	—	Unverified
2	QNN	Google Speech Commands V2 35	98.6	—	Unverified
3	TripletLoss-res15	Google Speech Commands V1 12	98.56	—	Unverified
4	M2D	Google Speech Commands V2 35	98.5	—	Unverified
5	EAT-S	Google Speech Commands V2 35	98.15	—	Unverified
6	Audio Spectrogram Transformer	Google Speech Commands V2 35	98.11	—	Unverified
7	EdgeCRNN 2.0×	Google Speech Commands V2 12	98.05	—	Unverified
8	BC-ResNet-8	Google Speech Commands V1 12	98	—	Unverified
9	HTS-AT	Google Speech Commands V2 35	98	—	Unverified
10	Wav2KWS	Google Speech Commands V1 12	97.9	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Stacked 1D CNN	Error Rate	1.99	—	Unverified
2	End-to-end DNN-HMM	Error Rate	1.7	—	Unverified
3	HEiMDaL	Error Rate	0.45	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Res26	Accuracy	95.88	—	Unverified
2	EfficientNet-A0 + SA + TL	Accuracy	95.83	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	QuaternionNeuralNetwork	Accuracy (10-fold)	98.53	—	Unverified
2	SSAMBA	Accuracy (10-fold)	97.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TensorFlow's model version 2	TFMA	89.7	—	Unverified
2	TensorFlow's model version 1	TFMA	85.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	2D-ConvNet	Accuracy (%)	95.4	—	Unverified
2	1D-ConvNet	Accuracy (%)	93.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Quaternion Neural Networks	Accuracy (10-fold)	98.53	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MicroNet-KWS-L	Accuracy	95.3	—	Unverified