Keyword Spotting

In speech processing, keyword spotting deals with the identification of keywords in utterances.

(Image credit: Simon Grest)

Papers

Showing 226–250 of 407 papers

Title	Date	Tasks	Status
Small-Footprint Open-Vocabulary Keyword Spotting with Quantized LSTM Networks	Feb 25, 2020	Keyword Spotting, Spoken Language Understanding	Unverified
Small-footprint slimmable networks for keyword spotting	Apr 21, 2023	Keyword Spotting, Small-Footprint Keyword Spotting	Unverified
Speech and language technologies for the automatic monitoring and training of cognitive functions	Sep 1, 2015	Keyword Spotting, Speech Recognition	Unverified
Speech Augmentation Based Unsupervised Learning for Keyword Spotting	May 28, 2022	Keyword Spotting	Unverified
Speech Enhancement for Wake-Up-Word detection in Voice Assistants	Jan 29, 2021	Data Augmentation, Denoising	Unverified
Speech-MLP: a simple MLP architecture for speech processing	Sep 29, 2021	Keyword Spotting, Speech Enhancement	Unverified
Speech Privacy Leakage from Shared Gradients in Distributed Learning	Feb 21, 2023	Federated Learning, Keyword Spotting	Unverified
Speech Recognition: Keyword Spotting Through Image Recognition	Mar 10, 2018	image-classification, Image Classification	Unverified
Speech Unlearning	Jun 1, 2025	Adversarial Robustness, Keyword Spotting	Unverified
SpeechYOLO: Detection and Localization of Speech Objects	Apr 14, 2019	General Classification, Keyword Spotting	Unverified
Spiking-LEAF: A Learnable Auditory front-end for Spiking Neural Networks	Sep 18, 2023	Keyword Spotting, Speaker Identification	Unverified
Split Federated Learning on Micro-controllers: A Keyword Spotting Showcase	Oct 4, 2022	Federated Learning, Keyword Spotting	Unverified
Spoken Language Identification using ConvNets	Oct 9, 2019	Keyword Spotting, Language Identification	Unverified
Spot keywords from very noisy and mixed speech	May 28, 2023	Data Augmentation, Keyword Spotting	Unverified
ST-KeyS: Self-Supervised Transformer for Keyword Spotting in Historical Handwritten Documents	Mar 6, 2023	Keyword Spotting, Self-Supervised Learning	Unverified
Streaming Small-Footprint Keyword Spotting using Sequence-to-Sequence Models	Oct 26, 2017	General Classification, Keyword Spotting	Unverified
Streaming Voice Query Recognition using Causal Convolutional Recurrent Neural Networks	Dec 19, 2018	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified
Structured Transforms for Small-Footprint Deep Learning	Oct 6, 2015	Deep Learning, Keyword Spotting	Unverified
以音韻屬性偵測擷取對話語音關鍵詞之研究 (Study on Keyword Spotting using Prosodic Attribute Detection for Conversational Speech) [In Chinese]	Sep 1, 2012	Attribute, Keyword Spotting	Unverified
Sub 8-Bit Quantization of Streaming Keyword Spotting Models for Embedded Chipsets	Jul 13, 2022	CPU, Keyword Spotting	Unverified
SubSpectral Normalization for Neural Audio Data Processing	Mar 25, 2021	Keyword Spotting	Unverified
Synth4Kws: Synthesized Speech for User Defined Keyword Spotting in Low Resource Environments	Jul 23, 2024	Diversity, Keyword Spotting	Unverified
Teaching keyword spotters to spot new keywords with limited examples	Jun 4, 2021	Keyword Spotting	Unverified
Temporal Knowledge Distillation for On-device Audio Classification	Oct 27, 2021	Audio Classification, Classification	Unverified
Ternary Hybrid Neural-Tree Networks for Highly Constrained IoT Applications	Mar 4, 2019	Keyword Spotting, Quantization	Unverified

Datasets: QUESST, Google Speech Commands, hey Siri, FKD, Google Speech Commands V2 35, TensorFlow, VoxForge, Google Speech Commands (v2), Google Speech Commands V2 12

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	NNI non-filtered (for the development set)	Cnxe	6.09	—	Unverified
2	NNI Choi (for the development set)	Cnxe	5.89	—	Unverified
3	NTU rnn (eval)	Cnxe	2.01	—	Unverified
4	NTU dtw (eval)	Cnxe	2.01	—	Unverified
5	NTU dtw (dev)	Cnxe	2.01	—	Unverified
6	NTU rnn (dev)	Cnxe	2.01	—	Unverified
7	ELiRF SDTW (eval)	Cnxe	1.19	—	Unverified
8	ELiRF SDTW-avg (eval)	Cnxe	1.07	—	Unverified
9	ELiRF SDTW (dev)	Cnxe	1.07	—	Unverified
10	CUNY [Subseq+MFCC] (eval)	Cnxe	1.07	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	WaveFormer	Google Speech Commands V2 12	98.8	—	Unverified
2	QNN	Google Speech Commands V2 35	98.6	—	Unverified
3	TripletLoss-res15	Google Speech Commands V1 12	98.56	—	Unverified
4	M2D	Google Speech Commands V2 35	98.5	—	Unverified
5	EAT-S	Google Speech Commands V2 35	98.15	—	Unverified
6	Audio Spectrogram Transformer	Google Speech Commands V2 35	98.11	—	Unverified
7	EdgeCRNN 2.0×	Google Speech Commands V2 12	98.05	—	Unverified
8	BC-ResNet-8	Google Speech Commands V1 12	98	—	Unverified
9	HTS-AT	Google Speech Commands V2 35	98	—	Unverified
10	Wav2KWS	Google Speech Commands V1 12	97.9	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Stacked 1D CNN	Error Rate	1.99	—	Unverified
2	End-to-end DNN-HMM	Error Rate	1.7	—	Unverified
3	HEiMDaL	Error Rate	0.45	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Res26	Accuracy	95.88	—	Unverified
2	EfficientNet-A0 + SA + TL	Accuracy	95.83	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	QuaternionNeuralNetwork	Accuracy (10-fold)	98.53	—	Unverified
2	SSAMBA	Accuracy (10-fold)	97.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TensorFlow's model version 2	TFMA	89.7	—	Unverified
2	TensorFlow's model version 1	TFMA	85.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	2D-ConvNet	Accuracy (%)	95.4	—	Unverified
2	1D-ConvNet	Accuracy (%)	93.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Quaternion Neural Networks	Accuracy (10-fold)	98.53	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MicroNet-KWS-L	Accuracy	95.3	—	Unverified