Keyword Spotting

In speech processing, keyword spotting deals with the identification of keywords in utterances.

( Image credit: Simon Grest )

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 201–250 of 407 papers

Title	Date	Tasks	Status
Challenges and Opportunities in Multi-device Speech Processing	Jun 27, 2022	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Characterizing Linguistic Attributes for Automatic Classification of Intent Based Racist/Radicalized Posts on Tumblr Micro-Blogging Website	Jan 18, 2017	Ensemble LearningKeyword Spotting	—Unverified
Conditional Online Learning for Keyword Spotting	May 19, 2023	Continual LearningKeyword Spotting	—Unverified
Continuous-Time Analog Filters for Audio Edge Intelligence: Review on Circuit Designs	Jun 6, 2022	Keyword Spotting	—Unverified
Contrastive Augmentation: An Unsupervised Learning Approach for Keyword Spotting in Speech Technology	Aug 31, 2024	Contrastive LearningKeyword Spotting	—Unverified
Streaming Voice Query Recognition using Causal Convolutional Recurrent Neural Networks	Dec 19, 2018	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Structured Transforms for Small-Footprint Deep Learning	Oct 6, 2015	Deep LearningKeyword Spotting	—Unverified
以音韻屬性偵測擷取對話語音關鍵詞之研究 (Study on Keyword Spotting using Prosodic Attribute Detection for Conversational Speech) [In Chinese]	Sep 1, 2012	AttributeKeyword Spotting	—Unverified
Sub 8-Bit Quantization of Streaming Keyword Spotting Models for Embedded Chipsets	Jul 13, 2022	CPUKeyword Spotting	—Unverified
SubSpectral Normalization for Neural Audio Data Processing	Mar 25, 2021	Keyword Spotting	—Unverified
Synth4Kws: Synthesized Speech for User Defined Keyword Spotting in Low Resource Environments	Jul 23, 2024	DiversityKeyword Spotting	—Unverified
Teaching keyword spotters to spot new keywords with limited examples	Jun 4, 2021	Keyword Spotting	—Unverified
Temporal Knowledge Distillation for On-device Audio Classification	Oct 27, 2021	Audio ClassificationClassification	—Unverified
Ternary Hybrid Neural-Tree Networks for Highly Constrained IoT Applications	Mar 4, 2019	Keyword SpottingQuantization	—Unverified
Text Anchor Based Metric Learning for Small-footprint Keyword Spotting	Aug 12, 2021	Keyword SpottingMetric Learning	—Unverified
Text-Aware Adapter for Few-Shot Keyword Spotting	Dec 24, 2024	Keyword SpottingTransfer Learning	—Unverified
The DKU System Description for The Interspeech 2021 Auto-KWS Challenge	Apr 11, 2021	Dynamic Time WarpingKeyword Spotting	—Unverified
The Effects of Data Collection Methods in Twitter	Nov 1, 2016	Keyword Spotting	—Unverified
The IIT-B Query-by-Example System for MediaEval 2015	Sep 14, 2015	Keyword Spotting	—Unverified
The NNI Query-by-Example System for MediaEval 2014	Oct 16, 2014	Dynamic Time WarpingKeyword Spotting	—Unverified
The NNI Query-by-Example System for MediaEval 2015	Sep 14, 2015	Keyword Spotting	—Unverified
The NPU System for the 2020 Personalized Voice Trigger Challenge	Feb 26, 2021	Keyword SpottingSmall-Footprint Keyword Spotting	—Unverified
The RATS Collection: Supporting HLT Research with Degraded Audio Data	May 1, 2014	Action DetectionActivity Detection	—Unverified
The Role of Temporal Hierarchy in Spiking Neural Networks	Jul 26, 2024	Inductive BiasKeyword Spotting	—Unverified
The SPL-IT Query by Example Search on Speech system for MediaEval 2014	Oct 16, 2014	Dynamic Time WarpingKeyword Spotting	—Unverified
The SPL-IT-UC Query by Example Search on Speech system for MediaEval 2015	Sep 14, 2015	Dynamic Time WarpingKeyword Spotting	—Unverified
TinySV: Speaker Verification in TinyML with On-device Learning	Jun 3, 2024	Keyword SpottingSpeaker Verification	—Unverified
To Wake-up or Not to Wake-up: Reducing Keyword False Alarm by Successive Refinement	Apr 6, 2023	Keyword Spotting	—Unverified
Toward noise-robust whisper keyword spotting on headphones with in-earcup microphone and curriculum learning	Feb 1, 2025	Keyword Spotting	—Unverified
Towards Contactless Elevators with TinyML using CNN-based Person Detection and Keyword Spotting	May 19, 2024	Human DetectionKeyword Spotting	—Unverified
Towards efficient keyword spotting using spike-based time difference encoders	Mar 19, 2025	Keyword Spotting	—Unverified
Towards hate speech detection in low-resource languages: Comparing ASR to acoustic word embeddings on Wolof and Swahili	Jun 1, 2023	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Towards Robust Domain Generalization in 2D Neural Audio Processing	Sep 29, 2021	Acoustic Scene ClassificationDomain Generalization	—Unverified
Training Keyword Spotting Models on Non-IID Data with Federated Learning	May 21, 2020	Data AugmentationFederated Learning	—Unverified
Training Wake Word Detection with Synthesized Speech Data on Confusion Words	Nov 3, 2020	Data AugmentationKeyword Spotting	—Unverified
Transfer Learning for a Letter-Ngrams to Word Decoder in the Context of Historical Handwriting Recognition with Scarce Resources	Aug 1, 2018	DecoderHandwriting Recognition	—Unverified
T-RECX: Tiny-Resource Efficient Convolutional neural networks with early-eXit	Jul 14, 2022	image-classificationImage Classification	—Unverified
TUKE at MediaEval 2015 QUESST	Sep 14, 2015	Dynamic Time WarpingKeyword Spotting	—Unverified
TUKE System for MediaEval 2014 QUESST	Oct 16, 2014	ClusteringKeyword Spotting	—Unverified
U2-KWS: Unified Two-pass Open-vocabulary Keyword Spotting with Keyword Bias	Dec 15, 2023	DecoderKeyword Spotting	—Unverified
Ultra-Low Power Keyword Spotting at the Edge	Nov 9, 2021	Keyword SpottingModel Optimization	—Unverified
Understanding Self-Supervised Learning of Speech Representation via Invariance and Redundancy Reduction	Sep 7, 2023	Keyword SpottingSelf-Supervised Learning	—Unverified
Utilizing TTS Synthesized Data for Efficient Development of Keyword Spotting Model	Jul 26, 2024	2kDiversity	—Unverified
VIC-KD: Variance-Invariance-Covariance Knowledge Distillation to Make Keyword Spotting More Robust Against Adversarial Attacks	Sep 22, 2023	Adversarial RobustnessKeyword Spotting	—Unverified
Visually grounded cross-lingual keyword spotting in speech	Jun 13, 2018	Keyword SpottingVisual Grounding	—Unverified
Vocal Tract Length Warped Features for Spoken Keyword Spotting	Jan 7, 2025	Keyword Spotting	—Unverified
VSVC: Backdoor attack against Keyword Spotting based on Voiceprint Selection and Voice Conversion	Dec 20, 2022	Backdoor AttackKeyword Spotting	—Unverified
Wakeword Detection under Distribution Shifts	Jul 13, 2022	Keyword Spotting	—Unverified
WaveSense: Efficient Temporal Convolutions with Spiking Neural Networks for Keyword Spotting	Nov 2, 2021	Keyword Spotting	—Unverified
WCTC-Biasing: Retraining-free Contextual Biasing ASR with Wildcard CTC-based Keyword Spotting and Inter-layer Biasing	Jun 2, 2025	Keyword Spottingspeech-recognition	—Unverified

Show:10 25 50

← PrevPage 5 of 9Next →

All datasets QUESST Google Speech Commands hey Siri FKD Google Speech Commands V2 35 TensorFlow VoxForge Google Speech Commands (v2)Google Speech Commands V2 12

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	NNI non-filtered(for the development set)	Cnxe	6.09	—	Unverified
2	NNI Choi(for the development set)	Cnxe	5.89	—	Unverified
3	NTU rnn (eval)	Cnxe	2.01	—	Unverified
4	NTU dtw (eval)	Cnxe	2.01	—	Unverified
5	NTU dtw (dev)	Cnxe	2.01	—	Unverified
6	NTU rnn (dev)	Cnxe	2.01	—	Unverified
7	ELiRF SDTW (eval)	Cnxe	1.19	—	Unverified
8	ELiRF SDTW-avg (eval)	Cnxe	1.07	—	Unverified
9	ELiRF SDTW (dev)	Cnxe	1.07	—	Unverified
10	CUNY [Subseq+MFCC] (eval)	Cnxe	1.07	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	WaveFormer	Google Speech Commands V2 12	98.8	—	Unverified
2	QNN	Google Speech Commands V2 35	98.6	—	Unverified
3	TripletLoss-res15	Google Speech Commands V1 12	98.56	—	Unverified
4	M2D	Google Speech Commands V2 35	98.5	—	Unverified
5	EAT-S	Google Speech Commands V2 35	98.15	—	Unverified
6	Audio Spectrogram Transformer	Google Speech Commands V2 35	98.11	—	Unverified
7	EdgeCRNN 2.0×	Google Speech Commands V2 12	98.05	—	Unverified
8	BC-ResNet-8	Google Speech Commands V1 12	98	—	Unverified
9	HTS-AT	Google Speech Commands V2 35	98	—	Unverified
10	Wav2KWS	Google Speech Commands V1 12	97.9	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Stacked 1D CNN	Error Rate	1.99	—	Unverified
2	End-to-end DNN-HMM	Error Rate	1.7	—	Unverified
3	HEiMDaL	Error Rate	0.45	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Res26	Accuracy	95.88	—	Unverified
2	EfficientNet-A0 + SA + TL	Accuracy	95.83	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	QuaternionNeuralNetwork	Accuracy (10-fold)	98.53	—	Unverified
2	SSAMBA	Accuracy (10-fold)	97.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TensorFlow's model version 2	TFMA	89.7	—	Unverified
2	TensorFlow's model version 1	TFMA	85.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	2D-ConvNet	Accuracy (%)	95.4	—	Unverified
2	1D-ConvNet	Accuracy (%)	93.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Quaternion Neural Networks	Accuracy(10-fold)	98.53	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MicroNet-KWS-L	Accuracy	95.3	—	Unverified