Keyword Spotting

In speech processing, keyword spotting deals with the identification of keywords in utterances.

( Image credit: Simon Grest )

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 151–200 of 407 papers

Title	Date	Tasks	Status
Fully Unsupervised Training of Few-shot Keyword Spotting	Oct 6, 2022	Keyword SpottingMetric Learning	—Unverified
Frequency & Channel Attention Network for Small Footprint Noisy Spoken Keyword Spotting	Jul 29, 2024	Keyword Spotting	—Unverified
Challenges and Opportunities in Multi-device Speech Processing	Jun 27, 2022	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
An In-Vehicle KWS System with Multi-Source Fusion for Vehicle Applications	Feb 12, 2019	General ClassificationKeyword Spotting	—Unverified
Flexible Keyword Spotting based on Homogeneous Audio-Text Embedding	Aug 12, 2023	Keyword Spotting	—Unverified
Global-Local Convolution with Spiking Neural Networks for Energy-efficient Keyword Spotting	Jun 19, 2024	Keyword Spotting	—Unverified
GraphemeAug: A Systematic Approach to Synthesized Hard Negative Keyword Spotting Examples	May 20, 2025	Keyword Spotting	—Unverified
An Optimized Recurrent Unit for Ultra-Low-Power Keyword Spotting	Feb 13, 2019	Event DetectionKeyword Spotting	—Unverified
GTTS-EHU Systems for QUESST at MediaEval 2014	Oct 16, 2014	Action DetectionActivity Detection	—Unverified
Hardware Aware Training for Efficient Keyword Spotting on General Purpose and Specialized Hardware	Sep 9, 2020	Keyword Spotting	—Unverified
Hardware/Software Co-Design of RISC-V Extensions for Accelerating Sparse DNNs on FPGAs	Apr 28, 2025	Human Detectionimage-classification	—Unverified
HEiMDaL: Highly Efficient Method for Detection and Localization of wake-words	Oct 26, 2022	Keyword Spotting	—Unverified
Contrastive Augmentation: An Unsupervised Learning Approach for Keyword Spotting in Speech Technology	Aug 31, 2024	Contrastive LearningKeyword Spotting	—Unverified
Hierarchical Neural Network Architecture In Keyword Spotting	Nov 6, 2018	Keyword Spottingspeech-recognition	—Unverified
Contrastive Learning With Audio Discrimination For Customizable Keyword Spotting In Continuous Speech	Jan 12, 2024	Contrastive LearningKeyword Spotting	—Unverified
A Fast Network Exploration Strategy to Profile Low Energy Consumption for Keyword Spotting	Feb 4, 2022	Keyword SpottingQuantization	—Unverified
Fixed-point quantization aware training for on-device keyword-spotting	Mar 4, 2023	Keyword SpottingQuantization	—Unverified
How Tiny Can Analog Filterbank Features Be Made for Ultra-low-power On-device Keyword Spotting?	Apr 17, 2023	Keyword Spotting	—Unverified
A Multitask Training Approach to Enhance Whisper with Contextual Biasing and Open-Vocabulary Keyword Spotting	Sep 18, 2023	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
IIIT-H System for MediaEval 2014 QUESST	Oct 16, 2014	Dynamic Time WarpingKeyword Spotting	—Unverified
台語關鍵詞辨識之實作與比較 (Implementation and Comparison of Keyword Spotting for Taiwanese) [In Chinese]	Sep 1, 2012	Keyword Spotting	—Unverified
Implicit Acoustic Echo Cancellation for Keyword Spotting and Device-Directed Speech Detection	Nov 20, 2021	Acoustic echo cancellationKeyword Spotting	—Unverified
CUHK System for QUESST Task of MediaEval 2014	Oct 16, 2014	ClusteringDynamic Time Warping	—Unverified
Improved low-resource Somali speech recognition by semi-supervised acoustic and language model training	Jul 6, 2019	Acoustic ModellingAutomatic Speech Recognition	—Unverified
Finding Opinion Manipulation Trolls in News Community Forums	Jul 1, 2015	Keyword SpottingSentiment Analysis	—Unverified
Filterbank Learning for Noise-Robust Small-Footprint Keyword Spotting	Nov 19, 2022	Keyword SpottingSmall-Footprint Keyword Spotting	—Unverified
Improving Reverberant Speech Training Using Diffuse Acoustic Simulation	Jul 9, 2019	BIG-bench Machine LearningKeyword Spotting	—Unverified
Improving Small Footprint Few-shot Keyword Spotting with Supervision on Auxiliary Data	Aug 31, 2023	Keyword SpottingMulti-Task Learning	—Unverified
BUT QUESST 2015 System Description	Sep 14, 2015	Dynamic Time WarpingKeyword Spotting	—Unverified
Improving vision-inspired keyword spotting using dynamic module skipping in streaming conformer encoder	Aug 31, 2023	Keyword Spotting	—Unverified
An Integrated Framework for Two-pass Personalized Voice Trigger	Jun 30, 2021	Keyword SpottingMulti-Task Learning	—Unverified
DASB -- Discrete Audio and Speech Benchmark	Jun 20, 2024	BenchmarkingEmotion Recognition	—Unverified
BUT QUESST 2014 System Description	Oct 16, 2014	Dynamic Time WarpingKeyword Spotting	—Unverified
Data Augmentation for Robust Keyword Spotting under Playback Interference	Aug 1, 2018	Acoustic echo cancellationData Augmentation	—Unverified
Keyword-Guided Adaptation of Automatic Speech Recognition	Jun 4, 2024	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
DCCRN-KWS: an audio bias based model for noise robust small-footprint keyword spotting	May 21, 2023	DenoisingKeyword Spotting	—Unverified
A Channel-Pruned and Weight-Binarized Convolutional Neural Network for Keyword Spotting	Sep 12, 2019	BinarizationGeneral Classification	—Unverified
Keyword spotting -- Detecting commands in speech using deep learning	Dec 9, 2023	Deep LearningFeature Engineering	—Unverified
Keyword spotting for audiovisual archival search in Uralic languages	Sep 1, 2021	Keyword Spotting	—Unverified
Keyword Spotting for Hearing Assistive Devices Robust to External Speakers	Jun 22, 2019	Keyword SpottingMulti-Task Learning	—Unverified
An Exploration into the Performance of Unsupervised Cross-Task Speech Representations for "In the Wild'' Edge Applications	May 9, 2023	Emotion Recognitionintent-classification	—Unverified
A 14uJ/Decision Keyword Spotting Accelerator with In-SRAM-Computing and On Chip Learning for Customization	May 10, 2022	Keyword SpottingQuantization	—Unverified
Leveraging Large Language Models for Exploiting ASR Uncertainty	Sep 9, 2023	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
A Lightweight dynamic filter for keyword spotting	Sep 23, 2021	Keyword Spottingspeech-recognition	—Unverified
Latency Control for Keyword Spotting	Jun 15, 2022	Keyword Spotting	—Unverified
Learnable Front Ends Based on Temporal Modulation for Music Tagging	Nov 28, 2022	Keyword SpottingMusic Tagging	—Unverified
FEL: High Capacity Learning for Recommendation and Ranking via Federated Ensemble Learning	Jun 7, 2022	Ensemble LearningFederated Learning	—Unverified
Learning Decoupling Features Through Orthogonality Regularization	Mar 31, 2022	Keyword SpottingSpeaker Verification	—Unverified
Feature exploration for almost zero-resource ASR-free keyword spotting using a multilingual bottleneck extractor and correspondence autoencoders	Nov 14, 2018	Dynamic Time WarpingHumanitarian	—Unverified
Feature learning for efficient ASR-free keyword spotting in low-resource languages	Aug 13, 2021	Dynamic Time WarpingHumanitarian	—Unverified

Show:10 25 50

← PrevPage 4 of 9Next →

All datasets QUESST Google Speech Commands hey Siri FKD Google Speech Commands V2 35 TensorFlow VoxForge Google Speech Commands (v2)Google Speech Commands V2 12

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	NNI non-filtered(for the development set)	Cnxe	6.09	—	Unverified
2	NNI Choi(for the development set)	Cnxe	5.89	—	Unverified
3	NTU rnn (eval)	Cnxe	2.01	—	Unverified
4	NTU dtw (eval)	Cnxe	2.01	—	Unverified
5	NTU dtw (dev)	Cnxe	2.01	—	Unverified
6	NTU rnn (dev)	Cnxe	2.01	—	Unverified
7	ELiRF SDTW (eval)	Cnxe	1.19	—	Unverified
8	ELiRF SDTW-avg (eval)	Cnxe	1.07	—	Unverified
9	ELiRF SDTW (dev)	Cnxe	1.07	—	Unverified
10	CUNY [Subseq+MFCC] (eval)	Cnxe	1.07	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	WaveFormer	Google Speech Commands V2 12	98.8	—	Unverified
2	QNN	Google Speech Commands V2 35	98.6	—	Unverified
3	TripletLoss-res15	Google Speech Commands V1 12	98.56	—	Unverified
4	M2D	Google Speech Commands V2 35	98.5	—	Unverified
5	EAT-S	Google Speech Commands V2 35	98.15	—	Unverified
6	Audio Spectrogram Transformer	Google Speech Commands V2 35	98.11	—	Unverified
7	EdgeCRNN 2.0×	Google Speech Commands V2 12	98.05	—	Unverified
8	BC-ResNet-8	Google Speech Commands V1 12	98	—	Unverified
9	HTS-AT	Google Speech Commands V2 35	98	—	Unverified
10	Wav2KWS	Google Speech Commands V1 12	97.9	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Stacked 1D CNN	Error Rate	1.99	—	Unverified
2	End-to-end DNN-HMM	Error Rate	1.7	—	Unverified
3	HEiMDaL	Error Rate	0.45	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Res26	Accuracy	95.88	—	Unverified
2	EfficientNet-A0 + SA + TL	Accuracy	95.83	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	QuaternionNeuralNetwork	Accuracy (10-fold)	98.53	—	Unverified
2	SSAMBA	Accuracy (10-fold)	97.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TensorFlow's model version 2	TFMA	89.7	—	Unverified
2	TensorFlow's model version 1	TFMA	85.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	2D-ConvNet	Accuracy (%)	95.4	—	Unverified
2	1D-ConvNet	Accuracy (%)	93.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Quaternion Neural Networks	Accuracy(10-fold)	98.53	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MicroNet-KWS-L	Accuracy	95.3	—	Unverified