Keyword Spotting

In speech processing, keyword spotting deals with the identification of keywords in utterances.

( Image credit: Simon Grest )

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 151–175 of 407 papers

Title	Date	Tasks	Status	Hype
Metric Learning for User-defined Keyword Spotting	Nov 1, 2022	Keyword SpottingMetric Learning	—Unverified	0
WeKws: A production first small-footprint end-to-end Keyword Spotting Toolkit	Oct 30, 2022	Keyword Spotting	CodeCode Available	2
Application of Knowledge Distillation to Multi-task Speech Representation Learning	Oct 29, 2022	Keyword SpottingKnowledge Distillation	—Unverified	0
HEiMDaL: Highly Efficient Method for Detection and Localization of wake-words	Oct 26, 2022	Keyword Spotting	—Unverified	0
Masked Modeling Duo: Learning Representations by Encouraging Both Networks to Model the Input	Oct 26, 2022	Audio ClassificationAudio Tagging	—Unverified	0
Discriminatory and orthogonal feature learning for noise robust keyword spotting	Oct 20, 2022	Keyword SpottingTriplet	—Unverified	0
Fully Unsupervised Training of Few-shot Keyword Spotting	Oct 6, 2022	Keyword SpottingMetric Learning	—Unverified	0
Split Federated Learning on Micro-controllers: A Keyword Spotting Showcase	Oct 4, 2022	Federated LearningKeyword Spotting	—Unverified	0
Improving Label-Deficient Keyword Spotting Through Self-Supervised Pretraining	Oct 4, 2022	Keyword SpottingSelf-Supervised Learning	CodeCode Available	1
Recycle Your Wav2Vec2 Codebook: A Speech Perceiver for Keyword Spotting	Oct 1, 2022	Keyword SpottingSmall-Footprint Keyword Spotting	—Unverified	0
SiDi KWS: A Large-Scale Multilingual Dataset for Keyword Spotting	Sep 22, 2022	Deep LearningKeyword Extraction	CodeCode Available	1
A Few Shot Multi-Representation Approach for N-gram Spotting in Historical Manuscripts	Sep 21, 2022	Few-Shot LearningHandwritten Text Recognition	—Unverified	0
IndicSUPERB: A Speech Processing Universal Performance Benchmark for Indian languages	Aug 24, 2022	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	CodeCode Available	1
ReckOn: A 28nm Sub-mm2 Task-Agnostic Spiking Recurrent Neural Network Processor Enabling On-Chip Learning over Second-Long Timescales	Aug 20, 2022	Gesture RecognitionKeyword Spotting	—Unverified	0
An Anchor-Free Detector for Continuous Speech Keyword Spotting	Aug 9, 2022	Keyword Spottingobject-detection	—Unverified	0
Keyword Spotting System and Evaluation of Pruning and Quantization Methods on Low-power Edge Microcontrollers	Aug 4, 2022	Edge-computingKeyword Spotting	CodeCode Available	1
T-RECX: Tiny-Resource Efficient Convolutional neural networks with early-eXit	Jul 14, 2022	image-classificationImage Classification	—Unverified	0
Sub 8-Bit Quantization of Streaming Keyword Spotting Models for Embedded Chipsets	Jul 13, 2022	CPUKeyword Spotting	—Unverified	0
Wakeword Detection under Distribution Shifts	Jul 13, 2022	Keyword Spotting	—Unverified	0
Distilled Non-Semantic Speech Embeddings with Binary Neural Networks for Low-Resource Devices	Jul 12, 2022	Emotion RecognitionKeyword Spotting	CodeCode Available	0
Learning Audio-Text Agreement for Open-vocabulary Keyword Spotting	Jun 30, 2022	Keyword Spotting	CodeCode Available	1
Dummy Prototypical Networks for Few-Shot Open-Set Keyword Spotting	Jun 28, 2022	Keyword SpottingMetric Learning	—Unverified	0
Personalized Keyword Spotting through Multi-task Learning	Jun 28, 2022	Keyword SpottingMulti-Task Learning	—Unverified	0
Challenges and Opportunities in Multi-device Speech Processing	Jun 27, 2022	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified	0
QbyE-MLPMixer: Query-by-Example Open-Vocabulary Keyword Spotting using MLPMixer	Jun 23, 2022	Keyword Spotting	—Unverified	0

Show:10 25 50

← PrevPage 7 of 17Next →

All datasets QUESST Google Speech Commands hey Siri FKD Google Speech Commands V2 35 TensorFlow VoxForge Google Speech Commands (v2)Google Speech Commands V2 12

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	NNI non-filtered(for the development set)	Cnxe	6.09	—	Unverified
2	NNI Choi(for the development set)	Cnxe	5.89	—	Unverified
3	NTU rnn (eval)	Cnxe	2.01	—	Unverified
4	NTU dtw (eval)	Cnxe	2.01	—	Unverified
5	NTU dtw (dev)	Cnxe	2.01	—	Unverified
6	NTU rnn (dev)	Cnxe	2.01	—	Unverified
7	ELiRF SDTW (eval)	Cnxe	1.19	—	Unverified
8	ELiRF SDTW-avg (eval)	Cnxe	1.07	—	Unverified
9	ELiRF SDTW (dev)	Cnxe	1.07	—	Unverified
10	CUNY [Subseq+MFCC] (eval)	Cnxe	1.07	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	WaveFormer	Google Speech Commands V2 12	98.8	—	Unverified
2	QNN	Google Speech Commands V2 35	98.6	—	Unverified
3	TripletLoss-res15	Google Speech Commands V1 12	98.56	—	Unverified
4	M2D	Google Speech Commands V2 35	98.5	—	Unverified
5	EAT-S	Google Speech Commands V2 35	98.15	—	Unverified
6	Audio Spectrogram Transformer	Google Speech Commands V2 35	98.11	—	Unverified
7	EdgeCRNN 2.0×	Google Speech Commands V2 12	98.05	—	Unverified
8	BC-ResNet-8	Google Speech Commands V1 12	98	—	Unverified
9	HTS-AT	Google Speech Commands V2 35	98	—	Unverified
10	Wav2KWS	Google Speech Commands V1 12	97.9	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Stacked 1D CNN	Error Rate	1.99	—	Unverified
2	End-to-end DNN-HMM	Error Rate	1.7	—	Unverified
3	HEiMDaL	Error Rate	0.45	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Res26	Accuracy	95.88	—	Unverified
2	EfficientNet-A0 + SA + TL	Accuracy	95.83	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	QuaternionNeuralNetwork	Accuracy (10-fold)	98.53	—	Unverified
2	SSAMBA	Accuracy (10-fold)	97.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TensorFlow's model version 2	TFMA	89.7	—	Unverified
2	TensorFlow's model version 1	TFMA	85.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	2D-ConvNet	Accuracy (%)	95.4	—	Unverified
2	1D-ConvNet	Accuracy (%)	93.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Quaternion Neural Networks	Accuracy(10-fold)	98.53	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MicroNet-KWS-L	Accuracy	95.3	—	Unverified