Keyword Spotting

In speech processing, keyword spotting deals with the identification of keywords in utterances.

(Image credit: Simon Grest)

Papers

Showing 226–250 of 407 papers

Title	Date	Tasks	Status	Hype
Feature learning for efficient ASR-free keyword spotting in low-resource languages	Aug 13, 2021	Dynamic Time Warping, Humanitarian	Unverified	0
Text Anchor Based Metric Learning for Small-footprint Keyword Spotting	Aug 12, 2021	Keyword Spotting, Metric Learning	Unverified	0
Bifocal Neural ASR: Exploiting Keyword Spotting for Inference Optimization	Aug 3, 2021	Inference Optimization, Keyword Spotting	Unverified	0
Proposal-based Few-shot Sound Event Detection for Speech and Environmental Sounds with Perceivers	Jul 28, 2021	Event Detection, Keyword Spotting	Unverified	0
Multi-task Learning with Cross Attention for Keyword Spotting	Jul 15, 2021	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified	0
AUC Optimization for Robust Small-footprint Keyword Spotting with Limited Training Data	Jul 13, 2021	Keyword Spotting, Small-Footprint Keyword Spotting	Unverified	0
An Integrated Framework for Two-pass Personalized Voice Trigger	Jun 30, 2021	Keyword Spotting, Multi-Task Learning	Unverified	0
PQK: Model Compression via Pruning, Quantization, and Knowledge Distillation	Jun 25, 2021	Keyword Spotting, Knowledge Distillation	Unverified	0
Evaluation of a Region Proposal Architecture for Multi-task Document Layout Analysis	Jun 22, 2021	Document Layout Analysis, Keyword Spotting	Unverified	0
Zero-Shot Federated Learning with New Classes for Audio Classification	Jun 18, 2021	Audio Classification, Classification	Unverified	0
MLPerf Tiny Benchmark	Jun 14, 2021	Anomaly Detection, BIG-bench Machine Learning	Code Available	1
Broadcasted Residual Learning for Efficient Keyword Spotting	Jun 8, 2021	Keyword Spotting	Code Available	1
Encoder-Decoder Neural Architecture Optimization for Keyword Spotting	Jun 4, 2021	Decoder, Keyword Spotting	Unverified	0
Teaching keyword spotters to spot new keywords with limited examples	Jun 4, 2021	Keyword Spotting	Unverified	0
Noisy student-teacher training for robust keyword spotting	Jun 3, 2021	Data Augmentation, Keyword Spotting	Unverified	0
A Streaming End-to-End Framework For Spoken Language Understanding	May 20, 2021	Intent Detection, Keyword Spotting	Unverified	0
Wav2KWS: Transfer Learning from Speech Representations for Keyword Spotting	May 10, 2021	Keyword Spotting, text-to-speech	Code Available	1
Building and benchmarking an Arabic Speech Commands dataset for small-footprint keyword spotting	May 7, 2021	Benchmarking, Deep Learning	Code Available	0
Efficient Keyword Spotting by capturing long-range interactions with Temporal Lambda Networks	Apr 16, 2021	Keyword Spotting, speech-recognition	Code Available	0
End-to-end Keyword Spotting using Neural Architecture Search and Quantization	Apr 14, 2021	Keyword Spotting, Neural Architecture Search	Unverified	0
The DKU System Description for The Interspeech 2021 Auto-KWS Challenge	Apr 11, 2021	Dynamic Time Warping, Keyword Spotting	Unverified	0
A Probabilistic Framework for Lexicon-based Keyword Spotting in Handwritten Text Images	Apr 9, 2021	Benchmarking, Keyword Spotting	Unverified	0
AST: Audio Spectrogram Transformer	Apr 5, 2021	Audio Classification, Audio Tagging	Code Available	2
Few-Shot Keyword Spotting in Any Language	Apr 3, 2021	Keyword Spotting, Transfer Learning	Code Available	1
PATE-AAE: Incorporating Adversarial Autoencoder into Private Aggregation of Teacher Ensembles for Spoken Command Classification	Apr 2, 2021	Generative Adversarial Network, Keyword Spotting	Unverified	0

Datasets: QUESST; Google Speech Commands; hey Siri; FKD; Google Speech Commands V2 35; TensorFlow; VoxForge; Google Speech Commands (v2); Google Speech Commands V2 12

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	NNI non-filtered (for the development set)	Cnxe	6.09	—	Unverified
2	NNI Choi (for the development set)	Cnxe	5.89	—	Unverified
3	NTU rnn (eval)	Cnxe	2.01	—	Unverified
4	NTU dtw (eval)	Cnxe	2.01	—	Unverified
5	NTU dtw (dev)	Cnxe	2.01	—	Unverified
6	NTU rnn (dev)	Cnxe	2.01	—	Unverified
7	ELiRF SDTW (eval)	Cnxe	1.19	—	Unverified
8	ELiRF SDTW-avg (eval)	Cnxe	1.07	—	Unverified
9	ELiRF SDTW (dev)	Cnxe	1.07	—	Unverified
10	CUNY [Subseq+MFCC] (eval)	Cnxe	1.07	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	WaveFormer	Google Speech Commands V2 12	98.8	—	Unverified
2	QNN	Google Speech Commands V2 35	98.6	—	Unverified
3	TripletLoss-res15	Google Speech Commands V1 12	98.56	—	Unverified
4	M2D	Google Speech Commands V2 35	98.5	—	Unverified
5	EAT-S	Google Speech Commands V2 35	98.15	—	Unverified
6	Audio Spectrogram Transformer	Google Speech Commands V2 35	98.11	—	Unverified
7	EdgeCRNN 2.0×	Google Speech Commands V2 12	98.05	—	Unverified
8	BC-ResNet-8	Google Speech Commands V1 12	98	—	Unverified
9	HTS-AT	Google Speech Commands V2 35	98	—	Unverified
10	Wav2KWS	Google Speech Commands V1 12	97.9	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Stacked 1D CNN	Error Rate	1.99	—	Unverified
2	End-to-end DNN-HMM	Error Rate	1.7	—	Unverified
3	HEiMDaL	Error Rate	0.45	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Res26	Accuracy	95.88	—	Unverified
2	EfficientNet-A0 + SA + TL	Accuracy	95.83	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	QuaternionNeuralNetwork	Accuracy (10-fold)	98.53	—	Unverified
2	SSAMBA	Accuracy (10-fold)	97.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TensorFlow's model version 2	TFMA	89.7	—	Unverified
2	TensorFlow's model version 1	TFMA	85.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	2D-ConvNet	Accuracy (%)	95.4	—	Unverified
2	1D-ConvNet	Accuracy (%)	93.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Quaternion Neural Networks	Accuracy (10-fold)	98.53	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MicroNet-KWS-L	Accuracy	95.3	—	Unverified