Keyword Spotting

In speech processing, keyword spotting deals with the identification of keywords in utterances.

( Image credit: Simon Grest )

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 201–250 of 407 papers

Title	Date	Tasks	Status	Hype
A Fast Network Exploration Strategy to Profile Low Energy Consumption for Keyword Spotting	Feb 4, 2022	Keyword SpottingQuantization	—Unverified	0
Keyword localisation in untranscribed speech using visually grounded speech models	Feb 2, 2022	Keyword SpottingTAG	CodeCode Available	0
HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection	Feb 2, 2022	Audio ClassificationEvent Detection	CodeCode Available	2
Progressive Continual Learning for Spoken Keyword Spotting	Jan 29, 2022	Continual LearningKeyword Spotting	CodeCode Available	1
Tiny, always-on and fragile: Bias propagation through design choices in on-device machine learning workflows	Jan 19, 2022	Keyword Spotting	CodeCode Available	0
ImportantAug: a data augmentation agent for speech	Dec 14, 2021	Data AugmentationKeyword Spotting	CodeCode Available	0
BBS-KWS:The Mandarin Keyword Spotting System Won the Video Keyword Wakeup Challenge	Dec 3, 2021	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified	0
Phone Based Keyword Spotting for Transcribing Very Low Resource Languages	Dec 1, 2021	Dynamic Time WarpingKeyword Spotting	—Unverified	0
Implicit Acoustic Echo Cancellation for Keyword Spotting and Device-Directed Speech Detection	Nov 20, 2021	Acoustic echo cancellationKeyword Spotting	—Unverified	0
Deep Spoken Keyword Spotting: An Overview	Nov 20, 2021	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified	0
AnalogNets: ML-HW Co-Design of Noise-robust TinyML Models and Always-On Analog Compute-in-Memory Accelerator	Nov 10, 2021	Keyword Spotting	—Unverified	0
Ultra-Low Power Keyword Spotting at the Edge	Nov 9, 2021	Keyword SpottingModel Optimization	—Unverified	0
WaveSense: Efficient Temporal Convolutions with Spiking Neural Networks for Keyword Spotting	Nov 2, 2021	Keyword Spotting	—Unverified	0
Temporal Knowledge Distillation for On-device Audio Classification	Oct 27, 2021	Audio ClassificationClassification	—Unverified	0
SSAST: Self-Supervised Audio Spectrogram Transformer	Oct 19, 2021	Audio ClassificationClassification	CodeCode Available	2
Attention-Free Keyword Spotting	Oct 14, 2021	Keyword Spotting	CodeCode Available	1
End-to-end Keyword Spotting using Xception-1d	Oct 9, 2021	Keyword Spotting	CodeCode Available	0
PEAF: Learnable Power Efficient Analog Acoustic Features for Audio Recognition	Oct 7, 2021	Action DetectionActivity Detection	—Unverified	0
Multi-task Voice Activated Framework using Self-supervised Learning	Oct 3, 2021	Emotion ClassificationKeyword Spotting	—Unverified	0
Towards Robust Domain Generalization in 2D Neural Audio Processing	Sep 29, 2021	Acoustic Scene ClassificationDomain Generalization	—Unverified	0
Speech-MLP: a simple MLP architecture for speech processing	Sep 29, 2021	Keyword SpottingSpeech Enhancement	—Unverified	0
A Lightweight dynamic filter for keyword spotting	Sep 23, 2021	Keyword Spottingspeech-recognition	—Unverified	0
Audiomer: A Convolutional Transformer For Keyword Spotting	Sep 21, 2021	Keyword Spotting	CodeCode Available	0
Behavior of Keyword Spotting Networks Under Noisy Conditions	Sep 15, 2021	Keyword Spotting	—Unverified	0
Keyword spotting for audiovisual archival search in Uralic languages	Sep 1, 2021	Keyword Spotting	—Unverified	0
Feature learning for efficient ASR-free keyword spotting in low-resource languages	Aug 13, 2021	Dynamic Time WarpingHumanitarian	—Unverified	0
Text Anchor Based Metric Learning for Small-footprint Keyword Spotting	Aug 12, 2021	Keyword SpottingMetric Learning	—Unverified	0
Bifocal Neural ASR: Exploiting Keyword Spotting for Inference Optimization	Aug 3, 2021	Inference OptimizationKeyword Spotting	—Unverified	0
Proposal-based Few-shot Sound Event Detection for Speech and Environmental Sounds with Perceivers	Jul 28, 2021	Event DetectionKeyword Spotting	—Unverified	0
Multi-task Learning with Cross Attention for Keyword Spotting	Jul 15, 2021	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified	0
AUC Optimization for Robust Small-footprint Keyword Spotting with Limited Training Data	Jul 13, 2021	Keyword SpottingSmall-Footprint Keyword Spotting	—Unverified	0
An Integrated Framework for Two-pass Personalized Voice Trigger	Jun 30, 2021	Keyword SpottingMulti-Task Learning	—Unverified	0
PQK: Model Compression via Pruning, Quantization, and Knowledge Distillation	Jun 25, 2021	Keyword SpottingKnowledge Distillation	—Unverified	0
Evaluation of a Region Proposal Architecture for Multi-task Document Layout Analysis	Jun 22, 2021	Document Layout AnalysisKeyword Spotting	—Unverified	0
Zero-Shot Federated Learning with New Classes for Audio Classification	Jun 18, 2021	Audio ClassificationClassification	—Unverified	0
MLPerf Tiny Benchmark	Jun 14, 2021	Anomaly DetectionBIG-bench Machine Learning	CodeCode Available	1
Broadcasted Residual Learning for Efficient Keyword Spotting	Jun 8, 2021	Keyword Spotting	CodeCode Available	1
Encoder-Decoder Neural Architecture Optimization for Keyword Spotting	Jun 4, 2021	DecoderKeyword Spotting	—Unverified	0
Teaching keyword spotters to spot new keywords with limited examples	Jun 4, 2021	Keyword Spotting	—Unverified	0
Noisy student-teacher training for robust keyword spotting	Jun 3, 2021	Data AugmentationKeyword Spotting	—Unverified	0
A Streaming End-to-End Framework For Spoken Language Understanding	May 20, 2021	Intent DetectionKeyword Spotting	—Unverified	0
Wav2KWS: Transfer Learning from Speech Representations for Keyword Spotting	May 10, 2021	Keyword Spottingtext-to-speech	CodeCode Available	1
Building and benchmarking an Arabic Speech Commands dataset for small-footprint keyword spotting	May 7, 2021	BenchmarkingDeep Learning	CodeCode Available	0
Efficient Keyword Spotting by capturing long-range interactions with Temporal Lambda Networks	Apr 16, 2021	Keyword Spottingspeech-recognition	CodeCode Available	0
End-to-end Keyword Spotting using Neural Architecture Search and Quantization	Apr 14, 2021	Keyword SpottingNeural Architecture Search	—Unverified	0
The DKU System Description for The Interspeech 2021 Auto-KWS Challenge	Apr 11, 2021	Dynamic Time WarpingKeyword Spotting	—Unverified	0
A Probabilistic Framework for Lexicon-based Keyword Spotting in Handwritten Text Images	Apr 9, 2021	BenchmarkingKeyword Spotting	—Unverified	0
AST: Audio Spectrogram Transformer	Apr 5, 2021	Audio ClassificationAudio Tagging	CodeCode Available	2
Few-Shot Keyword Spotting in Any Language	Apr 3, 2021	Keyword SpottingTransfer Learning	CodeCode Available	1
PATE-AAE: Incorporating Adversarial Autoencoder into Private Aggregation of Teacher Ensembles for Spoken Command Classification	Apr 2, 2021	Generative Adversarial NetworkKeyword Spotting	—Unverified	0

Show:10 25 50

← PrevPage 5 of 9Next →

All datasets QUESST Google Speech Commands hey Siri FKD Google Speech Commands V2 35 TensorFlow VoxForge Google Speech Commands (v2)Google Speech Commands V2 12

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	NNI non-filtered(for the development set)	Cnxe	6.09	—	Unverified
2	NNI Choi(for the development set)	Cnxe	5.89	—	Unverified
3	NTU rnn (eval)	Cnxe	2.01	—	Unverified
4	NTU dtw (eval)	Cnxe	2.01	—	Unverified
5	NTU dtw (dev)	Cnxe	2.01	—	Unverified
6	NTU rnn (dev)	Cnxe	2.01	—	Unverified
7	ELiRF SDTW (eval)	Cnxe	1.19	—	Unverified
8	ELiRF SDTW-avg (eval)	Cnxe	1.07	—	Unverified
9	ELiRF SDTW (dev)	Cnxe	1.07	—	Unverified
10	CUNY [Subseq+MFCC] (eval)	Cnxe	1.07	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	WaveFormer	Google Speech Commands V2 12	98.8	—	Unverified
2	QNN	Google Speech Commands V2 35	98.6	—	Unverified
3	TripletLoss-res15	Google Speech Commands V1 12	98.56	—	Unverified
4	M2D	Google Speech Commands V2 35	98.5	—	Unverified
5	EAT-S	Google Speech Commands V2 35	98.15	—	Unverified
6	Audio Spectrogram Transformer	Google Speech Commands V2 35	98.11	—	Unverified
7	EdgeCRNN 2.0×	Google Speech Commands V2 12	98.05	—	Unverified
8	BC-ResNet-8	Google Speech Commands V1 12	98	—	Unverified
9	HTS-AT	Google Speech Commands V2 35	98	—	Unverified
10	Wav2KWS	Google Speech Commands V1 12	97.9	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Stacked 1D CNN	Error Rate	1.99	—	Unverified
2	End-to-end DNN-HMM	Error Rate	1.7	—	Unverified
3	HEiMDaL	Error Rate	0.45	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Res26	Accuracy	95.88	—	Unverified
2	EfficientNet-A0 + SA + TL	Accuracy	95.83	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	QuaternionNeuralNetwork	Accuracy (10-fold)	98.53	—	Unverified
2	SSAMBA	Accuracy (10-fold)	97.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TensorFlow's model version 2	TFMA	89.7	—	Unverified
2	TensorFlow's model version 1	TFMA	85.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	2D-ConvNet	Accuracy (%)	95.4	—	Unverified
2	1D-ConvNet	Accuracy (%)	93.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Quaternion Neural Networks	Accuracy(10-fold)	98.53	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MicroNet-KWS-L	Accuracy	95.3	—	Unverified