SOTAVerified

Keyword Spotting

In speech processing, keyword spotting deals with the identification of keywords in utterances.

( Image credit: Simon Grest )

Papers

Showing 201250 of 407 papers

TitleStatusHype
T-RECX: Tiny-Resource Efficient Convolutional neural networks with early-eXit0
Wakeword Detection under Distribution Shifts0
Sub 8-Bit Quantization of Streaming Keyword Spotting Models for Embedded Chipsets0
Distilled Non-Semantic Speech Embeddings with Binary Neural Networks for Low-Resource DevicesCode0
Dummy Prototypical Networks for Few-Shot Open-Set Keyword Spotting0
Personalized Keyword Spotting through Multi-task Learning0
Challenges and Opportunities in Multi-device Speech Processing0
Open-source FPGA-ML codesign for the MLPerf Tiny BenchmarkCode0
QbyE-MLPMixer: Query-by-Example Open-Vocabulary Keyword Spotting using MLPMixer0
0/1 Deep Neural Networks via Block Coordinate Descent0
Avoid Overfitting User Specific Information in Federated Keyword SpottingCode0
Latency Control for Keyword Spotting0
FEL: High Capacity Learning for Recommendation and Ranking via Federated Ensemble Learning0
Continuous-Time Analog Filters for Audio Edge Intelligence: Review on Circuit Designs0
Speech Augmentation Based Unsupervised Learning for Keyword Spotting0
Boosting Tail Neural Network for Realtime Custom Keyword Spotting0
A 14uJ/Decision Keyword Spotting Accelerator with In-SRAM-Computing and On Chip Learning for Customization0
Efficient dynamic filter for robust and low computational feature extraction0
PSCNN: A 885.86 TOPS/W Programmable SRAM-based Computing-In-Memory Processor for Keyword Spotting0
Improving Feature Generalizability with Multitask Learning in Class Incremental Learning0
Depth Pruning with Auxiliary Networks for TinyMLCode0
AB/BA analysis: A framework for estimating keyword spotting recall improvement while maintaining audio privacy0
Production federated keyword spotting via distillation, filtering, and joint federated-centralized training0
On the Efficiency of Integrating Self-supervised Learning and Meta-learning for User-defined Few-shot Keyword Spotting0
Learning Decoupling Features Through Orthogonality Regularization0
Filler Word Detection and Classification: A Dataset and BenchmarkCode0
Delta Keyword Transformer: Bringing Transformers to the Edge through Dynamically Pruned Multi-Head Self-Attention0
A Fast Network Exploration Strategy to Profile Low Energy Consumption for Keyword Spotting0
Keyword localisation in untranscribed speech using visually grounded speech modelsCode0
Tiny, always-on and fragile: Bias propagation through design choices in on-device machine learning workflowsCode0
ImportantAug: a data augmentation agent for speechCode0
BBS-KWS:The Mandarin Keyword Spotting System Won the Video Keyword Wakeup Challenge0
Phone Based Keyword Spotting for Transcribing Very Low Resource Languages0
Deep Spoken Keyword Spotting: An Overview0
Implicit Acoustic Echo Cancellation for Keyword Spotting and Device-Directed Speech Detection0
AnalogNets: ML-HW Co-Design of Noise-robust TinyML Models and Always-On Analog Compute-in-Memory Accelerator0
Ultra-Low Power Keyword Spotting at the Edge0
WaveSense: Efficient Temporal Convolutions with Spiking Neural Networks for Keyword Spotting0
Temporal Knowledge Distillation for On-device Audio Classification0
End-to-end Keyword Spotting using Xception-1dCode0
PEAF: Learnable Power Efficient Analog Acoustic Features for Audio Recognition0
Multi-task Voice Activated Framework using Self-supervised Learning0
Speech-MLP: a simple MLP architecture for speech processing0
Towards Robust Domain Generalization in 2D Neural Audio Processing0
A Lightweight dynamic filter for keyword spotting0
Audiomer: A Convolutional Transformer For Keyword SpottingCode0
Behavior of Keyword Spotting Networks Under Noisy Conditions0
Keyword spotting for audiovisual archival search in Uralic languages0
Feature learning for efficient ASR-free keyword spotting in low-resource languages0
Text Anchor Based Metric Learning for Small-footprint Keyword Spotting0
Show:102550
← PrevPage 5 of 9Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1NNI non-filtered(for the development set)Cnxe6.09Unverified
2NNI Choi(for the development set)Cnxe5.89Unverified
3NTU rnn (eval)Cnxe2.01Unverified
4NTU dtw (eval)Cnxe2.01Unverified
5NTU dtw (dev)Cnxe2.01Unverified
6NTU rnn (dev)Cnxe2.01Unverified
7ELiRF SDTW (eval)Cnxe1.19Unverified
8ELiRF SDTW-avg (eval)Cnxe1.07Unverified
9ELiRF SDTW (dev)Cnxe1.07Unverified
10CUNY [Subseq+MFCC] (eval)Cnxe1.07Unverified
#ModelMetricClaimedVerifiedStatus
1WaveFormerGoogle Speech Commands V2 1298.8Unverified
2QNNGoogle Speech Commands V2 3598.6Unverified
3TripletLoss-res15Google Speech Commands V1 1298.56Unverified
4M2DGoogle Speech Commands V2 3598.5Unverified
5EAT-SGoogle Speech Commands V2 3598.15Unverified
6Audio Spectrogram TransformerGoogle Speech Commands V2 3598.11Unverified
7EdgeCRNN 2.0×Google Speech Commands V2 1298.05Unverified
8BC-ResNet-8Google Speech Commands V1 1298Unverified
9HTS-ATGoogle Speech Commands V2 3598Unverified
10Wav2KWSGoogle Speech Commands V1 1297.9Unverified
#ModelMetricClaimedVerifiedStatus
1Stacked 1D CNNError Rate1.99Unverified
2End-to-end DNN-HMMError Rate1.7Unverified
3HEiMDaLError Rate0.45Unverified
#ModelMetricClaimedVerifiedStatus
1Res26Accuracy95.88Unverified
2EfficientNet-A0 + SA + TLAccuracy95.83Unverified
#ModelMetricClaimedVerifiedStatus
1QuaternionNeuralNetworkAccuracy (10-fold)98.53Unverified
2SSAMBAAccuracy (10-fold)97.4Unverified
#ModelMetricClaimedVerifiedStatus
1TensorFlow's model version 2TFMA89.7Unverified
2TensorFlow's model version 1TFMA85.4Unverified
#ModelMetricClaimedVerifiedStatus
12D-ConvNetAccuracy (%)95.4Unverified
21D-ConvNetAccuracy (%)93.7Unverified
#ModelMetricClaimedVerifiedStatus
1Quaternion Neural NetworksAccuracy(10-fold)98.53Unverified
#ModelMetricClaimedVerifiedStatus
1MicroNet-KWS-LAccuracy95.3Unverified