SOTAVerified

Keyword Spotting

In speech processing, keyword spotting deals with the identification of keywords in utterances.

( Image credit: Simon Grest )

Papers

Showing 251300 of 407 papers

TitleStatusHype
Keyword Transformer: A Self-Attention Model for Keyword SpottingCode1
Auto-KWS 2021 Challenge: Task, Datasets, and BaselinesCode1
Prototype-based Personalized Pruning0
SubSpectral Normalization for Neural Audio Data Processing0
EdgeCRNN: an edgecomputing oriented model of acoustic feature enhancement for keyword spotting0
An Ultra-low Power RNN Classifier for Always-On Voice Wake-Up Detection Robust to Real-World Scenarios0
The NPU System for the 2020 Personalized Voice Trigger ChallengeCode0
Meta-Learning for improving rare word recognition in end-to-end ASR0
Dynamic curriculum learning via data parameters for noise robust keyword spotting0
Query-by-Example Keyword Spotting system using Multi-head Attention and Softtriple Loss0
Modular approach to data preprocessing in ALOHA and application to a smart industry use case0
Speech Enhancement for Wake-Up-Word detection in Voice Assistants0
Learning Efficient Representations for Keyword Spotting with Triplet LossCode1
Neural Networks for Keyword Spotting on IoT Devices0
A 510-nW Wake-Up Keyword-Spotting Chip Using Serial-FFT-Based MFCC and Binarized Depthwise Separable CNN in 28-nm CMOS0
EfficientNet-Absolute Zero for Continuous Speech Keyword SpottingCode1
Resource-efficient DNNs for Keyword Spotting using Neural Architecture Search and QuantizationCode0
Training Wake Word Detection with Synthesized Speech Data on Confusion Words0
Optimize what matters: Training DNN-HMM Keyword Spotting Model Using End Metric0
Decentralizing Feature Extraction with Quantum Convolutional Neural Network for Automatic Speech RecognitionCode1
RNNAccel: A Fusion Recurrent Neural Network Accelerator for Edge Intelligence0
Deep Convolutional Spiking Neural Networks for Keyword Spotting0
MicroNets: Neural Network Architectures for Deploying TinyML Applications on Commodity MicrocontrollersCode1
Small-Footprint Keyword Spotting with Multi-Scale Temporal ConvolutionCode1
Low-Power Low-Latency Keyword Spotting and Adaptive Control with a SpiNNaker 2 Prototype and Comparison with Loihi0
Hardware Aware Training for Efficient Keyword Spotting on General Purpose and Specialized Hardware0
AutoKWS: Keyword Spotting with Differentiable Architecture Search0
Seeing wake words: Audio-visual Keyword SpottingCode1
Neural Architecture Search For Keyword Spotting0
Howl: A Deployed, Open-Source Wake Word Detection SystemCode1
Learning Graph Edit Distance by Graph Neural Networks0
WSRNet: Joint Spotting and Recognition of Handwritten Words0
Neural ODE with Temporal Convolution and Time Delay Neural Networks for Small-Footprint Keyword SpottingCode0
Few-Shot Keyword Spotting With Prototypical NetworksCode1
BSL-1K: Scaling up co-articulated sign language recognition using mouthing cuesCode1
TERA: Self-Supervised Learning of Transformer Encoder Representation for SpeechCode1
Always-On, Sub-300-nW, Event-Driven Spiking Neural Network based on Spike-Driven Clock-Generation and Clock- and Power-Gating for an Ultra-Low-Power Intelligent Device0
Exploring Filterbank Learning for Keyword Spotting0
Training Keyword Spotting Models on Non-IID Data with Federated Learning0
Metric Learning for Keyword Spotting0
Multi-Task Network for Noise-Robust Keyword Spotting and Speaker Verification using CTC-based Soft VAD and Global Query Attention0
Domain Aware Training for Far-field Small-footprint Keyword Spotting0
Reformulating Information Retrieval from Speech and Text as a Detection Problem0
Small-Footprint Open-Vocabulary Keyword Spotting with Quantized LSTM Networks0
Phoneme Boundary Detection using Learnable Segmental FeaturesCode1
Training Keyword Spotters with Limited and Synthesized Speech DataCode2
Learning To Detect Keyword Parts And Whole By Smoothed Max Pooling0
Performance-Oriented Neural Architecture Search0
A Multi-oriented Chinese Keyword Spotter Guided by Text Line Detection0
Predicting detection filters for small footprint open-vocabulary keyword spotting0
Show:102550
← PrevPage 6 of 9Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1NNI non-filtered(for the development set)Cnxe6.09Unverified
2NNI Choi(for the development set)Cnxe5.89Unverified
3NTU rnn (eval)Cnxe2.01Unverified
4NTU dtw (eval)Cnxe2.01Unverified
5NTU dtw (dev)Cnxe2.01Unverified
6NTU rnn (dev)Cnxe2.01Unverified
7ELiRF SDTW (eval)Cnxe1.19Unverified
8ELiRF SDTW-avg (eval)Cnxe1.07Unverified
9ELiRF SDTW (dev)Cnxe1.07Unverified
10CUNY [Subseq+MFCC] (eval)Cnxe1.07Unverified
#ModelMetricClaimedVerifiedStatus
1WaveFormerGoogle Speech Commands V2 1298.8Unverified
2QNNGoogle Speech Commands V2 3598.6Unverified
3TripletLoss-res15Google Speech Commands V1 1298.56Unverified
4M2DGoogle Speech Commands V2 3598.5Unverified
5EAT-SGoogle Speech Commands V2 3598.15Unverified
6Audio Spectrogram TransformerGoogle Speech Commands V2 3598.11Unverified
7EdgeCRNN 2.0×Google Speech Commands V2 1298.05Unverified
8BC-ResNet-8Google Speech Commands V1 1298Unverified
9HTS-ATGoogle Speech Commands V2 3598Unverified
10Wav2KWSGoogle Speech Commands V1 1297.9Unverified
#ModelMetricClaimedVerifiedStatus
1Stacked 1D CNNError Rate1.99Unverified
2End-to-end DNN-HMMError Rate1.7Unverified
3HEiMDaLError Rate0.45Unverified
#ModelMetricClaimedVerifiedStatus
1Res26Accuracy95.88Unverified
2EfficientNet-A0 + SA + TLAccuracy95.83Unverified
#ModelMetricClaimedVerifiedStatus
1QuaternionNeuralNetworkAccuracy (10-fold)98.53Unverified
2SSAMBAAccuracy (10-fold)97.4Unverified
#ModelMetricClaimedVerifiedStatus
1TensorFlow's model version 2TFMA89.7Unverified
2TensorFlow's model version 1TFMA85.4Unverified
#ModelMetricClaimedVerifiedStatus
12D-ConvNetAccuracy (%)95.4Unverified
21D-ConvNetAccuracy (%)93.7Unverified
#ModelMetricClaimedVerifiedStatus
1Quaternion Neural NetworksAccuracy(10-fold)98.53Unverified
#ModelMetricClaimedVerifiedStatus
1MicroNet-KWS-LAccuracy95.3Unverified