SOTAVerified

Keyword Spotting

In speech processing, keyword spotting deals with the identification of keywords in utterances.

( Image credit: Simon Grest )

Papers

Showing 151175 of 407 papers

TitleStatusHype
Metric Learning for User-defined Keyword Spotting0
WeKws: A production first small-footprint end-to-end Keyword Spotting ToolkitCode2
Application of Knowledge Distillation to Multi-task Speech Representation Learning0
HEiMDaL: Highly Efficient Method for Detection and Localization of wake-words0
Masked Modeling Duo: Learning Representations by Encouraging Both Networks to Model the InputCode0
Discriminatory and orthogonal feature learning for noise robust keyword spotting0
Fully Unsupervised Training of Few-shot Keyword Spotting0
Split Federated Learning on Micro-controllers: A Keyword Spotting Showcase0
Improving Label-Deficient Keyword Spotting Through Self-Supervised PretrainingCode1
Recycle Your Wav2Vec2 Codebook: A Speech Perceiver for Keyword Spotting0
SiDi KWS: A Large-Scale Multilingual Dataset for Keyword SpottingCode1
A Few Shot Multi-Representation Approach for N-gram Spotting in Historical Manuscripts0
IndicSUPERB: A Speech Processing Universal Performance Benchmark for Indian languagesCode1
ReckOn: A 28nm Sub-mm2 Task-Agnostic Spiking Recurrent Neural Network Processor Enabling On-Chip Learning over Second-Long Timescales0
An Anchor-Free Detector for Continuous Speech Keyword Spotting0
Keyword Spotting System and Evaluation of Pruning and Quantization Methods on Low-power Edge MicrocontrollersCode1
T-RECX: Tiny-Resource Efficient Convolutional neural networks with early-eXit0
Sub 8-Bit Quantization of Streaming Keyword Spotting Models for Embedded Chipsets0
Wakeword Detection under Distribution Shifts0
Distilled Non-Semantic Speech Embeddings with Binary Neural Networks for Low-Resource DevicesCode0
Learning Audio-Text Agreement for Open-vocabulary Keyword SpottingCode1
Dummy Prototypical Networks for Few-Shot Open-Set Keyword Spotting0
Personalized Keyword Spotting through Multi-task Learning0
Challenges and Opportunities in Multi-device Speech Processing0
QbyE-MLPMixer: Query-by-Example Open-Vocabulary Keyword Spotting using MLPMixer0
Show:102550
← PrevPage 7 of 17Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1NNI non-filtered(for the development set)Cnxe6.09Unverified
2NNI Choi(for the development set)Cnxe5.89Unverified
3NTU rnn (eval)Cnxe2.01Unverified
4NTU dtw (eval)Cnxe2.01Unverified
5NTU dtw (dev)Cnxe2.01Unverified
6NTU rnn (dev)Cnxe2.01Unverified
7ELiRF SDTW (eval)Cnxe1.19Unverified
8ELiRF SDTW-avg (eval)Cnxe1.07Unverified
9ELiRF SDTW (dev)Cnxe1.07Unverified
10CUNY [Subseq+MFCC] (eval)Cnxe1.07Unverified
#ModelMetricClaimedVerifiedStatus
1WaveFormerGoogle Speech Commands V2 1298.8Unverified
2QNNGoogle Speech Commands V2 3598.6Unverified
3TripletLoss-res15Google Speech Commands V1 1298.56Unverified
4M2DGoogle Speech Commands V2 3598.5Unverified
5EAT-SGoogle Speech Commands V2 3598.15Unverified
6Audio Spectrogram TransformerGoogle Speech Commands V2 3598.11Unverified
7EdgeCRNN 2.0×Google Speech Commands V2 1298.05Unverified
8BC-ResNet-8Google Speech Commands V1 1298Unverified
9HTS-ATGoogle Speech Commands V2 3598Unverified
10Wav2KWSGoogle Speech Commands V1 1297.9Unverified
#ModelMetricClaimedVerifiedStatus
1Stacked 1D CNNError Rate1.99Unverified
2End-to-end DNN-HMMError Rate1.7Unverified
3HEiMDaLError Rate0.45Unverified
#ModelMetricClaimedVerifiedStatus
1Res26Accuracy95.88Unverified
2EfficientNet-A0 + SA + TLAccuracy95.83Unverified
#ModelMetricClaimedVerifiedStatus
1QuaternionNeuralNetworkAccuracy (10-fold)98.53Unverified
2SSAMBAAccuracy (10-fold)97.4Unverified
#ModelMetricClaimedVerifiedStatus
1TensorFlow's model version 2TFMA89.7Unverified
2TensorFlow's model version 1TFMA85.4Unverified
#ModelMetricClaimedVerifiedStatus
12D-ConvNetAccuracy (%)95.4Unverified
21D-ConvNetAccuracy (%)93.7Unverified
#ModelMetricClaimedVerifiedStatus
1Quaternion Neural NetworksAccuracy(10-fold)98.53Unverified
#ModelMetricClaimedVerifiedStatus
1MicroNet-KWS-LAccuracy95.3Unverified