SOTAVerified

Keyword Spotting

In speech processing, keyword spotting deals with the identification of keywords in utterances.

( Image credit: Simon Grest )

Papers

Showing 226250 of 407 papers

TitleStatusHype
Small-Footprint Open-Vocabulary Keyword Spotting with Quantized LSTM Networks0
Small-footprint slimmable networks for keyword spotting0
Speech and language technologies for the automatic monitoring and training of cognitive functions0
Speech Augmentation Based Unsupervised Learning for Keyword Spotting0
Speech Enhancement for Wake-Up-Word detection in Voice Assistants0
Speech-MLP: a simple MLP architecture for speech processing0
Speech Privacy Leakage from Shared Gradients in Distributed Learning0
Speech Recognition: Keyword Spotting Through Image Recognition0
Speech Unlearning0
SpeechYOLO: Detection and Localization of Speech Objects0
Spiking-LEAF: A Learnable Auditory front-end for Spiking Neural Networks0
Split Federated Learning on Micro-controllers: A Keyword Spotting Showcase0
Spoken Language Identification using ConvNets0
Spot keywords from very noisy and mixed speech0
ST-KeyS: Self-Supervised Transformer for Keyword Spotting in Historical Handwritten Documents0
Streaming Small-Footprint Keyword Spotting using Sequence-to-Sequence Models0
Streaming Voice Query Recognition using Causal Convolutional Recurrent Neural Networks0
Structured Transforms for Small-Footprint Deep Learning0
以音韻屬性偵測擷取對話語音關鍵詞之研究 (Study on Keyword Spotting using Prosodic Attribute Detection for Conversational Speech) [In Chinese]0
Sub 8-Bit Quantization of Streaming Keyword Spotting Models for Embedded Chipsets0
SubSpectral Normalization for Neural Audio Data Processing0
Synth4Kws: Synthesized Speech for User Defined Keyword Spotting in Low Resource Environments0
Teaching keyword spotters to spot new keywords with limited examples0
Temporal Knowledge Distillation for On-device Audio Classification0
Ternary Hybrid Neural-Tree Networks for Highly Constrained IoT Applications0
Show:102550
← PrevPage 10 of 17Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1NNI non-filtered(for the development set)Cnxe6.09Unverified
2NNI Choi(for the development set)Cnxe5.89Unverified
3NTU rnn (eval)Cnxe2.01Unverified
4NTU dtw (eval)Cnxe2.01Unverified
5NTU dtw (dev)Cnxe2.01Unverified
6NTU rnn (dev)Cnxe2.01Unverified
7ELiRF SDTW (eval)Cnxe1.19Unverified
8ELiRF SDTW-avg (eval)Cnxe1.07Unverified
9ELiRF SDTW (dev)Cnxe1.07Unverified
10CUNY [Subseq+MFCC] (eval)Cnxe1.07Unverified
#ModelMetricClaimedVerifiedStatus
1WaveFormerGoogle Speech Commands V2 1298.8Unverified
2QNNGoogle Speech Commands V2 3598.6Unverified
3TripletLoss-res15Google Speech Commands V1 1298.56Unverified
4M2DGoogle Speech Commands V2 3598.5Unverified
5EAT-SGoogle Speech Commands V2 3598.15Unverified
6Audio Spectrogram TransformerGoogle Speech Commands V2 3598.11Unverified
7EdgeCRNN 2.0×Google Speech Commands V2 1298.05Unverified
8BC-ResNet-8Google Speech Commands V1 1298Unverified
9HTS-ATGoogle Speech Commands V2 3598Unverified
10Wav2KWSGoogle Speech Commands V1 1297.9Unverified
#ModelMetricClaimedVerifiedStatus
1Stacked 1D CNNError Rate1.99Unverified
2End-to-end DNN-HMMError Rate1.7Unverified
3HEiMDaLError Rate0.45Unverified
#ModelMetricClaimedVerifiedStatus
1Res26Accuracy95.88Unverified
2EfficientNet-A0 + SA + TLAccuracy95.83Unverified
#ModelMetricClaimedVerifiedStatus
1QuaternionNeuralNetworkAccuracy (10-fold)98.53Unverified
2SSAMBAAccuracy (10-fold)97.4Unverified
#ModelMetricClaimedVerifiedStatus
1TensorFlow's model version 2TFMA89.7Unverified
2TensorFlow's model version 1TFMA85.4Unverified
#ModelMetricClaimedVerifiedStatus
12D-ConvNetAccuracy (%)95.4Unverified
21D-ConvNetAccuracy (%)93.7Unverified
#ModelMetricClaimedVerifiedStatus
1Quaternion Neural NetworksAccuracy(10-fold)98.53Unverified
#ModelMetricClaimedVerifiedStatus
1MicroNet-KWS-LAccuracy95.3Unverified