SOTAVerified

Keyword Spotting

In speech processing, keyword spotting deals with the identification of keywords in utterances.

( Image credit: Simon Grest )

Papers

Showing 251300 of 407 papers

TitleStatusHype
WaveSense: Efficient Temporal Convolutions with Spiking Neural Networks for Keyword Spotting0
WCTC-Biasing: Retraining-free Contextual Biasing ASR with Wildcard CTC-based Keyword Spotting and Inter-layer Biasing0
Weight-importance sparse training in keyword spotting0
Word Searching in Scene Image and Video Frame in Multi-Script Scenario using Dynamic Shape Coding0
Work in Progress: Linear Transformers for TinyML0
WSRNet: Joint Spotting and Recognition of Handwritten Words0
Harnessing the Power of Explanations for Incremental Training: A LIME-Based Approach0
Zero-Shot Federated Learning with New Classes for Audio Classification0
Zero-Shot Temporal Resolution Domain Adaptation for Spiking Neural Networks0
0/1 Deep Neural Networks via Block Coordinate Descent0
A Lightweight dynamic filter for keyword spotting0
LLM-Synth4KWS: Scalable Automatic Generation and Synthesis of Confusable Data for Custom Keyword Spotting0
Locale Encoding For Scalable Multilingual Keyword Spotting Models0
Low-bit quantization and quantization-aware training for small-footprint keyword spotting0
Low-Power Low-Latency Keyword Spotting and Adaptive Control with a SpiNNaker 2 Prototype and Comparison with Loihi0
Low-resource keyword spotting using contrastively trained transformer acoustic word embeddings0
Matching Latent Encoding for Audio-Text based Keyword Spotting0
Maximum-Entropy Adversarial Audio Augmentation for Keyword Spotting0
Max-Pooling Loss Training of Long Short-Term Memory Networks for Small-Footprint Keyword Spotting0
Meta-Learning for improving rare word recognition in end-to-end ASR0
Metric Learning for Keyword Spotting0
Metric Learning for User-defined Keyword Spotting0
Micro-power spoken keyword spotting on Xylo Audio 20
Modular approach to data preprocessing in ALOHA and application to a smart industry use case0
More than words: Advancements and challenges in speech recognition for singing0
Morphological Segmentation for Keyword Spotting0
Multi-layer Attention Mechanism for Speech Keyword Recognition0
Multilingual acoustic word embeddings for zero-resource languages0
Multilingual Query-by-Example Keyword Spotting with Metric Learning and Phoneme-to-Embedding Mapping0
Multimodal Laryngoscopic Video Analysis for Assisted Diagnosis of Vocal Fold Paralysis0
Multiple-Instance, Cascaded Classification for Keyword Spotting in Narrow-Band Audio0
Multi-Sample Dynamic Time Warping for Few-Shot Keyword Spotting0
Multitaper mel-spectrograms for keyword spotting0
Multi-task Learning with Cross Attention for Keyword Spotting0
Multi-Task Network for Noise-Robust Keyword Spotting and Speaker Verification using CTC-based Soft VAD and Global Query Attention0
Multi-task Voice Activated Framework using Self-supervised Learning0
Domain Aware Training for Far-field Small-footprint Keyword Spotting0
Neural Architecture Search For Keyword Spotting0
Neural Morphological Analysis: Encoding-Decoding Canonical Segments0
Neural Networks for Keyword Spotting on IoT Devices0
Noise-Agnostic Multitask Whisper Training for Reducing False Alarm Errors in Call-for-Help Detection0
Noise-Robust Hearing Aid Voice Control0
Noisy student-teacher training for robust keyword spotting0
NTC-KWS: Noise-aware CTC for Robust Keyword Spotting0
NTU System at MediaEval 2015: Zero Resource Query by Example Spoken Term Detection Using Deep and Recurrent Neural Networks0
On-Device Constrained Self-Supervised Speech Representation Learning for Keyword Spotting via Knowledge Distillation0
On-Device Domain Learning for Keyword Spotting on Low-Power Extreme Edge Embedded Systems0
On evaluating CNN representations for low resource medical image classification0
Online Keyword Spotting with a Character-Level Recurrent Neural Network0
On the Efficiency of Integrating Self-supervised Learning and Meta-learning for User-defined Few-shot Keyword Spotting0
Show:102550
← PrevPage 6 of 9Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1NNI non-filtered(for the development set)Cnxe6.09Unverified
2NNI Choi(for the development set)Cnxe5.89Unverified
3NTU rnn (eval)Cnxe2.01Unverified
4NTU dtw (eval)Cnxe2.01Unverified
5NTU dtw (dev)Cnxe2.01Unverified
6NTU rnn (dev)Cnxe2.01Unverified
7ELiRF SDTW (eval)Cnxe1.19Unverified
8ELiRF SDTW-avg (eval)Cnxe1.07Unverified
9ELiRF SDTW (dev)Cnxe1.07Unverified
10CUNY [Subseq+MFCC] (eval)Cnxe1.07Unverified
#ModelMetricClaimedVerifiedStatus
1WaveFormerGoogle Speech Commands V2 1298.8Unverified
2QNNGoogle Speech Commands V2 3598.6Unverified
3TripletLoss-res15Google Speech Commands V1 1298.56Unverified
4M2DGoogle Speech Commands V2 3598.5Unverified
5EAT-SGoogle Speech Commands V2 3598.15Unverified
6Audio Spectrogram TransformerGoogle Speech Commands V2 3598.11Unverified
7EdgeCRNN 2.0×Google Speech Commands V2 1298.05Unverified
8BC-ResNet-8Google Speech Commands V1 1298Unverified
9HTS-ATGoogle Speech Commands V2 3598Unverified
10Wav2KWSGoogle Speech Commands V1 1297.9Unverified
#ModelMetricClaimedVerifiedStatus
1Stacked 1D CNNError Rate1.99Unverified
2End-to-end DNN-HMMError Rate1.7Unverified
3HEiMDaLError Rate0.45Unverified
#ModelMetricClaimedVerifiedStatus
1Res26Accuracy95.88Unverified
2EfficientNet-A0 + SA + TLAccuracy95.83Unverified
#ModelMetricClaimedVerifiedStatus
1QuaternionNeuralNetworkAccuracy (10-fold)98.53Unverified
2SSAMBAAccuracy (10-fold)97.4Unverified
#ModelMetricClaimedVerifiedStatus
1TensorFlow's model version 2TFMA89.7Unverified
2TensorFlow's model version 1TFMA85.4Unverified
#ModelMetricClaimedVerifiedStatus
12D-ConvNetAccuracy (%)95.4Unverified
21D-ConvNetAccuracy (%)93.7Unverified
#ModelMetricClaimedVerifiedStatus
1Quaternion Neural NetworksAccuracy(10-fold)98.53Unverified
#ModelMetricClaimedVerifiedStatus
1MicroNet-KWS-LAccuracy95.3Unverified