SOTAVerified

Keyword Spotting

In speech processing, keyword spotting deals with the identification of keywords in utterances.

(Image credit: Simon Grest)

Papers

Showing 101–150 of 407 papers

| Title | Status | Hype |
| --- | --- | --- |
| Disentangled Training with Adversarial Examples For Robust Small-footprint Keyword Spotting | | 0 |
| Adversarial training of Keyword Spotting to Minimize TTS Data Overfitting | | 0 |
| Convexity-based Pruning of Speech Representation Models | | 0 |
| Neuromorphic Keyword Spotting with Pulse Density Modulation MEMS Microphones | Code | 0 |
| Bridging the Gap between Audio and Text using Parallel-attention for User-defined Keyword Spotting | | 0 |
| Frequency & Channel Attention Network for Small Footprint Noisy Spoken Keyword Spotting | | 0 |
| Utilizing TTS Synthesized Data for Efficient Development of Keyword Spotting Model | | 0 |
| The Role of Temporal Hierarchy in Spiking Neural Networks | | 0 |
| Synth4Kws: Synthesized Speech for User Defined Keyword Spotting in Low Resource Environments | | 0 |
| KWT-Tiny: RISC-V Accelerated, Embedded Keyword Spotting Transformer | | 0 |
| Exploring the Boundaries of On-Device Inference: When Tiny Falls Short, Go Hierarchical | | 0 |
| Learning Delays Through Gradients and Structure: Emergence of Spatiotemporal Patterns in Spiking Neural Networks | Code | 0 |
| Multitaper mel-spectrograms for keyword spotting | | 0 |
| Advancing Airport Tower Command Recognition: Integrating Squeeze-and-Excitation and Broadcasted Residual Learning | | 0 |
| Micro-power spoken keyword spotting on Xylo Audio 2 | | 0 |
| DASB -- Discrete Audio and Speech Benchmark | | 0 |
| Global-Local Convolution with Spiking Neural Networks for Energy-efficient Keyword Spotting | | 0 |
| CTC-aligned Audio-Text Embedding for Streaming Open-vocabulary Keyword Spotting | | 0 |
| Relational Proxy Loss for Audio-Text based Keyword Spotting | | 0 |
| Keyword-Guided Adaptation of Automatic Speech Recognition | | 0 |
| RepCNN: Micro-sized, Mighty Models for Wakeword Detection | | 0 |
| TinySV: Speaker Verification in TinyML with On-device Learning | | 0 |
| End-to-End User-Defined Keyword Spotting using Shifted Delta Coefficients | | 0 |
| Towards Contactless Elevators with TinyML using CNN-based Person Detection and Keyword Spotting | | 0 |
| DeltaKWS: A 65nm 36nJ/Decision Bio-inspired Temporal-Sparsity-Aware Digital Keyword Spotting IC with 0.6V Near-Threshold SRAM | | 0 |
| Multi-Sample Dynamic Time Warping for Few-Shot Keyword Spotting | | 0 |
| What is Learnt by the LEArnable Front-end (LEAF)? Adapting Per-Channel Energy Normalisation (PCEN) to Noisy Conditions | Code | 0 |
| Noise-Robust Keyword Spotting through Self-supervised Pretraining | Code | 0 |
| Work in Progress: Linear Transformers for TinyML | | 0 |
| More than words: Advancements and challenges in speech recognition for singing | | 0 |
| On-Device Domain Learning for Keyword Spotting on Low-Power Extreme Edge Embedded Systems | | 0 |
| Boosting keyword spotting through on-device learnable user speech characteristics | Code | 0 |
| A Closer Look at Wav2Vec2 Embeddings for On-Device Single-Channel Speech Enhancement | | 0 |
| Multilingual acoustic word embeddings for zero-resource languages | | 0 |
| Contrastive Learning With Audio Discrimination For Customizable Keyword Spotting In Continuous Speech | | 0 |
| Maximum-Entropy Adversarial Audio Augmentation for Keyword Spotting | | 0 |
| U2-KWS: Unified Two-pass Open-vocabulary Keyword Spotting with Keyword Bias | | 0 |
| Keyword spotting -- Detecting commands in speech using deep learning | | 0 |
| Personalizing Keyword Spotting with Speaker Information | | 0 |
| ed-cec: improving rare word recognition using asr postprocessing based on error detection and context-aware error correction | Code | 0 |
| Does Single-channel Speech Enhancement Improve Keyword Spotting Accuracy? A Case Study | | 0 |
| On the Non-Associativity of Analog Computations | | 0 |
| VIC-KD: Variance-Invariance-Covariance Knowledge Distillation to Make Keyword Spotting More Robust Against Adversarial Attacks | | 0 |
| Cluster-based pruning techniques for audio data | Code | 0 |
| Spiking-LEAF: A Learnable Auditory front-end for Spiking Neural Networks | | 0 |
| A Multitask Training Approach to Enhance Whisper with Contextual Biasing and Open-Vocabulary Keyword Spotting | | 0 |
| Open-vocabulary Keyword-spotting with Adaptive Instance Normalization | | 0 |
| iPhonMatchNet: Zero-Shot User-Defined Keyword Spotting Using Implicit Acoustic Echo Cancellation | | 0 |
| Leveraging Large Language Models for Exploiting ASR Uncertainty | | 0 |
| Understanding Self-Supervised Learning of Speech Representation via Invariance and Redundancy Reduction | | 0 |

Benchmark Results

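| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | NNI non-filtered (for the development set) | Cnxe | 6.09 | | Unverified |
| 2 | NNI Choi (for the development set) | Cnxe | 5.89 | | Unverified |
| 3 | NTU rnn (eval) | Cnxe | 2.01 | | Unverified |
| 4 | NTU dtw (eval) | Cnxe | 2.01 | | Unverified |
| 5 | NTU dtw (dev) | Cnxe | 2.01 | | Unverified |
| 6 | NTU rnn (dev) | Cnxe | 2.01 | | Unverified |
| 7 | ELiRF SDTW (eval) | Cnxe | 1.19 | | Unverified |
| 8 | ELiRF SDTW-avg (eval) | Cnxe | 1.07 | | Unverified |
| 9 | ELiRF SDTW (dev) | Cnxe | 1.07 | | Unverified |
| 10 | CUNY [Subseq+MFCC] (eval) | Cnxe | 1.07 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | WaveFormer | Google Speech Commands V2 12 | 98.8 | | Unverified |
| 2 | QNN | Google Speech Commands V2 35 | 98.6 | | Unverified |
| 3 | TripletLoss-res15 | Google Speech Commands V1 12 | 98.56 | | Unverified |
| 4 | M2D | Google Speech Commands V2 35 | 98.5 | | Unverified |
| 5 | EAT-S | Google Speech Commands V2 35 | 98.15 | | Unverified |
| 6 | Audio Spectrogram Transformer | Google Speech Commands V2 35 | 98.11 | | Unverified |
| 7 | EdgeCRNN 2.0× | Google Speech Commands V2 12 | 98.05 | | Unverified |
| 8 | BC-ResNet-8 | Google Speech Commands V1 12 | 98 | | Unverified |
| 9 | HTS-AT | Google Speech Commands V2 35 | 98 | | Unverified |
| 10 | Wav2KWS | Google Speech Commands V1 12 | 97.9 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | Stacked 1D CNN | Error Rate | 1.99 | | Unverified |
| 2 | End-to-end DNN-HMM | Error Rate | 1.7 | | Unverified |
| 3 | HEiMDaL | Error Rate | 0.45 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | Res26 | Accuracy | 95.88 | | Unverified |
| 2 | EfficientNet-A0 + SA + TL | Accuracy | 95.83 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | QuaternionNeuralNetwork | Accuracy (10-fold) | 98.53 | | Unverified |
| 2 | SSAMBA | Accuracy (10-fold) | 97.4 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | TensorFlow's model version 2 | TFMA | 89.7 | | Unverified |
| 2 | TensorFlow's model version 1 | TFMA | 85.4 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | 2D-ConvNet | Accuracy (%) | 95.4 | | Unverified |
| 2 | 1D-ConvNet | Accuracy (%) | 93.7 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | Quaternion Neural Networks | Accuracy (10-fold) | 98.53 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | MicroNet-KWS-L | Accuracy | 95.3 | | Unverified |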