SOTAVerified

Keyword Spotting

In speech processing, keyword spotting deals with the identification of keywords in utterances.

( Image credit: Simon Grest )

Papers

Showing 101150 of 407 papers

TitleStatusHype
Federated Learning for Keyword SpottingCode0
Distilled Non-Semantic Speech Embeddings with Binary Neural Networks for Low-Resource DevicesCode0
Let SSMs be ConvNets: State-space Modeling with Optimal Tensor ContractionsCode0
Keyword Spotting Simplified: A Segmentation-Free Approach using Character Counting and CTC re-scoringCode0
JavaScript Convolutional Neural Networks for Keyword Spotting in the Browser: An Experimental AnalysisCode0
Integrated Parameter-Efficient Tuning for General-Purpose Audio ModelsCode0
Keyword localisation in untranscribed speech using visually grounded speech modelsCode0
Boosting keyword spotting through on-device learnable user speech characteristicsCode0
Indian EmoSpeech Command Dataset: A dataset for emotion based speech recognition in the wildCode0
Evaluating Sequence-to-Sequence Models for Handwritten Text RecognitionCode0
Honk: A PyTorch Reimplementation of Convolutional Neural Networks for Keyword SpottingCode0
GTM-UVigo Systems for the Query-by-Example Search on Speech Task at MediaEval 2015Code0
Hello Edge: Keyword Spotting on MicrocontrollersCode0
Honkling: In-Browser Personalization for Ubiquitous Keyword SpottingCode0
Masked Modeling Duo: Learning Representations by Encouraging Both Networks to Model the InputCode0
End-to-end Keyword Spotting using Neural Architecture Search and Quantization0
Behavior of Keyword Spotting Networks Under Noisy Conditions0
An Alternative Deep Feature Approach to Line Level Keyword Spotting0
Employing Phonetic Speech Recognition for Language and Dialect Specific Search0
EmoAttack: Utilizing Emotional Voice Conversion for Speech Backdoor Attacks on Deep Speech Classification Models0
ELiRF at MediaEval 2015: Query by Example Search on Speech Task (QUESST)0
ELiRF at MediaEval 2014: Query by Example Search on Speech Task (QUESST)0
Encoder-Decoder Neural Architecture Optimization for Keyword Spotting0
BBS-KWS:The Mandarin Keyword Spotting System Won the Video Keyword Wakeup Challenge0
AnalogNets: ML-HW Co-Design of Noise-robust TinyML Models and Always-On Analog Compute-in-Memory Accelerator0
Efficient keyword spotting using time delay neural networks0
Automating speech reception threshold measurements using automatic speech recognition0
Automatic Speech Recognition for Humanitarian Applications in Somali0
A Multi-oriented Chinese Keyword Spotter Guided by Text Line Detection0
Efficient dynamic filter for robust and low computational feature extraction0
Efficient Continual Learning in Keyword Spotting using Binary Neural Networks0
Automatic Extraction of News Values from Headline Text0
Effective Integration of KAN for Keyword Spotting0
Effective Combination of DenseNet andBiLSTM for Keyword Spotting0
AutoKWS: Keyword Spotting with Differentiable Architecture Search0
A Monaural Speech Enhancement Method for Robust Small-Footprint Keyword Spotting0
Adaptive Speech Understanding for Intuitive Model-based Spoken Dialogues0
AB/BA analysis: A framework for estimating keyword spotting recall improvement while maintaining audio privacy0
EdgeCRNN: an edgecomputing oriented model of acoustic feature enhancement for keyword spotting0
Dynamic curriculum learning via data parameters for noise robust keyword spotting0
Always-On, Sub-300-nW, Event-Driven Spiking Neural Network based on Spike-Driven Clock-Generation and Clock- and Power-Gating for an Ultra-Low-Power Intelligent Device0
Dummy Prototypical Networks for Few-Shot Open-Set Keyword Spotting0
DONUT: CTC-based Query-by-Example Keyword Spotting0
Does Single-channel Speech Enhancement Improve Keyword Spotting Accuracy? A Case Study0
AUC Optimization for Robust Small-footprint Keyword Spotting with Limited Training Data0
A Literature Review of Keyword Spotting Technologies for Urdu0
Adaptive Noise Resilient Keyword Spotting Using One-Shot Learning0
Disentangled Training with Adversarial Examples For Robust Small-footprint Keyword Spotting0
Discriminatory and orthogonal feature learning for noise robust keyword spotting0
Developing Far-Field Speaker System Via Teacher-Student Learning0
Show:102550
← PrevPage 3 of 9Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1NNI non-filtered(for the development set)Cnxe6.09Unverified
2NNI Choi(for the development set)Cnxe5.89Unverified
3NTU rnn (eval)Cnxe2.01Unverified
4NTU dtw (eval)Cnxe2.01Unverified
5NTU dtw (dev)Cnxe2.01Unverified
6NTU rnn (dev)Cnxe2.01Unverified
7ELiRF SDTW (eval)Cnxe1.19Unverified
8ELiRF SDTW-avg (eval)Cnxe1.07Unverified
9ELiRF SDTW (dev)Cnxe1.07Unverified
10CUNY [Subseq+MFCC] (eval)Cnxe1.07Unverified
#ModelMetricClaimedVerifiedStatus
1WaveFormerGoogle Speech Commands V2 1298.8Unverified
2QNNGoogle Speech Commands V2 3598.6Unverified
3TripletLoss-res15Google Speech Commands V1 1298.56Unverified
4M2DGoogle Speech Commands V2 3598.5Unverified
5EAT-SGoogle Speech Commands V2 3598.15Unverified
6Audio Spectrogram TransformerGoogle Speech Commands V2 3598.11Unverified
7EdgeCRNN 2.0×Google Speech Commands V2 1298.05Unverified
8BC-ResNet-8Google Speech Commands V1 1298Unverified
9HTS-ATGoogle Speech Commands V2 3598Unverified
10Wav2KWSGoogle Speech Commands V1 1297.9Unverified
#ModelMetricClaimedVerifiedStatus
1Stacked 1D CNNError Rate1.99Unverified
2End-to-end DNN-HMMError Rate1.7Unverified
3HEiMDaLError Rate0.45Unverified
#ModelMetricClaimedVerifiedStatus
1Res26Accuracy95.88Unverified
2EfficientNet-A0 + SA + TLAccuracy95.83Unverified
#ModelMetricClaimedVerifiedStatus
1QuaternionNeuralNetworkAccuracy (10-fold)98.53Unverified
2SSAMBAAccuracy (10-fold)97.4Unverified
#ModelMetricClaimedVerifiedStatus
1TensorFlow's model version 2TFMA89.7Unverified
2TensorFlow's model version 1TFMA85.4Unverified
#ModelMetricClaimedVerifiedStatus
12D-ConvNetAccuracy (%)95.4Unverified
21D-ConvNetAccuracy (%)93.7Unverified
#ModelMetricClaimedVerifiedStatus
1Quaternion Neural NetworksAccuracy(10-fold)98.53Unverified
#ModelMetricClaimedVerifiedStatus
1MicroNet-KWS-LAccuracy95.3Unverified