SOTAVerified

Keyword Spotting

In speech processing, keyword spotting deals with the identification of keywords in utterances.

( Image credit: Simon Grest )

Papers

Showing 301350 of 407 papers

TitleStatusHype
On the Non-Associativity of Analog Computations0
Open-vocabulary Keyword-spotting with Adaptive Instance Normalization0
Optimize what matters: Training DNN-HMM Keyword Spotting Model Using End Metric0
Orthogonality Constrained Multi-Head Attention For Keyword Spotting0
PATE-AAE: Incorporating Adversarial Autoencoder into Private Aggregation of Teacher Ensembles for Spoken Command Classification0
PBSM: Backdoor attack against Keyword spotting based on pitch boosting and sound masking0
Performance-Oriented Neural Architecture Search0
Personalized Keyword Spotting through Multi-task Learning0
Personalizing Keyword Spotting with Speaker Information0
Phone Based Keyword Spotting for Transcribing Very Low Resource Languages0
Phoneme-Level Contrastive Learning for User-Defined Keyword Spotting with Flexible Enrollment0
Plug-and-Play Multilingual Few-shot Spoken Words Recognition0
Polish Read Speech Corpus for Speech Tools and Services0
PEAF: Learnable Power Efficient Analog Acoustic Features for Audio Recognition0
PQK: Model Compression via Pruning, Quantization, and Knowledge Distillation0
Predicting detection filters for small footprint open-vocabulary keyword spotting0
Production federated keyword spotting via distillation, filtering, and joint federated-centralized training0
Proposal-based Few-shot Sound Event Detection for Speech and Environmental Sounds with Perceivers0
Prototype-based Personalized Pruning0
Prototypical Metric Transfer Learning for Continuous Speech Keyword Spotting With Limited Training Data0
PSCNN: A 885.86 TOPS/W Programmable SRAM-based Computing-In-Memory Processor for Keyword Spotting0
QbyE-MLPMixer: Query-by-Example Open-Vocabulary Keyword Spotting using MLPMixer0
Query-by-Example Keyword Spotting system using Multi-head Attention and Softtriple Loss0
Query-by-Example Keyword Spotting Using Spectral-Temporal Graph Attentive Pooling and Multi-Task Learning0
Query-by-example on-device keyword spotting0
ReckOn: A 28nm Sub-mm2 Task-Agnostic Spiking Recurrent Neural Network Processor Enabling On-Chip Learning over Second-Long Timescales0
Recycle Your Wav2Vec2 Codebook: A Speech Perceiver for Keyword Spotting0
Reformulating Information Retrieval from Speech and Text as a Detection Problem0
Relational Proxy Loss for Audio-Text based Keyword Spotting0
RepCNN: Micro-sized, Mighty Models for Wakeword Detection0
Resource-Efficient Neural Architect0
RNNAccel: A Fusion Recurrent Neural Network Accelerator for Edge Intelligence0
Robust Spoken Term Detection Automatically Adjusted for a Given Threshold0
Scalable Weight Reparametrization for Efficient Transfer Learning0
Self-supervised speech representation learning for keyword-spotting with light-weight transformers0
Sequence Discriminative Training for Deep Learning based Acoustic Keyword Spotting0
SLiCK: Exploiting Subsequences for Length-Constrained Keyword Spotting0
Small-footprint Keyword Spotting Using Deep Neural Network and Connectionist Temporal Classifier0
Small-footprint Keyword Spotting with Graph Convolutional Network0
Small-Footprint Open-Vocabulary Keyword Spotting with Quantized LSTM Networks0
Small-footprint slimmable networks for keyword spotting0
Speech and language technologies for the automatic monitoring and training of cognitive functions0
Speech Augmentation Based Unsupervised Learning for Keyword Spotting0
Speech Enhancement for Wake-Up-Word detection in Voice Assistants0
Speech-MLP: a simple MLP architecture for speech processing0
Speech Privacy Leakage from Shared Gradients in Distributed Learning0
Speech Recognition: Keyword Spotting Through Image Recognition0
Speech Unlearning0
SpeechYOLO: Detection and Localization of Speech Objects0
Spiking-LEAF: A Learnable Auditory front-end for Spiking Neural Networks0
Show:102550
← PrevPage 7 of 9Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1NNI non-filtered(for the development set)Cnxe6.09Unverified
2NNI Choi(for the development set)Cnxe5.89Unverified
3NTU rnn (eval)Cnxe2.01Unverified
4NTU dtw (eval)Cnxe2.01Unverified
5NTU dtw (dev)Cnxe2.01Unverified
6NTU rnn (dev)Cnxe2.01Unverified
7ELiRF SDTW (eval)Cnxe1.19Unverified
8ELiRF SDTW-avg (eval)Cnxe1.07Unverified
9ELiRF SDTW (dev)Cnxe1.07Unverified
10CUNY [Subseq+MFCC] (eval)Cnxe1.07Unverified
#ModelMetricClaimedVerifiedStatus
1WaveFormerGoogle Speech Commands V2 1298.8Unverified
2QNNGoogle Speech Commands V2 3598.6Unverified
3TripletLoss-res15Google Speech Commands V1 1298.56Unverified
4M2DGoogle Speech Commands V2 3598.5Unverified
5EAT-SGoogle Speech Commands V2 3598.15Unverified
6Audio Spectrogram TransformerGoogle Speech Commands V2 3598.11Unverified
7EdgeCRNN 2.0×Google Speech Commands V2 1298.05Unverified
8BC-ResNet-8Google Speech Commands V1 1298Unverified
9HTS-ATGoogle Speech Commands V2 3598Unverified
10Wav2KWSGoogle Speech Commands V1 1297.9Unverified
#ModelMetricClaimedVerifiedStatus
1Stacked 1D CNNError Rate1.99Unverified
2End-to-end DNN-HMMError Rate1.7Unverified
3HEiMDaLError Rate0.45Unverified
#ModelMetricClaimedVerifiedStatus
1Res26Accuracy95.88Unverified
2EfficientNet-A0 + SA + TLAccuracy95.83Unverified
#ModelMetricClaimedVerifiedStatus
1QuaternionNeuralNetworkAccuracy (10-fold)98.53Unverified
2SSAMBAAccuracy (10-fold)97.4Unverified
#ModelMetricClaimedVerifiedStatus
1TensorFlow's model version 2TFMA89.7Unverified
2TensorFlow's model version 1TFMA85.4Unverified
#ModelMetricClaimedVerifiedStatus
12D-ConvNetAccuracy (%)95.4Unverified
21D-ConvNetAccuracy (%)93.7Unverified
#ModelMetricClaimedVerifiedStatus
1Quaternion Neural NetworksAccuracy(10-fold)98.53Unverified
#ModelMetricClaimedVerifiedStatus
1MicroNet-KWS-LAccuracy95.3Unverified