SOTAVerified

Keyword Spotting

In speech processing, keyword spotting deals with the identification of keywords in utterances.

( Image credit: Simon Grest )

Papers

Showing 351400 of 407 papers

TitleStatusHype
Continuous-Time Analog Filters for Audio Edge Intelligence: Review on Circuit Designs0
Contrastive Augmentation: An Unsupervised Learning Approach for Keyword Spotting in Speech Technology0
Contrastive Learning With Audio Discrimination For Customizable Keyword Spotting In Continuous Speech0
Convexity-based Pruning of Speech Representation Models0
Convolutional Recurrent Neural Networks for Small-Footprint Keyword Spotting0
CTC-aligned Audio-Text Embedding for Streaming Open-vocabulary Keyword Spotting0
CUHK System for QUESST Task of MediaEval 20140
CUNY Systems for the Query-by-Example Search on Speech Task at MediaEval 20150
Custom DNN using Reward Modulated Inverted STDP Learning for Temporal Pattern Recognition0
Dark Experience for Incremental Keyword Spotting0
DASB -- Discrete Audio and Speech Benchmark0
Data Augmentation for Robust Keyword Spotting under Playback Interference0
DCCRN-KWS: an audio bias based model for noise robust small-footprint keyword spotting0
Deep Convolutional Spiking Neural Networks for Keyword Spotting0
Deep Spoken Keyword Spotting: An Overview0
Delta Keyword Transformer: Bringing Transformers to the Edge through Dynamically Pruned Multi-Head Self-Attention0
Developing Far-Field Speaker System Via Teacher-Student Learning0
Discriminatory and orthogonal feature learning for noise robust keyword spotting0
Disentangled Training with Adversarial Examples For Robust Small-footprint Keyword Spotting0
Does Single-channel Speech Enhancement Improve Keyword Spotting Accuracy? A Case Study0
DONUT: CTC-based Query-by-Example Keyword Spotting0
Dummy Prototypical Networks for Few-Shot Open-Set Keyword Spotting0
Dynamic curriculum learning via data parameters for noise robust keyword spotting0
EdgeCRNN: an edgecomputing oriented model of acoustic feature enhancement for keyword spotting0
Effective Combination of DenseNet andBiLSTM for Keyword Spotting0
Effective Integration of KAN for Keyword Spotting0
Efficient Continual Learning in Keyword Spotting using Binary Neural Networks0
Efficient dynamic filter for robust and low computational feature extraction0
Efficient keyword spotting using time delay neural networks0
ELiRF at MediaEval 2014: Query by Example Search on Speech Task (QUESST)0
ELiRF at MediaEval 2015: Query by Example Search on Speech Task (QUESST)0
EmoAttack: Utilizing Emotional Voice Conversion for Speech Backdoor Attacks on Deep Speech Classification Models0
Employing Phonetic Speech Recognition for Language and Dialect Specific Search0
Encoder-Decoder Neural Architecture Optimization for Keyword Spotting0
End-to-end Keyword Spotting using Neural Architecture Search and Quantization0
End-to-End Streaming Keyword Spotting0
End-to-End User-Defined Keyword Spotting using Shifted Delta Coefficients0
Enhancing Few-shot Keyword Spotting Performance through Pre-Trained Self-supervised Speech Models0
Evaluation of a Region Proposal Architecture for Multi-task Document Layout Analysis0
Eventprop training for efficient neuromorphic applications0
Expanding the Range of Automatic Emotion Detection in Microblogging Text0
Exploring Filterbank Learning for Keyword Spotting0
Exploring Representation Learning for Small-Footprint Keyword Spotting0
Exploring Sequence-to-Sequence Transformer-Transducer Models for Keyword Spotting0
Exploring the Boundaries of On-Device Inference: When Tiny Falls Short, Go Hierarchical0
Fast ASR-free and almost zero-resource keyword spotting using DTW and CNNs for humanitarian monitoring0
Feature exploration for almost zero-resource ASR-free keyword spotting using a multilingual bottleneck extractor and correspondence autoencoders0
Feature learning for efficient ASR-free keyword spotting in low-resource languages0
FEL: High Capacity Learning for Recommendation and Ranking via Federated Ensemble Learning0
Filterbank Learning for Noise-Robust Small-Footprint Keyword Spotting0
Show:102550
← PrevPage 8 of 9Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1NNI non-filtered(for the development set)Cnxe6.09Unverified
2NNI Choi(for the development set)Cnxe5.89Unverified
3NTU rnn (eval)Cnxe2.01Unverified
4NTU dtw (eval)Cnxe2.01Unverified
5NTU dtw (dev)Cnxe2.01Unverified
6NTU rnn (dev)Cnxe2.01Unverified
7ELiRF SDTW (eval)Cnxe1.19Unverified
8ELiRF SDTW-avg (eval)Cnxe1.07Unverified
9ELiRF SDTW (dev)Cnxe1.07Unverified
10CUNY [Subseq+MFCC] (eval)Cnxe1.07Unverified
#ModelMetricClaimedVerifiedStatus
1WaveFormerGoogle Speech Commands V2 1298.8Unverified
2QNNGoogle Speech Commands V2 3598.6Unverified
3TripletLoss-res15Google Speech Commands V1 1298.56Unverified
4M2DGoogle Speech Commands V2 3598.5Unverified
5EAT-SGoogle Speech Commands V2 3598.15Unverified
6Audio Spectrogram TransformerGoogle Speech Commands V2 3598.11Unverified
7EdgeCRNN 2.0×Google Speech Commands V2 1298.05Unverified
8BC-ResNet-8Google Speech Commands V1 1298Unverified
9HTS-ATGoogle Speech Commands V2 3598Unverified
10Wav2KWSGoogle Speech Commands V1 1297.9Unverified
#ModelMetricClaimedVerifiedStatus
1Stacked 1D CNNError Rate1.99Unverified
2End-to-end DNN-HMMError Rate1.7Unverified
3HEiMDaLError Rate0.45Unverified
#ModelMetricClaimedVerifiedStatus
1Res26Accuracy95.88Unverified
2EfficientNet-A0 + SA + TLAccuracy95.83Unverified
#ModelMetricClaimedVerifiedStatus
1QuaternionNeuralNetworkAccuracy (10-fold)98.53Unverified
2SSAMBAAccuracy (10-fold)97.4Unverified
#ModelMetricClaimedVerifiedStatus
1TensorFlow's model version 2TFMA89.7Unverified
2TensorFlow's model version 1TFMA85.4Unverified
#ModelMetricClaimedVerifiedStatus
12D-ConvNetAccuracy (%)95.4Unverified
21D-ConvNetAccuracy (%)93.7Unverified
#ModelMetricClaimedVerifiedStatus
1Quaternion Neural NetworksAccuracy(10-fold)98.53Unverified
#ModelMetricClaimedVerifiedStatus
1MicroNet-KWS-LAccuracy95.3Unverified