SOTAVerified

Keyword Spotting

In speech processing, keyword spotting deals with the identification of keywords in utterances.

( Image credit: Simon Grest )

Papers

Showing 201250 of 407 papers

TitleStatusHype
Challenges and Opportunities in Multi-device Speech Processing0
Characterizing Linguistic Attributes for Automatic Classification of Intent Based Racist/Radicalized Posts on Tumblr Micro-Blogging Website0
Conditional Online Learning for Keyword Spotting0
Continuous-Time Analog Filters for Audio Edge Intelligence: Review on Circuit Designs0
Contrastive Augmentation: An Unsupervised Learning Approach for Keyword Spotting in Speech Technology0
Spot keywords from very noisy and mixed speech0
ST-KeyS: Self-Supervised Transformer for Keyword Spotting in Historical Handwritten Documents0
Streaming Small-Footprint Keyword Spotting using Sequence-to-Sequence Models0
Streaming Voice Query Recognition using Causal Convolutional Recurrent Neural Networks0
Structured Transforms for Small-Footprint Deep Learning0
以音韻屬性偵測擷取對話語音關鍵詞之研究 (Study on Keyword Spotting using Prosodic Attribute Detection for Conversational Speech) [In Chinese]0
Sub 8-Bit Quantization of Streaming Keyword Spotting Models for Embedded Chipsets0
SubSpectral Normalization for Neural Audio Data Processing0
Synth4Kws: Synthesized Speech for User Defined Keyword Spotting in Low Resource Environments0
Teaching keyword spotters to spot new keywords with limited examples0
Temporal Knowledge Distillation for On-device Audio Classification0
Ternary Hybrid Neural-Tree Networks for Highly Constrained IoT Applications0
Text Anchor Based Metric Learning for Small-footprint Keyword Spotting0
Text-Aware Adapter for Few-Shot Keyword Spotting0
The DKU System Description for The Interspeech 2021 Auto-KWS Challenge0
The Effects of Data Collection Methods in Twitter0
The IIT-B Query-by-Example System for MediaEval 20150
The NNI Query-by-Example System for MediaEval 20140
The NNI Query-by-Example System for MediaEval 20150
The RATS Collection: Supporting HLT Research with Degraded Audio Data0
The Role of Temporal Hierarchy in Spiking Neural Networks0
The SPL-IT Query by Example Search on Speech system for MediaEval 20140
The SPL-IT-UC Query by Example Search on Speech system for MediaEval 20150
TinySV: Speaker Verification in TinyML with On-device Learning0
To Wake-up or Not to Wake-up: Reducing Keyword False Alarm by Successive Refinement0
Toward noise-robust whisper keyword spotting on headphones with in-earcup microphone and curriculum learning0
Towards Contactless Elevators with TinyML using CNN-based Person Detection and Keyword Spotting0
Towards efficient keyword spotting using spike-based time difference encoders0
Towards hate speech detection in low-resource languages: Comparing ASR to acoustic word embeddings on Wolof and Swahili0
Towards Robust Domain Generalization in 2D Neural Audio Processing0
Training Keyword Spotting Models on Non-IID Data with Federated Learning0
Training Wake Word Detection with Synthesized Speech Data on Confusion Words0
Transfer Learning for a Letter-Ngrams to Word Decoder in the Context of Historical Handwriting Recognition with Scarce Resources0
T-RECX: Tiny-Resource Efficient Convolutional neural networks with early-eXit0
TUKE at MediaEval 2015 QUESST0
TUKE System for MediaEval 2014 QUESST0
U2-KWS: Unified Two-pass Open-vocabulary Keyword Spotting with Keyword Bias0
Ultra-Low Power Keyword Spotting at the Edge0
Understanding Self-Supervised Learning of Speech Representation via Invariance and Redundancy Reduction0
Utilizing TTS Synthesized Data for Efficient Development of Keyword Spotting Model0
VIC-KD: Variance-Invariance-Covariance Knowledge Distillation to Make Keyword Spotting More Robust Against Adversarial Attacks0
Visually grounded cross-lingual keyword spotting in speech0
Vocal Tract Length Warped Features for Spoken Keyword Spotting0
VSVC: Backdoor attack against Keyword Spotting based on Voiceprint Selection and Voice Conversion0
Wakeword Detection under Distribution Shifts0
Show:102550
← PrevPage 5 of 9Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1NNI non-filtered(for the development set)Cnxe6.09Unverified
2NNI Choi(for the development set)Cnxe5.89Unverified
3NTU rnn (eval)Cnxe2.01Unverified
4NTU dtw (eval)Cnxe2.01Unverified
5NTU dtw (dev)Cnxe2.01Unverified
6NTU rnn (dev)Cnxe2.01Unverified
7ELiRF SDTW (eval)Cnxe1.19Unverified
8ELiRF SDTW-avg (eval)Cnxe1.07Unverified
9ELiRF SDTW (dev)Cnxe1.07Unverified
10CUNY [Subseq+MFCC] (eval)Cnxe1.07Unverified
#ModelMetricClaimedVerifiedStatus
1WaveFormerGoogle Speech Commands V2 1298.8Unverified
2QNNGoogle Speech Commands V2 3598.6Unverified
3TripletLoss-res15Google Speech Commands V1 1298.56Unverified
4M2DGoogle Speech Commands V2 3598.5Unverified
5EAT-SGoogle Speech Commands V2 3598.15Unverified
6Audio Spectrogram TransformerGoogle Speech Commands V2 3598.11Unverified
7EdgeCRNN 2.0×Google Speech Commands V2 1298.05Unverified
8BC-ResNet-8Google Speech Commands V1 1298Unverified
9HTS-ATGoogle Speech Commands V2 3598Unverified
10Wav2KWSGoogle Speech Commands V1 1297.9Unverified
#ModelMetricClaimedVerifiedStatus
1Stacked 1D CNNError Rate1.99Unverified
2End-to-end DNN-HMMError Rate1.7Unverified
3HEiMDaLError Rate0.45Unverified
#ModelMetricClaimedVerifiedStatus
1Res26Accuracy95.88Unverified
2EfficientNet-A0 + SA + TLAccuracy95.83Unverified
#ModelMetricClaimedVerifiedStatus
1QuaternionNeuralNetworkAccuracy (10-fold)98.53Unverified
2SSAMBAAccuracy (10-fold)97.4Unverified
#ModelMetricClaimedVerifiedStatus
1TensorFlow's model version 2TFMA89.7Unverified
2TensorFlow's model version 1TFMA85.4Unverified
#ModelMetricClaimedVerifiedStatus
12D-ConvNetAccuracy (%)95.4Unverified
21D-ConvNetAccuracy (%)93.7Unverified
#ModelMetricClaimedVerifiedStatus
1Quaternion Neural NetworksAccuracy(10-fold)98.53Unverified
#ModelMetricClaimedVerifiedStatus
1MicroNet-KWS-LAccuracy95.3Unverified