SOTAVerified

Keyword Spotting

In speech processing, keyword spotting deals with the identification of keywords in utterances.

( Image credit: Simon Grest )

Papers

Showing 151200 of 407 papers

TitleStatusHype
Fully Unsupervised Training of Few-shot Keyword Spotting0
Frequency & Channel Attention Network for Small Footprint Noisy Spoken Keyword Spotting0
Challenges and Opportunities in Multi-device Speech Processing0
An In-Vehicle KWS System with Multi-Source Fusion for Vehicle Applications0
Flexible Keyword Spotting based on Homogeneous Audio-Text Embedding0
Global-Local Convolution with Spiking Neural Networks for Energy-efficient Keyword Spotting0
GraphemeAug: A Systematic Approach to Synthesized Hard Negative Keyword Spotting Examples0
An Optimized Recurrent Unit for Ultra-Low-Power Keyword Spotting0
GTTS-EHU Systems for QUESST at MediaEval 20140
Hardware Aware Training for Efficient Keyword Spotting on General Purpose and Specialized Hardware0
Hardware/Software Co-Design of RISC-V Extensions for Accelerating Sparse DNNs on FPGAs0
HEiMDaL: Highly Efficient Method for Detection and Localization of wake-words0
Contrastive Augmentation: An Unsupervised Learning Approach for Keyword Spotting in Speech Technology0
Hierarchical Neural Network Architecture In Keyword Spotting0
Contrastive Learning With Audio Discrimination For Customizable Keyword Spotting In Continuous Speech0
A Fast Network Exploration Strategy to Profile Low Energy Consumption for Keyword Spotting0
Fixed-point quantization aware training for on-device keyword-spotting0
How Tiny Can Analog Filterbank Features Be Made for Ultra-low-power On-device Keyword Spotting?0
A Multitask Training Approach to Enhance Whisper with Contextual Biasing and Open-Vocabulary Keyword Spotting0
IIIT-H System for MediaEval 2014 QUESST0
台語關鍵詞辨識之實作與比較 (Implementation and Comparison of Keyword Spotting for Taiwanese) [In Chinese]0
Implicit Acoustic Echo Cancellation for Keyword Spotting and Device-Directed Speech Detection0
CUHK System for QUESST Task of MediaEval 20140
Improved low-resource Somali speech recognition by semi-supervised acoustic and language model training0
Finding Opinion Manipulation Trolls in News Community Forums0
Filterbank Learning for Noise-Robust Small-Footprint Keyword Spotting0
Improving Reverberant Speech Training Using Diffuse Acoustic Simulation0
Improving Small Footprint Few-shot Keyword Spotting with Supervision on Auxiliary Data0
BUT QUESST 2015 System Description0
Improving vision-inspired keyword spotting using dynamic module skipping in streaming conformer encoder0
An Integrated Framework for Two-pass Personalized Voice Trigger0
DASB -- Discrete Audio and Speech Benchmark0
BUT QUESST 2014 System Description0
Data Augmentation for Robust Keyword Spotting under Playback Interference0
Keyword-Guided Adaptation of Automatic Speech Recognition0
DCCRN-KWS: an audio bias based model for noise robust small-footprint keyword spotting0
A Channel-Pruned and Weight-Binarized Convolutional Neural Network for Keyword Spotting0
Keyword spotting -- Detecting commands in speech using deep learning0
Keyword spotting for audiovisual archival search in Uralic languages0
Keyword Spotting for Hearing Assistive Devices Robust to External Speakers0
An Exploration into the Performance of Unsupervised Cross-Task Speech Representations for "In the Wild'' Edge Applications0
A 14uJ/Decision Keyword Spotting Accelerator with In-SRAM-Computing and On Chip Learning for Customization0
Leveraging Large Language Models for Exploiting ASR Uncertainty0
A Lightweight dynamic filter for keyword spotting0
Latency Control for Keyword Spotting0
Learnable Front Ends Based on Temporal Modulation for Music Tagging0
FEL: High Capacity Learning for Recommendation and Ranking via Federated Ensemble Learning0
Learning Decoupling Features Through Orthogonality Regularization0
Feature exploration for almost zero-resource ASR-free keyword spotting using a multilingual bottleneck extractor and correspondence autoencoders0
Feature learning for efficient ASR-free keyword spotting in low-resource languages0
Show:102550
← PrevPage 4 of 9Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1NNI non-filtered(for the development set)Cnxe6.09Unverified
2NNI Choi(for the development set)Cnxe5.89Unverified
3NTU rnn (eval)Cnxe2.01Unverified
4NTU dtw (eval)Cnxe2.01Unverified
5NTU dtw (dev)Cnxe2.01Unverified
6NTU rnn (dev)Cnxe2.01Unverified
7ELiRF SDTW (eval)Cnxe1.19Unverified
8ELiRF SDTW-avg (eval)Cnxe1.07Unverified
9ELiRF SDTW (dev)Cnxe1.07Unverified
10CUNY [Subseq+MFCC] (eval)Cnxe1.07Unverified
#ModelMetricClaimedVerifiedStatus
1WaveFormerGoogle Speech Commands V2 1298.8Unverified
2QNNGoogle Speech Commands V2 3598.6Unverified
3TripletLoss-res15Google Speech Commands V1 1298.56Unverified
4M2DGoogle Speech Commands V2 3598.5Unverified
5EAT-SGoogle Speech Commands V2 3598.15Unverified
6Audio Spectrogram TransformerGoogle Speech Commands V2 3598.11Unverified
7EdgeCRNN 2.0×Google Speech Commands V2 1298.05Unverified
8BC-ResNet-8Google Speech Commands V1 1298Unverified
9HTS-ATGoogle Speech Commands V2 3598Unverified
10Wav2KWSGoogle Speech Commands V1 1297.9Unverified
#ModelMetricClaimedVerifiedStatus
1Stacked 1D CNNError Rate1.99Unverified
2End-to-end DNN-HMMError Rate1.7Unverified
3HEiMDaLError Rate0.45Unverified
#ModelMetricClaimedVerifiedStatus
1Res26Accuracy95.88Unverified
2EfficientNet-A0 + SA + TLAccuracy95.83Unverified
#ModelMetricClaimedVerifiedStatus
1QuaternionNeuralNetworkAccuracy (10-fold)98.53Unverified
2SSAMBAAccuracy (10-fold)97.4Unverified
#ModelMetricClaimedVerifiedStatus
1TensorFlow's model version 2TFMA89.7Unverified
2TensorFlow's model version 1TFMA85.4Unverified
#ModelMetricClaimedVerifiedStatus
12D-ConvNetAccuracy (%)95.4Unverified
21D-ConvNetAccuracy (%)93.7Unverified
#ModelMetricClaimedVerifiedStatus
1Quaternion Neural NetworksAccuracy(10-fold)98.53Unverified
#ModelMetricClaimedVerifiedStatus
1MicroNet-KWS-LAccuracy95.3Unverified