SOTAVerified

Keyword Spotting

In speech processing, keyword spotting deals with the identification of keywords in utterances.

(Image credit: Simon Grest)

Papers

Showing 351–400 of 407 papers

Title | Status | Hype
Visually grounded cross-lingual keyword spotting in speech | - | 0
Resource-Efficient Neural Architect | - | 0
A Bird's-eye View of Language Processing Projects at the Romanian Academy | - | 0
Developing Far-Field Speaker System Via Teacher-Student Learning | - | 0
Speech Commands: A Dataset for Limited-Vocabulary Speech Recognition | Code | 1
Attention-based End-to-End Models for Small-Footprint Keyword Spotting | Code | 0
Speech Recognition: Keyword Spotting Through Image Recognition | - | 0
Zone-based Keyword Spotting in Bangla and Devanagari Documents | - | 0
Multiple-Instance, Cascaded Classification for Keyword Spotting in Narrow-Band Audio | - | 0
Hello Edge: Keyword Spotting on Microcontrollers | Code | 0
Deep Residual Learning for Small-Footprint Keyword Spotting | Code | 1
Streaming Small-Footprint Keyword Spotting using Sequence-to-Sequence Models | - | 0
Honk: A PyTorch Reimplementation of Convolutional Neural Networks for Keyword Spotting | Code | 0
Small-footprint Keyword Spotting Using Deep Neural Network and Connectionist Temporal Classifier | - | 0
Word Searching in Scene Image and Video Frame in Multi-Script Scenario using Dynamic Shape Coding | - | 0
Polish Read Speech Corpus for Speech Tools and Services | - | 0
READ-BAD: A New Dataset and Evaluation Scheme for Baseline Detection in Archival Documents | Code | 0
Max-Pooling Loss Training of Long Short-Term Memory Networks for Small-Footprint Keyword Spotting | - | 0
Automatic Extraction of News Values from Headline Text | - | 0
Convolutional Recurrent Neural Networks for Small-Footprint Keyword Spotting | - | 0
Characterizing Linguistic Attributes for Automatic Classification of Intent Based Racist/Radicalized Posts on Tumblr Micro-Blogging Website | - | 0
An End-to-End Architecture for Keyword Spotting and Voice Activity Detection | Code | 1
The Effects of Data Collection Methods in Twitter | - | 0
Neural Morphological Analysis: Encoding-Decoding Canonical Segments | - | 0
Trainable Frontend For Robust and Far-Field Keyword Spotting | Code | 0
A Joint Model of Orthography and Morphological Segmentation | - | 0
Online Keyword Spotting with a Character-Level Recurrent Neural Network | - | 0
Structured Transforms for Small-Footprint Deep Learning | - | 0
TUKE at MediaEval 2015 QUESST | - | 0
CUNY Systems for the Query-by-Example Search on Speech Task at MediaEval 2015 | - | 0
GTM-UVigo Systems for the Query-by-Example Search on Speech Task at MediaEval 2015 | Code | 0
BUT QUESST 2015 System Description | - | 0
The NNI Query-by-Example System for MediaEval 2015 | - | 0
The IIT-B Query-by-Example System for MediaEval 2015 | - | 0
The SPL-IT-UC Query by Example Search on Speech system for MediaEval 2015 | - | 0
ELiRF at MediaEval 2015: Query by Example Search on Speech Task (QUESST) | - | 0
NTU System at MediaEval 2015: Zero Resource Query by Example Spoken Term Detection Using Deep and Recurrent Neural Networks | - | 0
Speech and language technologies for the automatic monitoring and training of cognitive functions | - | 0
Finding Opinion Manipulation Trolls in News Community Forums | - | 0
What’s Cookin’? Interpreting Cooking Videos using Text, Speech and Vision | Code | 0
What's Cookin'? Interpreting Cooking Videos using Text, Speech and Vision | Code | 0
BUT QUESST 2014 System Description | - | 0
ELiRF at MediaEval 2014: Query by Example Search on Speech Task (QUESST) | - | 0
IIIT-H System for MediaEval 2014 QUESST | - | 0
The SPL-IT Query by Example Search on Speech system for MediaEval 2014 | - | 0
TUKE System for MediaEval 2014 QUESST | - | 0
The NNI Query-by-Example System for MediaEval 2014 | - | 0
GTTS-EHU Systems for QUESST at MediaEval 2014 | - | 0
CUHK System for QUESST Task of MediaEval 2014 | - | 0
Morphological Segmentation for Keyword Spotting | - | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | NNI non-filtered (for the development set) | Cnxe | 6.09 | - | Unverified
2 | NNI Choi (for the development set) | Cnxe | 5.89 | - | Unverified
3 | NTU rnn (eval) | Cnxe | 2.01 | - | Unverified
4 | NTU dtw (eval) | Cnxe | 2.01 | - | Unverified
5 | NTU dtw (dev) | Cnxe | 2.01 | - | Unverified
6 | NTU rnn (dev) | Cnxe | 2.01 | - | Unverified
7 | ELiRF SDTW (eval) | Cnxe | 1.19 | - | Unverified
8 | ELiRF SDTW-avg (eval) | Cnxe | 1.07 | - | Unverified
9 | ELiRF SDTW (dev) | Cnxe | 1.07 | - | Unverified
10 | CUNY [Subseq+MFCC] (eval) | Cnxe | 1.07 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | WaveFormer | Google Speech Commands V2 12 | 98.8 | - | Unverified
2 | QNN | Google Speech Commands V2 35 | 98.6 | - | Unverified
3 | TripletLoss-res15 | Google Speech Commands V1 12 | 98.56 | - | Unverified
4 | M2D | Google Speech Commands V2 35 | 98.5 | - | Unverified
5 | EAT-S | Google Speech Commands V2 35 | 98.15 | - | Unverified
6 | Audio Spectrogram Transformer | Google Speech Commands V2 35 | 98.11 | - | Unverified
7 | EdgeCRNN 2.0× | Google Speech Commands V2 12 | 98.05 | - | Unverified
8 | BC-ResNet-8 | Google Speech Commands V1 12 | 98 | - | Unverified
9 | HTS-AT | Google Speech Commands V2 35 | 98 | - | Unverified
10 | Wav2KWS | Google Speech Commands V1 12 | 97.9 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Stacked 1D CNN | Error Rate | 1.99 | - | Unverified
2 | End-to-end DNN-HMM | Error Rate | 1.7 | - | Unverified
3 | HEiMDaL | Error Rate | 0.45 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Res26 | Accuracy | 95.88 | - | Unverified
2 | EfficientNet-A0 + SA + TL | Accuracy | 95.83 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | QuaternionNeuralNetwork | Accuracy (10-fold) | 98.53 | - | Unverified
2 | SSAMBA | Accuracy (10-fold) | 97.4 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | TensorFlow's model version 2 | TFMA | 89.7 | - | Unverified
2 | TensorFlow's model version 1 | TFMA | 85.4 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | 2D-ConvNet | Accuracy (%) | 95.4 | - | Unverified
2 | 1D-ConvNet | Accuracy (%) | 93.7 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Quaternion Neural Networks | Accuracy (10-fold) | 98.53 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | MicroNet-KWS-L | Accuracy | 95.3 | - | Unverified