| PaddleSpeech: An Easy-to-Use All-in-One Speech Toolkit | May 20, 2022 | AllAutomatic Speech Recognition (ASR) | CodeCode Available | 6 | 5 |
| MFA-KWS: Effective Keyword Spotting with Multi-head Frame-asynchronous Decoding | May 26, 2025 | Keyword Spotting | CodeCode Available | 2 | 5 |
| AST: Audio Spectrogram Transformer | Apr 5, 2021 | Audio ClassificationAudio Tagging | CodeCode Available | 2 | 5 |
| SSAST: Self-Supervised Audio Spectrogram Transformer | Oct 19, 2021 | Audio ClassificationClassification | CodeCode Available | 2 | 5 |
| SSAMBA: Self-Supervised Audio Representation Learning with Mamba State Space Model | May 20, 2024 | Audio ClassificationGPU | CodeCode Available | 2 | 5 |
| TDT-KWS: Fast And Accurate Keyword Spotting Using Token-and-duration Transducer | Mar 20, 2024 | Keyword Spotting | CodeCode Available | 2 | 5 |
| Training Keyword Spotters with Limited and Synthesized Speech Data | Jan 31, 2020 | Keyword Spotting | CodeCode Available | 2 | 5 |
| WeKws: A production first small-footprint end-to-end Keyword Spotting Toolkit | Oct 30, 2022 | Keyword Spotting | CodeCode Available | 2 | 5 |
| HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection | Feb 2, 2022 | Audio ClassificationEvent Detection | CodeCode Available | 2 | 5 |
| GLAP: General contrastive audio-text pretraining across domains and languages | Jun 12, 2025 | AudioCapsKeyword Spotting | CodeCode Available | 2 | 5 |
| Streaming Keyword Spotting Boosted by Cross-layer Discrimination Consistency | Dec 17, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 2 | 5 |
| Small-Footprint Keyword Spotting with Multi-Scale Temporal Convolution | Oct 20, 2020 | Efficient Neural NetworkKeyword Spotting | CodeCode Available | 1 | 5 |
| Speech Commands: A Dataset for Limited-Vocabulary Speech Recognition | Apr 9, 2018 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 | 5 |
| Sparse Binarization for Fast Keyword Spotting | Jun 9, 2024 | BinarizationKeyword Spotting | CodeCode Available | 1 | 5 |
| Reduced Precision Floating-Point Optimization for Deep Neural Network On-Device Learning on MicroControllers | May 30, 2023 | Continual Learningimage-classification | CodeCode Available | 1 | 5 |
| Improving Label-Deficient Keyword Spotting Through Self-Supervised Pretraining | Oct 4, 2022 | Keyword SpottingSelf-Supervised Learning | CodeCode Available | 1 | 5 |
| Rainbow Keywords: Efficient Incremental Learning for Online Spoken Keyword Spotting | Mar 30, 2022 | Data AugmentationDiversity | CodeCode Available | 1 | 5 |
| Seeing wake words: Audio-visual Keyword Spotting | Sep 2, 2020 | Keyword SpottingLip Reading | CodeCode Available | 1 | 5 |
| AfriHate: A Multilingual Collection of Hate Speech and Abusive Language Datasets for African Languages | Jan 14, 2025 | Abusive LanguageKeyword Spotting | CodeCode Available | 1 | 5 |
| EfficientNet-Absolute Zero for Continuous Speech Keyword Spotting | Dec 31, 2020 | Keyword SpottingKeyword Spotting CSS | CodeCode Available | 1 | 5 |
| MicroNets: Neural Network Architectures for Deploying TinyML Applications on Commodity Microcontrollers | Oct 21, 2020 | Anomaly DetectionKeyword Spotting | CodeCode Available | 1 | 5 |
| The taste of IPA: Towards open-vocabulary keyword spotting and forced alignment in any language | Nov 14, 2023 | Keyword Spotting | CodeCode Available | 1 | 5 |
| Few-Shot Keyword Spotting With Prototypical Networks | Jul 25, 2020 | Keyword SpottingMetric Learning | CodeCode Available | 1 | 5 |
| Progressive Continual Learning for Spoken Keyword Spotting | Jan 29, 2022 | Continual LearningKeyword Spotting | CodeCode Available | 1 | 5 |
| Few-Shot Open-Set Learning for On-Device Customization of KeyWord Spotting Systems | Jun 3, 2023 | Few-Shot LearningKeyword Spotting | CodeCode Available | 1 | 5 |
| End-to-End Audio Strikes Back: Boosting Augmentations Towards An Efficient Audio Classification Network | Apr 25, 2022 | Audio ClassificationClassification | CodeCode Available | 1 | 5 |
| Self-Learning for Personalized Keyword Spotting on Ultra-Low-Power Audio Sensors | Aug 22, 2024 | Keyword SpottingSelf-Learning | CodeCode Available | 1 | 5 |
| SiDi KWS: A Large-Scale Multilingual Dataset for Keyword Spotting | Sep 22, 2022 | Deep LearningKeyword Extraction | CodeCode Available | 1 | 5 |
| MLPerf Tiny Benchmark | Jun 14, 2021 | Anomaly DetectionBIG-bench Machine Learning | CodeCode Available | 1 | 5 |
| Howl: A Deployed, Open-Source Wake Word Detection System | Aug 21, 2020 | Keyword Spotting | CodeCode Available | 1 | 5 |
| Phoneme Boundary Detection using Learnable Segmental Features | Feb 11, 2020 | Boundary DetectionKeyword Spotting | CodeCode Available | 1 | 5 |
| Learning Efficient Representations for Keyword Spotting with Triplet Loss | Jan 12, 2021 | ClassificationKeyword Spotting | CodeCode Available | 1 | 5 |
| BiFSMN: Binary Neural Network for Keyword Spotting | Feb 14, 2022 | BinarizationKeyword Spotting | CodeCode Available | 1 | 5 |
| BiFSMNv2: Pushing Binary Neural Networks for Keyword Spotting to Real-Network Performance | Nov 13, 2022 | BinarizationKeyword Spotting | CodeCode Available | 1 | 5 |
| Learning Audio-Text Agreement for Open-vocabulary Keyword Spotting | Jun 30, 2022 | Keyword Spotting | CodeCode Available | 1 | 5 |
| LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT | Mar 29, 2022 | AllAutomatic Speech Recognition | CodeCode Available | 1 | 5 |
| BSL-1K: Scaling up co-articulated sign language recognition using mouthing cues | Jul 23, 2020 | Action ClassificationKeyword Spotting | CodeCode Available | 1 | 5 |
| Broadcasted Residual Learning for Efficient Keyword Spotting | Jun 8, 2021 | Keyword Spotting | CodeCode Available | 1 | 5 |
| Deep Residual Learning for Small-Footprint Keyword Spotting | Oct 28, 2017 | Keyword SpottingSmall-Footprint Keyword Spotting | CodeCode Available | 1 | 5 |
| Decentralizing Feature Extraction with Quantum Convolutional Neural Network for Automatic Speech Recognition | Oct 26, 2020 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 | 5 |
| MAST: Multiscale Audio Spectrogram Transformers | Nov 2, 2022 | Audio ClassificationKeyword Spotting | CodeCode Available | 1 | 5 |
| An End-to-End Architecture for Keyword Spotting and Voice Activity Detection | Nov 28, 2016 | Action DetectionActivity Detection | CodeCode Available | 1 | 5 |
| Auto-KWS 2021 Challenge: Task, Datasets, and Baselines | Mar 31, 2021 | AutoMLBIG-bench Machine Learning | CodeCode Available | 1 | 5 |
| ASiT: Local-Global Audio Spectrogram vIsion Transformer for Event Classification | Nov 23, 2022 | Keyword SpottingSelf-Supervised Learning | CodeCode Available | 1 | 5 |
| ED-sKWS: Early-Decision Spiking Neural Networks for Rapid,and Energy-Efficient Keyword Spotting | Jun 14, 2024 | Edge-computingKeyword Spotting | CodeCode Available | 1 | 5 |
| MM-KWS: Multi-modal Prompts for Multilingual User-defined Keyword Spotting | Jun 11, 2024 | Data AugmentationKeyword Spotting | CodeCode Available | 1 | 5 |
| Keyword Spotting System and Evaluation of Pruning and Quantization Methods on Low-power Edge Microcontrollers | Aug 4, 2022 | Edge-computingKeyword Spotting | CodeCode Available | 1 | 5 |
| Chameleon: A MatMul-Free Temporal Convolutional Network Accelerator for End-to-End Few-Shot and Continual Learning from Sequential Data | May 30, 2025 | Continual LearningFew-Shot Learning | CodeCode Available | 1 | 5 |
| Few-Shot Keyword Spotting in Any Language | Apr 3, 2021 | Keyword SpottingTransfer Learning | CodeCode Available | 1 | 5 |
| Attention-Free Keyword Spotting | Oct 14, 2021 | Keyword Spotting | CodeCode Available | 1 | 5 |