| Open-vocabulary Keyword-spotting with Adaptive Instance Normalization | Sep 13, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| iPhonMatchNet: Zero-Shot User-Defined Keyword Spotting Using Implicit Acoustic Echo Cancellation | Sep 12, 2023 | Acoustic echo cancellationKeyword Spotting | —Unverified | 0 |
| Leveraging Large Language Models for Exploiting ASR Uncertainty | Sep 9, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Understanding Self-Supervised Learning of Speech Representation via Invariance and Redundancy Reduction | Sep 7, 2023 | Keyword SpottingSelf-Supervised Learning | —Unverified | 0 |
| Improving Small Footprint Few-shot Keyword Spotting with Supervision on Auxiliary Data | Aug 31, 2023 | Keyword SpottingMulti-Task Learning | —Unverified | 0 |
| Improving vision-inspired keyword spotting using dynamic module skipping in streaming conformer encoder | Aug 31, 2023 | Keyword Spotting | —Unverified | 0 |
| PhonMatchNet: Phoneme-Guided Zero-Shot Keyword Spotting for User-Defined Keywords | Aug 31, 2023 | Keyword Spotting | CodeCode Available | 1 |
| Flexible Keyword Spotting based on Homogeneous Audio-Text Embedding | Aug 12, 2023 | Keyword Spotting | —Unverified | 0 |
| Keyword Spotting Simplified: A Segmentation-Free Approach using Character Counting and CTC re-scoring | Aug 7, 2023 | Keyword Spottingobject-detection | CodeCode Available | 0 |
| Online Continual Learning in Keyword Spotting for Low-Resource Devices via Pooling High-Order Temporal Statistics | Jul 24, 2023 | Continual LearningKeyword Spotting | CodeCode Available | 0 |
| Custom DNN using Reward Modulated Inverted STDP Learning for Temporal Pattern Recognition | Jul 15, 2023 | Anomaly DetectionKeyword Spotting | —Unverified | 0 |
| On-Device Constrained Self-Supervised Speech Representation Learning for Keyword Spotting via Knowledge Distillation | Jul 6, 2023 | Keyword SpottingKnowledge Distillation | —Unverified | 0 |
| Matching Latent Encoding for Audio-Text based Keyword Spotting | Jun 8, 2023 | Keyword Spotting | —Unverified | 0 |
| Few-Shot Open-Set Learning for On-Device Customization of KeyWord Spotting Systems | Jun 3, 2023 | Few-Shot LearningKeyword Spotting | CodeCode Available | 1 |
| Towards hate speech detection in low-resource languages: Comparing ASR to acoustic word embeddings on Wolof and Swahili | Jun 1, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Reduced Precision Floating-Point Optimization for Deep Neural Network On-Device Learning on MicroControllers | May 30, 2023 | Continual Learningimage-classification | CodeCode Available | 1 |
| Spot keywords from very noisy and mixed speech | May 28, 2023 | Data AugmentationKeyword Spotting | —Unverified | 0 |
| DCCRN-KWS: an audio bias based model for noise robust small-footprint keyword spotting | May 21, 2023 | DenoisingKeyword Spotting | —Unverified | 0 |
| Conditional Online Learning for Keyword Spotting | May 19, 2023 | Continual LearningKeyword Spotting | —Unverified | 0 |
| TACos: Learning Temporally Structured Embeddings for Few-Shot Keyword Spotting with Dynamic Time Warping | May 18, 2023 | Dynamic Time WarpingKeyword Spotting | CodeCode Available | 0 |
| Semi-Supervised Federated Learning for Keyword Spotting | May 9, 2023 | Federated LearningKeyword Spotting | CodeCode Available | 0 |
| An Exploration into the Performance of Unsupervised Cross-Task Speech Representations for "In the Wild'' Edge Applications | May 9, 2023 | Emotion Recognitionintent-classification | —Unverified | 0 |
| Plug-and-Play Multilingual Few-shot Spoken Words Recognition | May 3, 2023 | Few-Shot LearningKeyword Spotting | —Unverified | 0 |
| Small-footprint slimmable networks for keyword spotting | Apr 21, 2023 | Keyword SpottingSmall-Footprint Keyword Spotting | —Unverified | 0 |
| Multilingual Query-by-Example Keyword Spotting with Metric Learning and Phoneme-to-Embedding Mapping | Apr 19, 2023 | Keyword SpottingMetric Learning | —Unverified | 0 |
| How Tiny Can Analog Filterbank Features Be Made for Ultra-low-power On-device Keyword Spotting? | Apr 17, 2023 | Keyword Spotting | —Unverified | 0 |
| Unsupervised Speech Representation Pooling Using Vector Quantization | Apr 8, 2023 | Emotion Recognitionintent-classification | CodeCode Available | 0 |
| To Wake-up or Not to Wake-up: Reducing Keyword False Alarm by Successive Refinement | Apr 6, 2023 | Keyword Spotting | —Unverified | 0 |
| AraSpot: Arabic Spoken Command Spotting | Mar 29, 2023 | Data AugmentationKeyword Spotting | CodeCode Available | 0 |
| Exploring Representation Learning for Small-Footprint Keyword Spotting | Mar 20, 2023 | Contrastive LearningKeyword Spotting | —Unverified | 0 |
| Self-supervised speech representation learning for keyword-spotting with light-weight transformers | Mar 7, 2023 | Keyword SpottingRepresentation Learning | —Unverified | 0 |
| ST-KeyS: Self-Supervised Transformer for Keyword Spotting in Historical Handwritten Documents | Mar 6, 2023 | Keyword SpottingSelf-Supervised Learning | —Unverified | 0 |
| Fixed-point quantization aware training for on-device keyword-spotting | Mar 4, 2023 | Keyword SpottingQuantization | —Unverified | 0 |
| Scalable Weight Reparametrization for Efficient Transfer Learning | Feb 26, 2023 | Keyword SpottingTransfer Learning | —Unverified | 0 |
| Locale Encoding For Scalable Multilingual Keyword Spotting Models | Feb 25, 2023 | Keyword Spotting | —Unverified | 0 |
| Speech Privacy Leakage from Shared Gradients in Distributed Learning | Feb 21, 2023 | Federated LearningKeyword Spotting | —Unverified | 0 |
| LipLearner: Customizable Silent Speech Interactions on Mobile Devices | Feb 12, 2023 | Contrastive LearningIncremental Learning | CodeCode Available | 1 |
| A Comparison of Temporal Encoders for Neuromorphic Keyword Spotting with Few Neurons | Jan 24, 2023 | Binary ClassificationKeyword Spotting | —Unverified | 0 |
| Analyzing the Representational Geometry of Acoustic Word Embeddings | Jan 8, 2023 | Keyword SpottingWord Embeddings | —Unverified | 0 |
| VSVC: Backdoor attack against Keyword Spotting based on Voiceprint Selection and Voice Conversion | Dec 20, 2022 | Backdoor AttackKeyword Spotting | —Unverified | 0 |
| Learnable Front Ends Based on Temporal Modulation for Music Tagging | Nov 28, 2022 | Keyword SpottingMusic Tagging | —Unverified | 0 |
| ASiT: Local-Global Audio Spectrogram vIsion Transformer for Event Classification | Nov 23, 2022 | Keyword SpottingSelf-Supervised Learning | CodeCode Available | 1 |
| Filterbank Learning for Noise-Robust Small-Footprint Keyword Spotting | Nov 19, 2022 | Keyword SpottingSmall-Footprint Keyword Spotting | —Unverified | 0 |
| PBSM: Backdoor attack against Keyword spotting based on pitch boosting and sound masking | Nov 16, 2022 | Backdoor AttackKeyword Spotting | —Unverified | 0 |
| BiFSMNv2: Pushing Binary Neural Networks for Keyword Spotting to Real-Network Performance | Nov 13, 2022 | BinarizationKeyword Spotting | CodeCode Available | 1 |
| Exploring Sequence-to-Sequence Transformer-Transducer Models for Keyword Spotting | Nov 11, 2022 | Keyword Spotting | —Unverified | 0 |
| LiCo-Net: Linearized Convolution Network for Hardware-efficient Keyword Spotting | Nov 9, 2022 | Keyword Spotting | —Unverified | 0 |
| Integrated Parameter-Efficient Tuning for General-Purpose Audio Models | Nov 4, 2022 | Genre classificationKeyword Spotting | CodeCode Available | 0 |
| Harnessing the Power of Explanations for Incremental Training: A LIME-Based Approach | Nov 2, 2022 | Feature ImportanceIncremental Learning | —Unverified | 0 |
| MAST: Multiscale Audio Spectrogram Transformers | Nov 2, 2022 | Audio ClassificationKeyword Spotting | CodeCode Available | 1 |