| Privacy-preserving Representation Learning for Speech Understanding | Oct 26, 2023 | ClassificationEmotion Recognition | —Unverified | 0 |
| Advanced accent/dialect identification and accentedness assessment with multi-embedding models and automatic speech recognition | Oct 17, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| End-to-end Multichannel Speaker-Attributed ASR: Speaker Guided Decoder and Input Feature Analysis | Oct 16, 2023 | Automatic Speech RecognitionDecoder | —Unverified | 0 |
| InstructERC: Reforming Emotion Recognition in Conversation with Multi-task Retrieval-Augmented Large Language Models | Sep 21, 2023 | Emotion RecognitionEmotion Recognition in Conversation | CodeCode Available | 1 |
| Test-Time Training for Speech | Sep 19, 2023 | parameter-efficient fine-tuningSpeaker Identification | —Unverified | 0 |
| Spiking-LEAF: A Learnable Auditory front-end for Spiking Neural Networks | Sep 18, 2023 | Keyword SpottingSpeaker Identification | —Unverified | 0 |
| Understanding Self-Supervised Learning of Speech Representation via Invariance and Redundancy Reduction | Sep 7, 2023 | Keyword SpottingSelf-Supervised Learning | —Unverified | 0 |
| An Effective Transformer-based Contextual Model and Temporal Gate Pooling for Speaker Identification | Aug 22, 2023 | Self-Supervised LearningSpeaker Identification | CodeCode Available | 0 |
| Read, Look or Listen? What's Needed for Solving a Multimodal Dataset | Jul 6, 2023 | Question AnsweringSpeaker Identification | —Unverified | 0 |
| Gammatonegram Representation for End-to-End Dysarthric Speech Processing Tasks: Speech Recognition, Speaker Identification, and Intelligibility Assessment | Jul 6, 2023 | Speaker Identificationspeech-recognition | CodeCode Available | 0 |
| VoxWatch: An open-set speaker recognition benchmark on VoxCeleb | Jun 30, 2023 | Speaker IdentificationSpeaker Recognition | —Unverified | 0 |
| Non-uniform Speaker Disentanglement For Depression Detection From Raw Speech Signals | Jun 2, 2023 | Depression DetectionDisentanglement | CodeCode Available | 1 |
| Meta-Learning Framework for End-to-End Imposter Identification in Unseen Speaker Recognition | Jun 1, 2023 | Meta-LearningSpeaker Identification | —Unverified | 0 |
| Few-Shot Speaker Identification Using Lightweight Prototypical Network with Feature Grouping and Interaction | May 31, 2023 | Speaker Identification | —Unverified | 0 |
| MPCHAT: Towards Multimodal Persona-Grounded Conversation | May 27, 2023 | Speaker Identification | CodeCode Available | 1 |
| Ordered and Binary Speaker Embedding | May 25, 2023 | ClusteringRetrieval | —Unverified | 0 |
| On the Transferability of Whisper-based Representations for "In-the-Wild" Cross-Task Downstream Speech Applications | May 23, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| GIFT: Graph-Induced Fine-Tuning for Multi-Party Conversation Understanding | May 16, 2023 | Speaker Identification | CodeCode Available | 1 |
| Security and Privacy Problems in Voice Assistant Applications: A Survey | Apr 19, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Unsupervised Speech Representation Pooling Using Vector Quantization | Apr 8, 2023 | Emotion Recognitionintent-classification | CodeCode Available | 0 |
| HiSSNet: Sound Event Detection and Speaker Identification via Hierarchical Prototypical Networks for Low-Resource Headphones | Mar 13, 2023 | Event DetectionSound Event Detection | —Unverified | 0 |
| Ensemble knowledge distillation of self-supervised speech models | Feb 24, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| ExARN: self-attending RNN for target speaker extraction | Dec 2, 2022 | Speaker IdentificationTarget Speaker Extraction | —Unverified | 0 |
| ASiT: Local-Global Audio Spectrogram vIsion Transformer for Event Classification | Nov 23, 2022 | Keyword SpottingSelf-Supervised Learning | CodeCode Available | 1 |
| MelHuBERT: A simplified HuBERT on Mel spectrograms | Nov 17, 2022 | Automatic Speech RecognitionSelf-Supervised Learning | CodeCode Available | 1 |