| Privacy-preserving Representation Learning for Speech Understanding | Oct 26, 2023 | ClassificationEmotion Recognition | —Unverified | 0 |
| Advanced accent/dialect identification and accentedness assessment with multi-embedding models and automatic speech recognition | Oct 17, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| End-to-end Multichannel Speaker-Attributed ASR: Speaker Guided Decoder and Input Feature Analysis | Oct 16, 2023 | Automatic Speech RecognitionDecoder | —Unverified | 0 |
| InstructERC: Reforming Emotion Recognition in Conversation with Multi-task Retrieval-Augmented Large Language Models | Sep 21, 2023 | Emotion RecognitionEmotion Recognition in Conversation | CodeCode Available | 1 |
| Test-Time Training for Speech | Sep 19, 2023 | parameter-efficient fine-tuningSpeaker Identification | —Unverified | 0 |
| Spiking-LEAF: A Learnable Auditory front-end for Spiking Neural Networks | Sep 18, 2023 | Keyword SpottingSpeaker Identification | —Unverified | 0 |
| Understanding Self-Supervised Learning of Speech Representation via Invariance and Redundancy Reduction | Sep 7, 2023 | Keyword SpottingSelf-Supervised Learning | —Unverified | 0 |
| An Effective Transformer-based Contextual Model and Temporal Gate Pooling for Speaker Identification | Aug 22, 2023 | Self-Supervised LearningSpeaker Identification | CodeCode Available | 0 |
| Gammatonegram Representation for End-to-End Dysarthric Speech Processing Tasks: Speech Recognition, Speaker Identification, and Intelligibility Assessment | Jul 6, 2023 | Speaker Identificationspeech-recognition | CodeCode Available | 0 |
| Read, Look or Listen? What's Needed for Solving a Multimodal Dataset | Jul 6, 2023 | Question AnsweringSpeaker Identification | —Unverified | 0 |