| Non-uniform Speaker Disentanglement For Depression Detection From Raw Speech Signals | Jun 2, 2023 | Depression DetectionDisentanglement | CodeCode Available | 1 |
| MPCHAT: Towards Multimodal Persona-Grounded Conversation | May 27, 2023 | Speaker Identification | CodeCode Available | 1 |
| GIFT: Graph-Induced Fine-Tuning for Multi-Party Conversation Understanding | May 16, 2023 | Speaker Identification | CodeCode Available | 1 |
| ASiT: Local-Global Audio Spectrogram vIsion Transformer for Event Classification | Nov 23, 2022 | Keyword SpottingSelf-Supervised Learning | CodeCode Available | 1 |
| MelHuBERT: A simplified HuBERT on Mel spectrograms | Nov 17, 2022 | Automatic Speech RecognitionSelf-Supervised Learning | CodeCode Available | 1 |
| IndicSUPERB: A Speech Processing Universal Performance Benchmark for Indian languages | Aug 24, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |
| Masked Autoencoders that Listen | Jul 13, 2022 | Audio ClassificationDecoder | CodeCode Available | 1 |
| End-to-End Chinese Speaker Identification | Jul 1, 2022 | coreference-resolutionCoreference Resolution | CodeCode Available | 1 |
| Extended U-Net for Speaker Verification in Noisy Environments | Jun 27, 2022 | DenoisingSpeaker Identification | CodeCode Available | 1 |
| ATST: Audio Representation Learning with Teacher-Student Transformer | Apr 26, 2022 | Audio ClassificationInstrument Recognition | CodeCode Available | 1 |