| InstructERC: Reforming Emotion Recognition in Conversation with Multi-task Retrieval-Augmented Large Language Models | Sep 21, 2023 | Emotion Recognition, Emotion Recognition in Conversation | Code Available | 1 | 5 |
| Investigation of End-To-End Speaker-Attributed ASR for Continuous Multi-Talker Recordings | Aug 11, 2020 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 | 5 |
| AM-MobileNet1D: A Portable Model for Speaker Recognition | Mar 31, 2020 | Deep Learning, model | Code Available | 1 | 5 |
| End-to-End Chinese Speaker Identification | Jul 1, 2022 | coreference-resolution, Coreference Resolution | Code Available | 1 | 5 |
| ATST: Audio Representation Learning with Teacher-Student Transformer | Apr 26, 2022 | Audio Classification, Instrument Recognition | Code Available | 1 | 5 |
| A Modulation-Domain Loss for Neural-Network-based Real-time Speech Enhancement | Feb 15, 2021 | Speaker Identification, Speech Denoising | Code Available | 1 | 5 |
| AutoSpeech: Neural Architecture Search for Speaker Recognition | May 7, 2020 | image-classification, Image Classification | Code Available | 1 | 5 |
| FastAudio: A Learnable Audio Front-End for Spoof Speech Detection | Sep 6, 2021 | Speaker Identification, Speaker Verification | Code Available | 1 | 5 |
| FoolHD: Fooling speaker identification by Highly imperceptible adversarial Disturbances | Nov 17, 2020 | Adversarial Attack, Speaker Identification | Code Available | 1 | 5 |
| Blind Speech Separation and Dereverberation using Neural Beamforming | Mar 24, 2021 | Speaker Identification, Speaker Separation | Code Available | 1 | 5 |