| Attention-based multi-task learning for speech-enhancement and speaker-identification in multi-speaker dialogue scenario | Jan 7, 2021 | Multi-Task LearningSpeaker Identification | CodeCode Available | 0 | 5 |
| Cross-Lingual Speaker Identification Using Distant Supervision | Oct 11, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| PL-EESR: Perceptual Loss Based END-TO-END Robust Speaker Representation Extraction | Oct 3, 2021 | Speaker IdentificationSpeaker Verification | CodeCode Available | 0 | 5 |
| Deep Speaker: an End-to-End Neural Speaker Embedding System | May 5, 2017 | ClusteringSpeaker Identification | CodeCode Available | 0 | 5 |
| Latent space representation for multi-target speaker detection and identification with a sparse dataset using Triplet neural networks | Oct 1, 2019 | Speaker IdentificationSpeaker Recognition | CodeCode Available | 0 | 5 |
| Contrastive Learning of General-Purpose Audio Representations | Oct 21, 2020 | CoLAContrastive Learning | CodeCode Available | 0 | 5 |
| Just ASR + LLM? A Study on Speech Large Language Models' Ability to Identify and Understand Speaker in Spoken Dialogue | Sep 7, 2024 | Question AnsweringSpeaker Identification | CodeCode Available | 0 | 5 |
| Compositional embedding models for speaker identification and diarization with simultaneous speech from 2+ speakers | Oct 22, 2020 | speaker-diarizationSpeaker Diarization | CodeCode Available | 0 | 5 |
| Compositional Clustering: Applications to Multi-Label Object Recognition and Speaker Identification | Sep 9, 2021 | ClusteringFew-Shot Learning | CodeCode Available | 0 | 5 |
| Identify Speakers in Cocktail Parties with End-to-End Attention | May 22, 2020 | Speaker IdentificationSpeech Separation | CodeCode Available | 0 | 5 |