| Towards Making the Most of Dialogue Characteristics for Neural Chat Translation | Sep 2, 2021 | Machine TranslationResponse Generation | CodeCode Available | 0 | 5 |
| Unsupervised Speech Representation Pooling Using Vector Quantization | Apr 8, 2023 | Emotion Recognitionintent-classification | CodeCode Available | 0 | 5 |
| Contrastive Learning of General-Purpose Audio Representations | Oct 21, 2020 | CoLAContrastive Learning | CodeCode Available | 0 | 5 |
| Deep Speaker: an End-to-End Neural Speaker Embedding System | May 5, 2017 | ClusteringSpeaker Identification | CodeCode Available | 0 | 5 |
| Towards Speaker Identification with Minimal Dataset and Constrained Resources using 1D-Convolution Neural Network | Nov 22, 2024 | Data AugmentationSpeaker Identification | CodeCode Available | 0 | 5 |
| Cross-Lingual Speaker Identification Using Distant Supervision | Oct 11, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Compositional embedding models for speaker identification and diarization with simultaneous speech from 2+ speakers | Oct 22, 2020 | speaker-diarizationSpeaker Diarization | CodeCode Available | 0 | 5 |
| Compositional Clustering: Applications to Multi-Label Object Recognition and Speaker Identification | Sep 9, 2021 | ClusteringFew-Shot Learning | CodeCode Available | 0 | 5 |
| Friends-MMC: A Dataset for Multi-modal Multi-party Conversation Understanding | Dec 23, 2024 | Speaker Identification | CodeCode Available | 0 | 5 |
| PF-Net: Personalized Filter for Speaker Recognition from Raw Waveform | May 31, 2021 | Speaker IdentificationSpeaker Recognition | CodeCode Available | 0 | 5 |
| SIG: Speaker Identification in Literature via Prompt-Based Generation | Dec 22, 2023 | Speaker Identification | CodeCode Available | 0 | 5 |
| Word-level Embeddings for Cross-Task Transfer Learning in Speech Processing | Oct 22, 2019 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 | 5 |
| On Learning Associations of Faces and Voices | May 15, 2018 | Speaker Identification | CodeCode Available | 0 | 5 |
| CoLMbo: Speaker Language Model for Descriptive Profiling | Jun 11, 2025 | DescriptiveLanguage Modeling | CodeCode Available | 0 | 5 |
| Masked Modeling Duo: Towards a Universal Audio Pre-training Framework | Apr 9, 2024 | Audio Classification | CodeCode Available | 0 | 5 |
| A Generative Product-of-Filters Model of Audio | Dec 20, 2013 | modelSpeaker Identification | CodeCode Available | 0 | 5 |
| EVI: Multilingual Spoken Dialogue Tasks and Dataset for Knowledge-Based Enrolment, Verification, and Identification | Apr 28, 2022 | Speaker IdentificationSpeaker Verification | CodeCode Available | 0 | 5 |
| Just ASR + LLM? A Study on Speech Large Language Models' Ability to Identify and Understand Speaker in Spoken Dialogue | Sep 7, 2024 | Question AnsweringSpeaker Identification | CodeCode Available | 0 | 5 |
| Latent space representation for multi-target speaker detection and identification with a sparse dataset using Triplet neural networks | Oct 1, 2019 | Speaker IdentificationSpeaker Recognition | CodeCode Available | 0 | 5 |
| Identifying Speakers in Dialogue Transcripts: A Text-based Approach Using Pretrained Language Models | Jul 16, 2024 | AttributeSpeaker Identification | CodeCode Available | 0 | 5 |
| A domain-agnostic approach for opinion prediction on speech | Dec 1, 2016 | Emotion RecognitionFeature Engineering | CodeCode Available | 0 | 5 |
| Identify Speakers in Cocktail Parties with End-to-End Attention | May 22, 2020 | Speaker IdentificationSpeech Separation | CodeCode Available | 0 | 5 |
| Masked Modeling Duo: Learning Representations by Encouraging Both Networks to Model the Input | Oct 26, 2022 | Audio ClassificationAudio Tagging | CodeCode Available | 0 | 5 |
| PL-EESR: Perceptual Loss Based END-TO-END Robust Speaker Representation Extraction | Oct 3, 2021 | Speaker IdentificationSpeaker Verification | CodeCode Available | 0 | 5 |
| End-to-End Diarization for Variable Number of Speakers with Local-Global Networks and Discriminative Speaker Embeddings | May 5, 2021 | ClusteringSpeaker Identification | —Unverified | 0 | 0 |
| Emirati-Accented Speaker Identification in Stressful Talking Conditions | Sep 28, 2019 | Speaker Identification | —Unverified | 0 | 0 |
| Efficiency-oriented approaches for self-supervised speech representation learning | Dec 18, 2023 | Automatic Speech RecognitionRepresentation Learning | —Unverified | 0 | 0 |
| A user study to compare two conversational assistants designed for people with hearing impairments | Jun 1, 2019 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| Effect of utterance duration and phonetic content on speaker identification using second-order statistical methods | Feb 26, 2024 | Speaker Identification | —Unverified | 0 | 0 |
| Discrimination between Similar Languages, Varieties and Dialects using CNN- and LSTM-based Deep Neural Networks | Dec 1, 2016 | Dialect IdentificationInformation Retrieval | —Unverified | 0 | 0 |
| A Multi Level Data Fusion Approach for Speaker Identification on Telephone Speech | Jun 27, 2014 | Speaker Identification | —Unverified | 0 | 0 |
| Advances in Online Audio-Visual Meeting Transcription | Dec 10, 2019 | Sound Source Localizationspeaker-diarization | —Unverified | 0 | 0 |
| Deep versus Wide: An Analysis of Student Architectures for Task-Agnostic Knowledge Distillation of Self-Supervised Speech Models | Jul 14, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| Deep Neural Networks for Automatic Speech Processing: A Survey from Large Corpora to Limited Data | Mar 9, 2020 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| DASB -- Discrete Audio and Speech Benchmark | Jun 20, 2024 | BenchmarkingEmotion Recognition | —Unverified | 0 | 0 |
| Curie: A method for protecting SVM Classifier from Poisoning Attack | Jun 5, 2016 | BIG-bench Machine LearningSpeaker Identification | —Unverified | 0 | 0 |
| A Toolkit for Joint Speaker Diarization and Identification with Application to Speaker-Attributed ASR | Sep 9, 2024 | Automatic Speech Recognitionspeaker-diarization | —Unverified | 0 | 0 |
| Hypothesis Stitcher for End-to-End Speaker-attributed ASR on Long-form Multi-talker Recordings | Jan 6, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| HPP-Voice: A Large-Scale Evaluation of Speech Embeddings for Multi-Phenotypic Classification | May 22, 2025 | speaker-diarizationSpeaker Diarization | —Unverified | 0 | 0 |
| Cross-Lingual Speaker Identification from Weak Local Evidence | Jan 16, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| A Survey on Paralinguistics in Tamil Speech Processing | Apr 1, 2021 | Emotion RecognitionSpeaker Identification | —Unverified | 0 | 0 |
| Advanced Rich Transcription System for Estonian Speech | Jan 11, 2019 | Speaker Identification | —Unverified | 0 | 0 |
| Adaptive blind audio source extraction supervised by dominant speaker identification using x-vectors | Oct 25, 2019 | Speaker Identification | —Unverified | 0 | 0 |
| How Redundant Is the Transformer Stack in Speech Representation Models? | Sep 10, 2024 | Knowledge DistillationSpeaker Identification | —Unverified | 0 | 0 |
| How Far Are We from Robust Voice Conversion: A Survey | Nov 24, 2020 | Speaker IdentificationSurvey | —Unverified | 0 | 0 |
| H-VECTORS: Utterance-level Speaker Embedding Using A Hierarchical Attention Model | Oct 17, 2019 | Speaker Identification | —Unverified | 0 | 0 |
| Cosine similarity-based adversarial process | Jul 1, 2019 | Speaker Identification | —Unverified | 0 | 0 |
| Identification of Speakers in Novels | Aug 1, 2013 | Speaker Identification | —Unverified | 0 | 0 |
| Identifying Source Speakers for Voice Conversion based Spoofing Attacks on Speaker Verification Systems | Jun 18, 2022 | Speaker IdentificationSpeaker Verification | —Unverified | 0 | 0 |
| Histogram Transform-based Speaker Identification | Aug 2, 2018 | Speaker Identification | —Unverified | 0 | 0 |