| Speaker and Posture Classification using Instantaneous Intraspeech Breathing Features | May 25, 2020 | Action Detection, Activity Detection | Unverified | 0 |
| Speaker attribution with voice profiles by graph-based semi-supervised learning | Feb 6, 2021 | Speaker Identification | Unverified | 0 |
| Speaker Diarization and Identification from Single-Channel Classroom Audio Recording Using Virtual Microphones | Jul 1, 2022 | speaker-diarization, Speaker Diarization | Unverified | 0 |
| Speaker Fuzzy Fingerprints: Benchmarking Text-Based Identification in Multiparty Dialogues | Apr 21, 2025 | Benchmarking, Speaker Identification | Unverified | 0 |
| Speaker Identification Experiments Under Gender De-Identification | Mar 9, 2022 | De-identification, Speaker Identification | Unverified | 0 |
| Speaker Identification from emotional and noisy speech data using learned voice segregation and Speech VGG | Oct 23, 2022 | Speaker Identification | Unverified | 0 |
| Speaker identification from the sound of the human breath | Dec 1, 2017 | Speaker Identification, Speaker Recognition | Unverified | 0 |
| Speaker Identification From Youtube Obtained Data | Nov 11, 2014 | parameter estimation, Quantization | Unverified | 0 |
| Speaker Identification in each of the Neutral and Shouted Talking Environments based on Gender-Dependent Approach Using SPHMMs | Jun 29, 2017 | Speaker Identification | Unverified | 0 |
| Speaker Identification using EEG | Mar 7, 2020 | EEG, Electroencephalogram (EEG) | Unverified | 0 |
| Speaker Identification using Speech Recognition | May 29, 2022 | Speaker Identification, speech-recognition | Unverified | 0 |
| Speaker Recognition in Bengali Language from Nonlinear Features | Apr 15, 2020 | Speaker Identification, Speaker Recognition | Unverified | 0 |
| Meta-Learning Framework for End-to-End Imposter Identification in Unseen Speaker Recognition | Jun 1, 2023 | Meta-Learning, Speaker Identification | Unverified | 0 |
| Speech Enhancement using Self-Adaptation and Multi-Head Self-Attention | Feb 14, 2020 | Multi-Task Learning, Speaker Identification | Unverified | 0 |
| Speech-FT: Merging Pre-trained And Fine-Tuned Speech Representation Models For Cross-Task Generalization | Feb 18, 2025 | Automatic Speech Recognition, Speaker Identification | Unverified | 0 |
| Speech Rhythm-Based Speaker Embeddings Extraction from Phonemes and Phoneme Duration for Multi-Speaker Speech Synthesis | Feb 11, 2024 | Rhythm, Speaker Identification | Unverified | 0 |
| Speech Unlearning | Jun 1, 2025 | Adversarial Robustness, Keyword Spotting | Unverified | 0 |
| Speech watermarking: an approach for the forensic analysis of digital telephonic recordings | Feb 23, 2022 | Articles, Speaker Identification | Unverified | 0 |
| Masked Modeling Duo: Towards a Universal Audio Pre-training Framework | Apr 9, 2024 | Audio Classification | Code Available | 0 |
| Unsupervised Speech Representation Pooling Using Vector Quantization | Apr 8, 2023 | Emotion Recognition, intent-classification | Code Available | 0 |
| Masked Modeling Duo: Learning Representations by Encouraging Both Networks to Model the Input | Oct 26, 2022 | Audio Classification, Audio Tagging | Code Available | 0 |
| Latent space representation for multi-target speaker detection and identification with a sparse dataset using Triplet neural networks | Oct 1, 2019 | Speaker Identification, Speaker Recognition | Code Available | 0 |
| Just ASR + LLM? A Study on Speech Large Language Models' Ability to Identify and Understand Speaker in Spoken Dialogue | Sep 7, 2024 | Question Answering, Speaker Identification | Code Available | 0 |
| SIG: Speaker Identification in Literature via Prompt-Based Generation | Dec 22, 2023 | Speaker Identification | Code Available | 0 |
| Deep Learning for Speaker Identification: Architectural Insights from AB-1 Corpus Analysis and Performance Evaluation | Aug 13, 2024 | Speaker Identification | Code Available | 0 |
| Identify Speakers in Cocktail Parties with End-to-End Attention | May 22, 2020 | Speaker Identification, Speech Separation | Code Available | 0 |
| Identifying Speakers in Dialogue Transcripts: A Text-based Approach Using Pretrained Language Models | Jul 16, 2024 | Attribute, Speaker Identification | Code Available | 0 |
| Delving into VoxCeleb: environment invariant speaker recognition | Oct 24, 2019 | Speaker Identification, Speaker Recognition | Code Available | 0 |
| CoLMbo: Speaker Language Model for Descriptive Profiling | Jun 11, 2025 | Descriptive, Language Modeling | Code Available | 0 |
| Audio ALBERT: A Lite BERT for Self-supervised Learning of Audio Representation | May 18, 2020 | Self-Supervised Learning, Speaker Identification | Code Available | 0 |
| Cross-Lingual Speaker Identification Using Distant Supervision | Oct 11, 2022 | Language Modeling, Language Modelling | Code Available | 0 |
| A domain-agnostic approach for opinion prediction on speech | Dec 1, 2016 | Emotion Recognition, Feature Engineering | Code Available | 0 |
| Contrastive Learning of General-Purpose Audio Representations | Oct 21, 2020 | CoLA, Contrastive Learning | Code Available | 0 |
| Gammatonegram Representation for End-to-End Dysarthric Speech Processing Tasks: Speech Recognition, Speaker Identification, and Intelligibility Assessment | Jul 6, 2023 | Speaker Identification, speech-recognition | Code Available | 0 |
| On Learning Associations of Faces and Voices | May 15, 2018 | Speaker Identification | Code Available | 0 |
| Towards Making the Most of Dialogue Characteristics for Neural Chat Translation | Sep 2, 2021 | Machine Translation, Response Generation | Code Available | 0 |
| Word-level Embeddings for Cross-Task Transfer Learning in Speech Processing | Oct 22, 2019 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0 |
| PF-Net: Personalized Filter for Speaker Recognition from Raw Waveform | May 31, 2021 | Speaker Identification, Speaker Recognition | Code Available | 0 |
| Attention-based multi-task learning for speech-enhancement and speaker-identification in multi-speaker dialogue scenario | Jan 7, 2021 | Multi-Task Learning, Speaker Identification | Code Available | 0 |
| Towards Speaker Identification with Minimal Dataset and Constrained Resources using 1D-Convolution Neural Network | Nov 22, 2024 | Data Augmentation, Speaker Identification | Code Available | 0 |
| Friends-MMC: A Dataset for Multi-modal Multi-party Conversation Understanding | Dec 23, 2024 | Speaker Identification | Code Available | 0 |
| Compositional embedding models for speaker identification and diarization with simultaneous speech from 2+ speakers | Oct 22, 2020 | speaker-diarization, Speaker Diarization | Code Available | 0 |
| PL-EESR: Perceptual Loss Based END-TO-END Robust Speaker Representation Extraction | Oct 3, 2021 | Speaker Identification, Speaker Verification | Code Available | 0 |
| Compositional Clustering: Applications to Multi-Label Object Recognition and Speaker Identification | Sep 9, 2021 | Clustering, Few-Shot Learning | Code Available | 0 |
| EVI: Multilingual Spoken Dialogue Tasks and Dataset for Knowledge-Based Enrolment, Verification, and Identification | Apr 28, 2022 | Speaker Identification, Speaker Verification | Code Available | 0 |
| An Effective Transformer-based Contextual Model and Temporal Gate Pooling for Speaker Identification | Aug 22, 2023 | Self-Supervised Learning, Speaker Identification | Code Available | 0 |
| Deep Speaker: an End-to-End Neural Speaker Embedding System | May 5, 2017 | Clustering, Speaker Identification | Code Available | 0 |
| A Generative Product-of-Filters Model of Audio | Dec 20, 2013 | model, Speaker Identification | Code Available | 0 |