| Post-Training Embedding Alignment for Decoupling Enrollment and Runtime Speaker Recognition Models | Jan 23, 2024 | Speaker IdentificationSpeaker Recognition | —Unverified | 0 | 0 |
| PEAF: Learnable Power Efficient Analog Acoustic Features for Audio Recognition | Oct 7, 2021 | Action DetectionActivity Detection | —Unverified | 0 | 0 |
| Pretraining Multi-Speaker Identification for Neural Speaker Diarization | May 30, 2025 | speaker-diarizationSpeaker Diarization | —Unverified | 0 | 0 |
| Privacy-Preserving Adversarial Representation Learning in ASR: Reality or Illusion? | Nov 12, 2019 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| Privacy-preserving Representation Learning for Speech Understanding | Oct 26, 2023 | ClassificationEmotion Recognition | —Unverified | 0 | 0 |
| Privacy-Utility Balanced Voice De-Identification Using Adversarial Examples | Nov 10, 2022 | De-identificationSpeaker Identification | —Unverified | 0 | 0 |
| Probing Self-supervised Learning Models with Target Speech Extraction | Feb 17, 2024 | Self-Supervised LearningSpeaker Identification | —Unverified | 0 | 0 |
| Progressive Residual Extraction based Pre-training for Speech Representation Learning | Aug 31, 2024 | Emotion RecognitionRepresentation Learning | —Unverified | 0 | 0 |
| QASR: QCRI Aljazeera Speech Resource -- A Large Scale Annotated Arabic Speech Corpus | Jun 24, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| QASR: QCRI Aljazeera Speech Resource A Large Scale Annotated Arabic Speech Corpus | Aug 1, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| Quantitative Evidence on Overlooked Aspects of Enrollment Speaker Embeddings for Target Speaker Separation | Oct 23, 2022 | Speaker IdentificationSpeaker Separation | —Unverified | 0 | 0 |
| Quantized Approximate Signal Processing (QASP): Towards Homomorphic Encryption for audio | May 15, 2025 | Speaker Identificationspeech-recognition | —Unverified | 0 | 0 |
| Read, Look or Listen? What's Needed for Solving a Multimodal Dataset | Jul 6, 2023 | Question AnsweringSpeaker Identification | —Unverified | 0 | 0 |
| Reducing audio membership inference attack accuracy to chance: 4 defenses | Oct 31, 2019 | Inference AttackMembership Inference Attack | —Unverified | 0 | 0 |
| Remarks on Optimal Scores for Speaker Recognition | Oct 10, 2020 | Speaker IdentificationSpeaker Recognition | —Unverified | 0 | 0 |
| Removing Speaker Information from Speech Representation using Variable-Length Soft Pooling | Apr 1, 2024 | Speaker IdentificationSpeech Synthesis | —Unverified | 0 | 0 |
| Representation Learning to Classify and Detect Adversarial Attacks against Speaker and Speech Recognition Systems | Jul 9, 2021 | Representation LearningSpeaker Identification | —Unverified | 0 | 0 |
| REWIND: Speech Time Reversal for Enhancing Speaker Representations in Diffusion-based Voice Conversion | May 27, 2025 | DisentanglementSpeaker Identification | —Unverified | 0 | 0 |
| Rhythm Features for Speaker Identification | Jun 7, 2025 | Deep LearningRhythm | —Unverified | 0 | 0 |
| Robust Speaker Recognition Using Speech Enhancement And Attention Model | Jan 14, 2020 | Speaker IdentificationSpeaker Recognition | —Unverified | 0 | 0 |
| SCDiar: a streaming diarization system based on speaker change detection and speech recognition | Jan 28, 2025 | Change Detectionspeaker-diarization | —Unverified | 0 | 0 |
| Security and Privacy Problems in Voice Assistant Applications: A Survey | Apr 19, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| Seeing Voices and Hearing Faces: Cross-modal biometric matching | Apr 1, 2018 | Face RecognitionSpeaker Identification | —Unverified | 0 | 0 |
| Significance of Chirp MFCC as a Feature in Speech and Audio Applications | Feb 19, 2024 | Music ClassificationSpeaker Identification | —Unverified | 0 | 0 |
| Silence is Sweeter Than Speech: Self-Supervised Model Using Silence to Store Speaker Information | May 8, 2022 | Self-Supervised LearningSpeaker Identification | —Unverified | 0 | 0 |
| 基於稀疏表示之語者識別 (Sparse Representation Based Speaker Identification) [In Chinese] | Oct 1, 2014 | Dimensionality ReductionSpeaker Identification | —Unverified | 0 | 0 |
| Speaker and Posture Classification using Instantaneous Intraspeech Breathing Features | May 25, 2020 | Action DetectionActivity Detection | —Unverified | 0 | 0 |
| Speaker attribution with voice profiles by graph-based semi-supervised learning | Feb 6, 2021 | Speaker Identification | —Unverified | 0 | 0 |
| Speaker Diarization and Identification from Single-Channel Classroom Audio Recording Using Virtual Microphones | Jul 1, 2022 | speaker-diarizationSpeaker Diarization | —Unverified | 0 | 0 |
| Speaker Fuzzy Fingerprints: Benchmarking Text-Based Identification in Multiparty Dialogues | Apr 21, 2025 | BenchmarkingSpeaker Identification | —Unverified | 0 | 0 |
| Speaker Identification Experiments Under Gender De-Identification | Mar 9, 2022 | De-identificationSpeaker Identification | —Unverified | 0 | 0 |
| Speaker Identification from emotional and noisy speech data using learned voice segregation and Speech VGG | Oct 23, 2022 | Speaker Identification | —Unverified | 0 | 0 |
| Speaker identification from the sound of the human breath | Dec 1, 2017 | Speaker IdentificationSpeaker Recognition | —Unverified | 0 | 0 |
| Speaker Identification From Youtube Obtained Data | Nov 11, 2014 | parameter estimationQuantization | —Unverified | 0 | 0 |
| Speaker Identification in each of the Neutral and Shouted Talking Environments based on Gender-Dependent Approach Using SPHMMs | Jun 29, 2017 | Speaker Identification | —Unverified | 0 | 0 |
| Speaker Identification using EEG | Mar 7, 2020 | EEGElectroencephalogram (EEG) | —Unverified | 0 | 0 |
| Speaker Identification using Speech Recognition | May 29, 2022 | Speaker Identificationspeech-recognition | —Unverified | 0 | 0 |
| Speaker Recognition in Bengali Language from Nonlinear Features | Apr 15, 2020 | Speaker IdentificationSpeaker Recognition | —Unverified | 0 | 0 |
| Meta-Learning Framework for End-to-End Imposter Identification in Unseen Speaker Recognition | Jun 1, 2023 | Meta-LearningSpeaker Identification | —Unverified | 0 | 0 |
| Speech Enhancement using Self-Adaptation and Multi-Head Self-Attention | Feb 14, 2020 | Multi-Task LearningSpeaker Identification | —Unverified | 0 | 0 |
| Speech-FT: Merging Pre-trained And Fine-Tuned Speech Representation Models For Cross-Task Generalization | Feb 18, 2025 | Automatic Speech RecognitionSpeaker Identification | —Unverified | 0 | 0 |
| Speech Rhythm-Based Speaker Embeddings Extraction from Phonemes and Phoneme Duration for Multi-Speaker Speech Synthesis | Feb 11, 2024 | RhythmSpeaker Identification | —Unverified | 0 | 0 |
| Speech Unlearning | Jun 1, 2025 | Adversarial RobustnessKeyword Spotting | —Unverified | 0 | 0 |
| Speech watermarking: an approach for the forensic analysis of digital telephonic recordings | Feb 23, 2022 | ArticlesSpeaker Identification | —Unverified | 0 | 0 |
| Spiking-LEAF: A Learnable Auditory front-end for Spiking Neural Networks | Sep 18, 2023 | Keyword SpottingSpeaker Identification | —Unverified | 0 | 0 |
| Story Comprehension for Predicting What Happens Next | Sep 1, 2017 | Common Sense ReasoningNatural Language Understanding | —Unverified | 0 | 0 |
| Strategies to Improve Robustness of Target Speech Extraction to Enrollment Variations | Jun 16, 2022 | Speaker IdentificationSpeech Extraction | —Unverified | 0 | 0 |
| Streaming Multi-talker Speech Recognition with Joint Speaker Identification | Apr 5, 2021 | Speaker Identificationspeech-recognition | —Unverified | 0 | 0 |
| Supervised Initialization of LSTM Networks for Fundamental Frequency Detection in Noisy Speech Signals | Nov 11, 2019 | Speaker Identification | —Unverified | 0 | 0 |
| Many-to-Many Voice Conversion with Out-of-Dataset Speaker Support | Apr 30, 2019 | Speaker IdentificationVoice Conversion | —Unverified | 0 | 0 |