| VAST: A Corpus of Video Annotation for Speech Technologies | May 1, 2018 | Action Detection, Language Identification | —Unverified | 0 |
| VFHQ: A High-Quality Dataset and Benchmark for Video Face Super-Resolution | May 6, 2022 | Benchmarking, Speaker Identification | —Unverified | 0 |
| Voice Privacy with Smart Digital Assistants in Educational Settings | Mar 24, 2021 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | —Unverified | 0 |
| Voxceleb-ESP: preliminary experiments detecting Spanish celebrities from their voices | Dec 20, 2023 | Speaker Identification, Speaker Recognition | —Unverified | 0 |
| VoxWatch: An open-set speaker recognition benchmark on VoxCeleb | Jun 30, 2023 | Speaker Identification, Speaker Recognition | —Unverified | 0 |
| WaBERT: A Low-resource End-to-end Model for Spoken Language Understanding and Speech-to-BERT Alignment | Apr 22, 2022 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | —Unverified | 0 |
| Weakly Supervised Training of Hierarchical Attention Networks for Speaker Identification | May 15, 2020 | Speaker Identification | —Unverified | 0 |
| Weakly Supervised Training of Speaker Identification Models | Jun 22, 2018 | speaker-diarization, Speaker Diarization | —Unverified | 0 |
| Matics Software Suite: New Tools for Evaluation and Data Exploration | May 1, 2018 | Optical Character Recognition (OCR), Speaker Diarization | —Unverified | 0 |
| MKPLS: Manifold Kernel Partial Least Squares for Lipreading and Speaker Identification | Jun 1, 2013 | Lipreading, Speaker Identification | —Unverified | 0 |
| Monaural Multi-Talker Speech Recognition using Factorial Speech Processing Models | Oct 5, 2016 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | —Unverified | 0 |
| Multi-Label Training for Text-Independent Speaker Identification | Nov 14, 2022 | Ensemble Learning, Speaker Identification | —Unverified | 0 |
| Multi-Modal Emotion Detection with Transfer Learning | Nov 13, 2020 | Speaker Identification, Transfer Learning | —Unverified | 0 |
| Multi-Task Learning with Auxiliary Speaker Identification for Conversational Emotion Recognition | Mar 3, 2020 | Emotion Recognition in Conversation, Multi-Task Learning | —Unverified | 0 |
| NeuraGen-A Low-Resource Neural Network based approach for Gender Classification | Mar 29, 2022 | Gender Classification, Speaker Identification | —Unverified | 0 |
| Hearing-Loss Compensation Using Deep Neural Networks: A Framework and Results From a Listening Test | Mar 15, 2024 | Music Classification, Speaker Identification | —Unverified | 0 |
| Neural Predictive Coding using Convolutional Neural Networks towards Unsupervised Learning of Speaker Characteristics | Feb 22, 2018 | Speaker Identification, Speaker Recognition | —Unverified | 0 |
| On the relevance of bandwidth extension for speaker identification | Feb 24, 2022 | Bandwidth Extension, Speaker Identification | —Unverified | 0 |
| On the Transferability of Whisper-based Representations for "In-the-Wild" Cross-Task Downstream Speech Applications | May 23, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | —Unverified | 0 |
| On the Use of Different Feature Extraction Methods for Linear and Non Linear kernels | Jun 27, 2014 | Robust Speech Recognition, Speaker Identification | —Unverified | 0 |
| openFEAT: Improving Speaker Identification by Open-set Few-shot Embedding Adaptation with Transformer | Feb 24, 2022 | Open Set Learning, Speaker Identification | —Unverified | 0 |
| Ordered and Binary Speaker Embedding | May 25, 2023 | Clustering, Retrieval | —Unverified | 0 |
| Tubes Among Us: Analog Attack on Automatic Speaker Identification | Feb 6, 2022 | BIG-bench Machine Learning, Speaker Identification | —Unverified | 0 |
| PolInterviews -- A Dataset of German Politician Public Broadcast Interviews | Jan 8, 2025 | Speaker Identification | —Unverified | 0 |
| Post-Training Embedding Alignment for Decoupling Enrollment and Runtime Speaker Recognition Models | Jan 23, 2024 | Speaker Identification, Speaker Recognition | —Unverified | 0 |
| PEAF: Learnable Power Efficient Analog Acoustic Features for Audio Recognition | Oct 7, 2021 | Action Detection, Activity Detection | —Unverified | 0 |
| Pretraining Multi-Speaker Identification for Neural Speaker Diarization | May 30, 2025 | speaker-diarization, Speaker Diarization | —Unverified | 0 |
| Privacy-Preserving Adversarial Representation Learning in ASR: Reality or Illusion? | Nov 12, 2019 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | —Unverified | 0 |
| Privacy-preserving Representation Learning for Speech Understanding | Oct 26, 2023 | Classification, Emotion Recognition | —Unverified | 0 |
| Privacy-Utility Balanced Voice De-Identification Using Adversarial Examples | Nov 10, 2022 | De-identification, Speaker Identification | —Unverified | 0 |
| Probing Self-supervised Learning Models with Target Speech Extraction | Feb 17, 2024 | Self-Supervised Learning, Speaker Identification | —Unverified | 0 |
| Progressive Residual Extraction based Pre-training for Speech Representation Learning | Aug 31, 2024 | Emotion Recognition, Representation Learning | —Unverified | 0 |
| QASR: QCRI Aljazeera Speech Resource -- A Large Scale Annotated Arabic Speech Corpus | Jun 24, 2021 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | —Unverified | 0 |
| QASR: QCRI Aljazeera Speech Resource A Large Scale Annotated Arabic Speech Corpus | Aug 1, 2021 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | —Unverified | 0 |
| Quantitative Evidence on Overlooked Aspects of Enrollment Speaker Embeddings for Target Speaker Separation | Oct 23, 2022 | Speaker Identification, Speaker Separation | —Unverified | 0 |
| Quantized Approximate Signal Processing (QASP): Towards Homomorphic Encryption for audio | May 15, 2025 | Speaker Identification, speech-recognition | —Unverified | 0 |
| Read, Look or Listen? What's Needed for Solving a Multimodal Dataset | Jul 6, 2023 | Question Answering, Speaker Identification | —Unverified | 0 |
| Reducing audio membership inference attack accuracy to chance: 4 defenses | Oct 31, 2019 | Inference Attack, Membership Inference Attack | —Unverified | 0 |
| Remarks on Optimal Scores for Speaker Recognition | Oct 10, 2020 | Speaker Identification, Speaker Recognition | —Unverified | 0 |
| Removing Speaker Information from Speech Representation using Variable-Length Soft Pooling | Apr 1, 2024 | Speaker Identification, Speech Synthesis | —Unverified | 0 |
| Representation Learning to Classify and Detect Adversarial Attacks against Speaker and Speech Recognition Systems | Jul 9, 2021 | Representation Learning, Speaker Identification | —Unverified | 0 |
| REWIND: Speech Time Reversal for Enhancing Speaker Representations in Diffusion-based Voice Conversion | May 27, 2025 | Disentanglement, Speaker Identification | —Unverified | 0 |
| Rhythm Features for Speaker Identification | Jun 7, 2025 | Deep Learning, Rhythm | —Unverified | 0 |
| Robust Speaker Recognition Using Speech Enhancement And Attention Model | Jan 14, 2020 | Speaker Identification, Speaker Recognition | —Unverified | 0 |
| SCDiar: a streaming diarization system based on speaker change detection and speech recognition | Jan 28, 2025 | Change Detection, speaker-diarization | —Unverified | 0 |
| Security and Privacy Problems in Voice Assistant Applications: A Survey | Apr 19, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | —Unverified | 0 |
| Seeing Voices and Hearing Faces: Cross-modal biometric matching | Apr 1, 2018 | Face Recognition, Speaker Identification | —Unverified | 0 |
| Significance of Chirp MFCC as a Feature in Speech and Audio Applications | Feb 19, 2024 | Music Classification, Speaker Identification | —Unverified | 0 |
| Silence is Sweeter Than Speech: Self-Supervised Model Using Silence to Store Speaker Information | May 8, 2022 | Self-Supervised Learning, Speaker Identification | —Unverified | 0 |
| 基於稀疏表示之語者識別 (Sparse Representation Based Speaker Identification) [In Chinese] | Oct 1, 2014 | Dimensionality Reduction, Speaker Identification | —Unverified | 0 |