| Symmetric Saliency-based Adversarial Attack To Speaker Identification | Oct 30, 2022 | Adversarial AttackDecoder | —Unverified | 0 | 0 |
| Test-Time Training for Speech | Sep 19, 2023 | parameter-efficient fine-tuningSpeaker Identification | —Unverified | 0 | 0 |
| Text-based Speaker Identification on Multiparty Dialogues Using Multi-document Convolutional Neural Networks | Jul 1, 2017 | Speaker IdentificationSpeech Recognition | —Unverified | 0 | 0 |
| Text Independent Speaker Identification System for Access Control | Sep 26, 2022 | Speaker Identification | —Unverified | 0 | 0 |
| The Deterministic plus Stochastic Model of the Residual Signal and its Applications | Dec 29, 2019 | Speaker IdentificationSpeech Synthesis | —Unverified | 0 | 0 |
| The DIRHA simulated corpus | May 1, 2014 | Dialogue ManagementDistant Speech Recognition | —Unverified | 0 | 0 |
| The exploitation of Multiple Feature Extraction Techniques for Speaker Identification in Emotional States under Disguised Voices | Dec 15, 2021 | Speaker IdentificationVoice Conversion | —Unverified | 0 | 0 |
| SoK: The Faults in our ASRs: An Overview of Attacks against Automatic Speech Recognition and Speaker Identification Systems | Jul 13, 2020 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| The RATS Collection: Supporting HLT Research with Degraded Audio Data | May 1, 2014 | Action DetectionActivity Detection | —Unverified | 0 | 0 |
| TIMIT Speaker Profiling: A Comparison of Multi-task learning and Single-task learning Approaches | Apr 18, 2024 | Age EstimationClassification | —Unverified | 0 | 0 |
| Towards Advanced Speech Signal Processing: A Statistical Perspective on Convolution-Based Architectures and its Applications | Nov 20, 2024 | Emotion RecognitionSpeaker Identification | —Unverified | 0 | 0 |
| Transcribe-to-Diarize: Neural Speaker Diarization for Unlimited Number of Speakers using End-to-End Speaker-Attributed ASR | Oct 7, 2021 | Action DetectionActivity Detection | —Unverified | 0 | 0 |
| Triplet loss based embeddings for forensic speaker identification in Spanish | Feb 24, 2021 | Speaker IdentificationTriplet | —Unverified | 0 | 0 |
| T-vectors: Weakly Supervised Speaker Identification Using Hierarchical Transformer Model | Oct 29, 2020 | Speaker Identification | —Unverified | 0 | 0 |
| Understanding Self-Supervised Learning of Speech Representation via Invariance and Redundancy Reduction | Sep 7, 2023 | Keyword SpottingSelf-Supervised Learning | —Unverified | 0 | 0 |
| Unraveling Adversarial Examples against Speaker Identification -- Techniques for Attack Detection and Victim Model Classification | Feb 29, 2024 | Adversarial AttackClassification | —Unverified | 0 | 0 |
| VAST: A Corpus of Video Annotation for Speech Technologies | May 1, 2018 | Action DetectionLanguage Identification | —Unverified | 0 | 0 |
| VFHQ: A High-Quality Dataset and Benchmark for Video Face Super-Resolution | May 6, 2022 | BenchmarkingSpeaker Identification | —Unverified | 0 | 0 |
| Voice Privacy with Smart Digital Assistants in Educational Settings | Mar 24, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| Voxceleb-ESP: preliminary experiments detecting Spanish celebrities from their voices | Dec 20, 2023 | Speaker IdentificationSpeaker Recognition | —Unverified | 0 | 0 |
| VoxWatch: An open-set speaker recognition benchmark on VoxCeleb | Jun 30, 2023 | Speaker IdentificationSpeaker Recognition | —Unverified | 0 | 0 |
| WaBERT: A Low-resource End-to-end Model for Spoken Language Understanding and Speech-to-BERT Alignment | Apr 22, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| Weakly Supervised Training of Hierarchical Attention Networks for Speaker Identification | May 15, 2020 | Speaker Identification | —Unverified | 0 | 0 |
| Weakly Supervised Training of Speaker Identification Models | Jun 22, 2018 | speaker-diarizationSpeaker Diarization | —Unverified | 0 | 0 |
| Supervised Speaker Embedding De-Mixing in Two-Speaker Environment | Jan 14, 2020 | Speaker IdentificationVocal Bursts Valence Prediction | —Unverified | 0 | 0 |
| A Closer Look at Wav2Vec2 Embeddings for On-Device Single-Channel Speech Enhancement | Mar 3, 2024 | Automatic Speech RecognitionKeyword Spotting | —Unverified | 0 | 0 |
| Adaptive blind audio source extraction supervised by dominant speaker identification using x-vectors | Oct 25, 2019 | Speaker Identification | —Unverified | 0 | 0 |
| Advanced accent/dialect identification and accentedness assessment with multi-embedding models and automatic speech recognition | Oct 17, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| Advanced Rich Transcription System for Estonian Speech | Jan 11, 2019 | Speaker Identification | —Unverified | 0 | 0 |
| Advances in Online Audio-Visual Meeting Transcription | Dec 10, 2019 | Sound Source Localizationspeaker-diarization | —Unverified | 0 | 0 |
| AdvEst: Adversarial Perturbation Estimation to Classify and Detect Adversarial Attacks against Speaker Identification | Apr 8, 2022 | Representation LearningSpeaker Identification | —Unverified | 0 | 0 |
| A Joint Model for Quotation Attribution and Coreference Resolution | Apr 1, 2014 | coreference-resolutionCoreference Resolution | —Unverified | 0 | 0 |
| A Lightweight Speaker Recognition System Using Timbre Properties | Oct 12, 2020 | GPUSpeaker Identification | —Unverified | 0 | 0 |
| A Multi Level Data Fusion Approach for Speaker Identification on Telephone Speech | Jun 27, 2014 | Speaker Identification | —Unverified | 0 | 0 |
| A Novel Minimum Divergence Approach to Robust Speaker Identification | Dec 16, 2015 | General ClassificationSpeaker Identification | —Unverified | 0 | 0 |
| An Unsupervised Speaker Clustering Technique based on SOM and I-vectors for Speech Recognition Systems | Apr 1, 2017 | Automatic Speech Recognition (ASR)Clustering | —Unverified | 0 | 0 |
| 基於聽覺感知模型之類神經網路及其在語者識別上之應用 (Two-stage Attentional Auditory Model Inspired Neural Network and Its Application to Speaker Identification) [In Chinese] | Nov 1, 2017 | Speaker Identification | —Unverified | 0 | 0 |
| A Preliminary Exploration with GPT-4o Voice Mode | Feb 14, 2025 | Age ClassificationAudio Deepfake Detection | —Unverified | 0 | 0 |
| A Real-time Speaker Diarization System Based on Spatial Spectrum | Jul 20, 2021 | speaker-diarizationSpeaker Diarization | —Unverified | 0 | 0 |
| A Study of Acoustic Features in Arabic Speaker Identification under Noisy Environmental Conditions | Oct 23, 2021 | Speaker Identification | —Unverified | 0 | 0 |
| A Study of Few-Shot Audio Classification | Dec 2, 2020 | Audio ClassificationBIG-bench Machine Learning | —Unverified | 0 | 0 |
| A Survey on Paralinguistics in Tamil Speech Processing | Apr 1, 2021 | Emotion RecognitionSpeaker Identification | —Unverified | 0 | 0 |
| A Toolkit for Joint Speaker Diarization and Identification with Application to Speaker-Attributed ASR | Sep 9, 2024 | Automatic Speech Recognitionspeaker-diarization | —Unverified | 0 | 0 |
| A user study to compare two conversational assistants designed for people with hearing impairments | Jun 1, 2019 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| Target Speech Extraction: Independent Vector Extraction Guided by Supervised Speaker Identification | Nov 5, 2021 | Speaker IdentificationSpeech Extraction | —Unverified | 0 | 0 |
| Can Musical Emotion Be Quantified With Neural Jitter Or Shimmer? A Novel EEG Based Study With Hindustani Classical Music | Apr 29, 2017 | EEGElectroencephalogram (EEG) | —Unverified | 0 | 0 |
| CASA-Based Speaker Identification Using Cascaded GMM-CNN Classifier in Noisy and Emotional Talking Conditions | Feb 11, 2021 | Emotion RecognitionSpeaker Identification | —Unverified | 0 | 0 |
| Characteristic-Specific Partial Fine-Tuning for Efficient Emotion and Speaker Adaptation in Codec Language Text-to-Speech Models | Jan 24, 2025 | Emotion ClassificationSpeaker Identification | —Unverified | 0 | 0 |
| Comparison of Gender- and Speaker-adaptive Emotion Recognition | May 1, 2014 | AttributeEmotion Classification | —Unverified | 0 | 0 |
| Comparison of Multiple Features and Modeling Methods for Text-dependent Speaker Verification | Jul 14, 2017 | Speaker IdentificationSpeaker Recognition | —Unverified | 0 | 0 |