| A Comparative Evaluation of Deep Learning Models for Speech Enhancement in Real-World Noisy Environments | Jun 17, 2025 | Denoising, Speaker Recognition | —Unverified | 0 | 0 |
| DeepMSRF: A novel Deep Multimodal Speaker Recognition framework with Feature selection | Jul 14, 2020 | feature selection, Speaker Recognition | —Unverified | 0 | 0 |
| LEAP System for SRE19 CTS Challenge -- Improvements and Error Analysis | Feb 7, 2020 | Speaker Recognition, Speaker Verification | —Unverified | 0 | 0 |
| Deep learning methods in speaker recognition: a review | Nov 14, 2019 | Deep Learning, Speaker Recognition | —Unverified | 0 | 0 |
| Deep Learning for Single and Multi-Session i-Vector Speaker Recognition | Dec 8, 2015 | Speaker Recognition, speech-recognition | —Unverified | 0 | 0 |
| Large-scale learning of generalised representations for speaker recognition | Oct 20, 2022 | Inductive Bias, Speaker Recognition | —Unverified | 0 | 0 |
| Automatic Speech Recognition on a Firefighter TETRA Broadcast Channel | May 1, 2012 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| A new Speech Feature Fusion method with cross gate parallel CNN for Speaker Recognition | Nov 24, 2022 | Speaker Recognition | —Unverified | 0 | 0 |
| Deep factorization for speech signal | Feb 27, 2018 | Emotion Recognition, Speaker Recognition | —Unverified | 0 | 0 |
| Deep CNN based feature extractor for text-prompted speaker recognition | Mar 13, 2018 | Speaker Recognition, Speaker Verification | —Unverified | 0 | 0 |
| A Unified Deep Neural Network for Speaker and Language Recognition | Apr 3, 2015 | Domain Adaptation, Speaker Recognition | —Unverified | 0 | 0 |
| Attention and DCT based Global Context Modeling for Text-independent Speaker Recognition | Aug 4, 2022 | Speaker Recognition, Speaker Verification | —Unverified | 0 | 0 |
| Data augmentation versus noise compensation for x-vector speaker recognition systems in noisy environments | Jun 29, 2020 | Data Augmentation, Denoising | —Unverified | 0 | 0 |
| Augmentation adversarial training for self-supervised speaker recognition | Jul 23, 2020 | Contrastive Learning, Speaker Recognition | —Unverified | 0 | 0 |
| An Ensemble SVM-based Approach for Voice Activity Detection | Feb 5, 2019 | Action Detection, Activity Detection | —Unverified | 0 | 0 |
| Adversarial Speaker Verification | Apr 29, 2019 | General Classification, Speaker Recognition | —Unverified | 0 | 0 |
| Investigation of Using VAE for i-Vector Speaker Verification | May 25, 2017 | Speaker Recognition, Speaker Verification | —Unverified | 0 | 0 |
| Investigation of Speaker Representation for Target-Speaker Speech Processing | Oct 15, 2024 | Action Detection, Activity Detection | —Unverified | 0 | 0 |
| Investigating the Reasonable Effectiveness of Speaker Pre-Trained Models and their Synergistic Power for SingMOS Prediction | Jun 2, 2025 | Speaker Recognition | —Unverified | 0 | 0 |
| Cross-modal Speaker Verification and Recognition: A Multilingual Perspective | Apr 28, 2020 | Speaker Recognition, Speaker Verification | —Unverified | 0 | 0 |
| Audio-visual Speaker Recognition with a Cross-modal Discriminative Network | Aug 10, 2020 | Speaker Recognition | —Unverified | 0 | 0 |
| Investigating Prosodic Signatures via Speech Pre-Trained Models for Audio Deepfake Source Attribution | Dec 23, 2024 | Audio Deepfake Detection, DeepFake Detection | —Unverified | 0 | 0 |
| Introduction to Voice Presentation Attack Detection and Recent Advances | Jan 4, 2019 | Benchmarking, Speaker Recognition | —Unverified | 0 | 0 |
| iQIYI-VID: A Large Dataset for Multi-modal Person Identification | Nov 19, 2018 | Face Recognition, Multi-Modal Person Identification | —Unverified | 0 | 0 |
| Introducing Model Inversion Attacks on Automatic Speaker Recognition | Jan 9, 2023 | model, Speaker Recognition | —Unverified | 0 | 0 |
| 結合I-Vector 及深層神經網路之語者驗證系統 (Text-independent Speaker Verification using a Hybrid I-Vector/DNN Approach) [In Chinese] | Oct 1, 2013 | Action Detection, Speaker Recognition | —Unverified | 0 | 0 |
| Joint Probabilistic Linear Discriminant Analysis | Apr 7, 2017 | Speaker Recognition | —Unverified | 0 | 0 |
| Joint Sound Source Separation and Speaker Recognition | Apr 29, 2016 | blind source separation, Speaker Recognition | —Unverified | 0 | 0 |
| JukeBox: A Multilingual Singer Recognition Dataset | Aug 8, 2020 | Speaker Recognition, Text-Independent Speaker Recognition | —Unverified | 0 | 0 |
| Fine-grained Early Frequency Attention for Deep Speaker Representation Learning | Sep 3, 2020 | Deep Learning, Emotion Recognition | —Unverified | 0 | 0 |
| KU-ISPL Speaker Recognition Systems under Language mismatch condition for NIST 2016 Speaker Recognition Evaluation | Feb 3, 2017 | Clustering, Speaker Recognition | —Unverified | 0 | 0 |
| Language Modelling for Speaker Diarization in Telephonic Interviews | Jan 28, 2025 | Acoustic Modelling, Language Modelling | —Unverified | 0 | 0 |
| Interpretable Spectrum Transformation Attacks to Speaker Recognition | Feb 21, 2023 | Speaker Recognition | —Unverified | 0 | 0 |
| LASPA: Language Agnostic Speaker Disentanglement with Prefix-Tuned Cross-Attention | Jun 2, 2025 | Anatomy, Disentanglement | —Unverified | 0 | 0 |
| Audio-to-Image Encoding for Improved Voice Characteristic Detection Using Deep Convolutional Neural Networks | Mar 7, 2025 | Speaker Recognition | —Unverified | 0 | 0 |
| LDC Language Resource Database: Building a Bibliographic Database | May 1, 2012 | Information Retrieval, Machine Translation | —Unverified | 0 | 0 |
| An Effortless Way To Create Large-Scale Datasets For Famous Speakers | May 1, 2014 | Person Identification, Speaker Diarization | —Unverified | 0 | 0 |
| Learning Speaker-Invariant Visual Features for Lipreading | Jun 9, 2025 | Disentanglement, Lipreading | —Unverified | 0 | 0 |
| Length- and Noise-aware Training Techniques for Short-utterance Speaker Recognition | Aug 27, 2020 | Representation Learning, Speaker Recognition | —Unverified | 0 | 0 |
| Influence of Mother Tongue on English Accent | Dec 1, 2014 | Language Identification, Speaker Recognition | —Unverified | 0 | 0 |
| Incorporation of Speech Duration Information in Score Fusion of Speaker Recognition Systems | Aug 7, 2016 | Speaker Recognition, Speaker Verification | —Unverified | 0 | 0 |
| Leveraging Speaker Embeddings with Adversarial Multi-task Learning for Age Group Classification | Jan 22, 2023 | Domain Adaptation, Multi-Task Learning | —Unverified | 0 | 0 |
| Likelihood-ratio calibration using prior-weighted proper scoring rules | Jul 30, 2013 | regression, scoring rule | —Unverified | 0 | 0 |
| Long-Term Conversation Analysis: Privacy-Utility Trade-off under Noise and Reverberation | Aug 1, 2024 | Action Detection, Activity Detection | —Unverified | 0 | 0 |
| Cosine Scoring with Uncertainty for Neural Speaker Embedding | Mar 11, 2024 | Speaker Recognition | —Unverified | 0 | 0 |
| Machine Speech Chain with One-shot Speaker Adaptation | Mar 28, 2018 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| Improving Speech Representation Learning via Speech-level and Phoneme-level Masking Approach | Oct 25, 2022 | Representation Learning, Speaker Recognition | —Unverified | 0 | 0 |
| Improving Noise Robustness In Speaker Identification Using A Two-Stage Attention Model | Sep 24, 2019 | Speaker Identification, Speaker Recognition | —Unverified | 0 | 0 |
| CopyPaste: An Augmentation Method for Speech Emotion Recognition | Oct 27, 2020 | Data Augmentation, Emotion Recognition | —Unverified | 0 | 0 |
| Audio Representation Learning by Distilling Video as Privileged Information | Feb 6, 2023 | Emotion Recognition, Knowledge Distillation | —Unverified | 0 | 0 |