| A Study on Bias and Fairness In Deep Speaker Recognition | Mar 14, 2023 | Fairness, Speaker Recognition | Unverified | 0 |
| Self-FiLM: Conditioning GANs with self-supervised representations for bandwidth extension based speaker recognition | Mar 7, 2023 | Bandwidth Extension, Speaker Recognition | Unverified | 0 |
| Speaker Recognition in Realistic Scenario Using Multimodal Data | Feb 25, 2023 | Speaker Recognition | Unverified | 0 |
| A Reinforcement Learning Framework for Online Speaker Diarization | Feb 21, 2023 | Decision Making, Domain Adaptation | Unverified | 0 |
| Interpretable Spectrum Transformation Attacks to Speaker Recognition | Feb 21, 2023 | Speaker Recognition | Unverified | 0 |
| VoxSRC 2022: The Fourth VoxCeleb Speaker Recognition Challenge | Feb 20, 2023 | Speaker Diarization, Speaker Recognition | Code Available | 1 |
| Probabilistic Back-ends for Online Speaker Recognition and Clustering | Feb 19, 2023 | Clustering, Online Clustering | Code Available | 1 |
| Speaker and Language Change Detection using Wav2vec2 and Whisper | Feb 18, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| TAPLoss: A Temporal Acoustic Parameter Loss for Speech Enhancement | Feb 16, 2023 | Speaker Recognition, Speech Enhancement | Code Available | 1 |
| Masking Kernel for Learning Energy-Efficient Representations for Speaker Recognition and Mobile Health | Feb 8, 2023 | Speaker Recognition | Code Available | 0 |
| Audio Representation Learning by Distilling Video as Privileged Information | Feb 6, 2023 | Emotion Recognition, Knowledge Distillation | Unverified | 0 |
| Leveraging Speaker Embeddings with Adversarial Multi-task Learning for Age Group Classification | Jan 22, 2023 | Domain Adaptation, Multi-Task Learning | Unverified | 0 |
| A Multi-Purpose Audio-Visual Corpus for Multi-Modal Persian Speech Recognition: the Arman-AV Dataset | Jan 21, 2023 | Audio-Visual Speech Recognition, Automatic Speech Recognition | Unverified | 0 |
| The Newsbridge -Telecom SudParis VoxCeleb Speaker Recognition Challenge 2022 System Description | Jan 17, 2023 | Action Detection, Activity Detection | Unverified | 0 |
| OLKAVS: An Open Large-Scale Korean Audio-Visual Speech Dataset | Jan 16, 2023 | Audio-Visual Speech Recognition, Lip Reading | Code Available | 1 |
| Introducing Model Inversion Attacks on Automatic Speaker Recognition | Jan 9, 2023 | model, Speaker Recognition | Unverified | 0 |
| SLUE Phase-2: A Benchmark Suite of Diverse Spoken Language Understanding Tasks | Dec 20, 2022 | Dialog Act Classification, Question Answering | Unverified | 0 |
| Probing Deep Speaker Embeddings for Speaker-related Tasks | Dec 14, 2022 | Speaker Recognition, Speaker Verification | Unverified | 0 |
| A Novel Speech Feature Fusion Algorithm for Text-Independent Speaker Recognition | Dec 1, 2022 | Speaker Recognition, Text-Independent Speaker Recognition | Unverified | 0 |
| Inconsistency Ranking-based Noisy Label Detection for High-quality Data | Dec 1, 2022 | Metric Learning, Speaker Recognition | Code Available | 0 |
| A new Speech Feature Fusion method with cross gate parallel CNN for Speaker Recognition | Nov 24, 2022 | Speaker Recognition | Unverified | 0 |
| Multi-source Domain Adaptation for Text-independent Forensic Speaker Recognition | Nov 17, 2022 | Domain Adaptation, Speaker Recognition | Unverified | 0 |
| Is Style All You Need? Dependencies Between Emotion and GST-based Speaker Recognition | Nov 15, 2022 | All, Emotion Classification | Code Available | 0 |
| Late Audio-Visual Fusion for In-The-Wild Speaker Diarization | Nov 2, 2022 | speaker-diarization, Speaker Diarization | Unverified | 0 |
| I4U System Description for NIST SRE'20 CTS Challenge | Nov 2, 2022 | Speaker Recognition | Unverified | 0 |
| Disentangled representation learning for multilingual speaker recognition | Nov 1, 2022 | Disentanglement, Metric Learning | Unverified | 0 |
| Speaker recognition with two-step multi-modal deep cleansing | Oct 28, 2022 | Representation Learning, Speaker Recognition | Code Available | 1 |
| Universal speaker recognition encoders for different speech segments duration | Oct 28, 2022 | Speaker Recognition, Speaker Verification | Unverified | 0 |
| Self-Supervised Training of Speaker Encoder with Multi-Modal Diverse Positive Pairs | Oct 27, 2022 | Contrastive Learning, Self-Supervised Learning | Unverified | 0 |
| Toroidal Probabilistic Spherical Discriminant Analysis | Oct 27, 2022 | Form, Speaker Recognition | Code Available | 1 |
| Improving Speech Representation Learning via Speech-level and Phoneme-level Masking Approach | Oct 25, 2022 | Representation Learning, Speaker Recognition | Unverified | 0 |
| Large-scale learning of generalised representations for speaker recognition | Oct 20, 2022 | Inductive Bias, Speaker Recognition | Unverified | 0 |
| Risk of re-identification for shared clinical speech recordings | Oct 18, 2022 | Speaker Recognition | Code Available | 0 |
| SUPERB @ SLT 2022: Challenge on Generalization and Efficiency of Self-Supervised Speech Representation Learning | Oct 16, 2022 | Audio Generation, Representation Learning | Unverified | 0 |
| THUEE system description for NIST 2020 SRE CTS challenge | Oct 12, 2022 | Speaker Recognition | Unverified | 0 |
| The DKU-DukeECE Diarization System for the VoxCeleb Speaker Recognition Challenge 2022 | Oct 4, 2022 | Action Detection, Activity Detection | Unverified | 0 |
| The Kriston AI System for the VoxCeleb Speaker Recognition Challenge 2022 | Sep 23, 2022 | Action Detection, Activity Detection | Unverified | 0 |
| The SpeakIn System Description for CNSRC2022 | Sep 22, 2022 | Retrieval, Speaker Recognition | Unverified | 0 |
| GIST-AiTeR System for the Diarization Task of the 2022 VoxCeleb Speaker Recognition Challenge | Sep 21, 2022 | Action Detection, Activity Detection | Unverified | 0 |
| The ReturnZero System for VoxCeleb Speaker Recognition Challenge 2022 | Sep 21, 2022 | Speaker Recognition, Speaker Verification | Unverified | 0 |
| The Royalflush System for VoxCeleb Speaker Recognition Challenge 2022 | Sep 19, 2022 | Clustering, Domain Adaptation | Unverified | 0 |
| A Benchmark for Understanding and Generating Dialogue between Characters in Stories | Sep 18, 2022 | Dialogue Generation, Speaker Recognition | Unverified | 0 |
| Disentangled Speaker Representation Learning via Mutual Information Minimization | Aug 17, 2022 | Disentanglement, Representation Learning | Unverified | 0 |
| Attention and DCT based Global Context Modeling for Text-independent Speaker Recognition | Aug 4, 2022 | Speaker Recognition, Speaker Verification | Unverified | 0 |
| Perception-Aware Attack: Creating Adversarial Music via Reverse-Engineering Human Perception | Jul 26, 2022 | Adversarial Attack, Speaker Recognition | Unverified | 0 |
| Graph-based Multi-View Fusion and Local Adaptation: Mitigating Within-Household Confusability for Speaker Identification | Jul 8, 2022 | Fairness, Speaker Identification | Unverified | 0 |
| A Hierarchical Speaker Representation Framework for One-shot Singing Voice Conversion | Jun 28, 2022 | Speaker Recognition, Voice Conversion | Unverified | 0 |
| Towards End-to-End Private Automatic Speaker Recognition | Jun 23, 2022 | Privacy Preserving, Speaker Recognition | Unverified | 0 |
| Towards Understanding and Mitigating Audio Adversarial Examples for Speaker Recognition | Jun 7, 2022 | Speaker Recognition, speech-recognition | Code Available | 1 |
| AS2T: Arbitrary Source-To-Target Adversarial Attack on Speaker Recognition Systems | Jun 7, 2022 | Adversarial Attack, Speaker Recognition | Unverified | 0 |