| A Study on Bias and Fairness In Deep Speaker Recognition | Mar 14, 2023 | FairnessSpeaker Recognition | —Unverified | 0 |
| Self-FiLM: Conditioning GANs with self-supervised representations for bandwidth extension based speaker recognition | Mar 7, 2023 | Bandwidth ExtensionSpeaker Recognition | —Unverified | 0 |
| Speaker Recognition in Realistic Scenario Using Multimodal Data | Feb 25, 2023 | Speaker Recognition | —Unverified | 0 |
| A Reinforcement Learning Framework for Online Speaker Diarization | Feb 21, 2023 | Decision MakingDomain Adaptation | —Unverified | 0 |
| Interpretable Spectrum Transformation Attacks to Speaker Recognition | Feb 21, 2023 | Speaker Recognition | —Unverified | 0 |
| VoxSRC 2022: The Fourth VoxCeleb Speaker Recognition Challenge | Feb 20, 2023 | Speaker DiarizationSpeaker Recognition | CodeCode Available | 1 |
| Probabilistic Back-ends for Online Speaker Recognition and Clustering | Feb 19, 2023 | ClusteringOnline Clustering | CodeCode Available | 1 |
| Speaker and Language Change Detection using Wav2vec2 and Whisper | Feb 18, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| TAPLoss: A Temporal Acoustic Parameter Loss for Speech Enhancement | Feb 16, 2023 | Speaker RecognitionSpeech Enhancement | CodeCode Available | 1 |
| Masking Kernel for Learning Energy-Efficient Representations for Speaker Recognition and Mobile Health | Feb 8, 2023 | Speaker Recognition | CodeCode Available | 0 |
| Audio Representation Learning by Distilling Video as Privileged Information | Feb 6, 2023 | Emotion RecognitionKnowledge Distillation | —Unverified | 0 |
| Leveraging Speaker Embeddings with Adversarial Multi-task Learning for Age Group Classification | Jan 22, 2023 | Domain AdaptationMulti-Task Learning | —Unverified | 0 |
| A Multi-Purpose Audio-Visual Corpus for Multi-Modal Persian Speech Recognition: the Arman-AV Dataset | Jan 21, 2023 | Audio-Visual Speech RecognitionAutomatic Speech Recognition | —Unverified | 0 |
| The Newsbridge -Telecom SudParis VoxCeleb Speaker Recognition Challenge 2022 System Description | Jan 17, 2023 | Action DetectionActivity Detection | —Unverified | 0 |
| OLKAVS: An Open Large-Scale Korean Audio-Visual Speech Dataset | Jan 16, 2023 | Audio-Visual Speech RecognitionLip Reading | CodeCode Available | 1 |
| Introducing Model Inversion Attacks on Automatic Speaker Recognition | Jan 9, 2023 | modelSpeaker Recognition | —Unverified | 0 |
| SLUE Phase-2: A Benchmark Suite of Diverse Spoken Language Understanding Tasks | Dec 20, 2022 | Dialog Act ClassificationQuestion Answering | —Unverified | 0 |
| Probing Deep Speaker Embeddings for Speaker-related Tasks | Dec 14, 2022 | Speaker RecognitionSpeaker Verification | —Unverified | 0 |
| A Novel Speech Feature Fusion Algorithm for Text-Independent Speaker Recognition | Dec 1, 2022 | Speaker RecognitionText-Independent Speaker Recognition | —Unverified | 0 |
| Inconsistency Ranking-based Noisy Label Detection for High-quality Data | Dec 1, 2022 | Metric LearningSpeaker Recognition | CodeCode Available | 0 |
| A new Speech Feature Fusion method with cross gate parallel CNN for Speaker Recognition | Nov 24, 2022 | Speaker Recognition | —Unverified | 0 |
| Multi-source Domain Adaptation for Text-independent Forensic Speaker Recognition | Nov 17, 2022 | Domain AdaptationSpeaker Recognition | —Unverified | 0 |
| Is Style All You Need? Dependencies Between Emotion and GST-based Speaker Recognition | Nov 15, 2022 | AllEmotion Classification | CodeCode Available | 0 |
| Late Audio-Visual Fusion for In-The-Wild Speaker Diarization | Nov 2, 2022 | speaker-diarizationSpeaker Diarization | —Unverified | 0 |
| I4U System Description for NIST SRE'20 CTS Challenge | Nov 2, 2022 | Speaker Recognition | —Unverified | 0 |