| Speaker and Language Change Detection using Wav2vec2 and Whisper | Feb 18, 2023 | Automatic Speech Recognition (ASR) | Unverified | 0 |
| Masking Kernel for Learning Energy-Efficient Representations for Speaker Recognition and Mobile Health | Feb 8, 2023 | Speaker Recognition | Code Available | 0 |
| Audio Representation Learning by Distilling Video as Privileged Information | Feb 6, 2023 | Emotion Recognition, Knowledge Distillation | Unverified | 0 |
| Leveraging Speaker Embeddings with Adversarial Multi-task Learning for Age Group Classification | Jan 22, 2023 | Domain Adaptation, Multi-Task Learning | Unverified | 0 |
| A Multi-Purpose Audio-Visual Corpus for Multi-Modal Persian Speech Recognition: the Arman-AV Dataset | Jan 21, 2023 | Audio-Visual Speech Recognition, Automatic Speech Recognition | Unverified | 0 |
| The Newsbridge -Telecom SudParis VoxCeleb Speaker Recognition Challenge 2022 System Description | Jan 17, 2023 | Action Detection, Activity Detection | Unverified | 0 |
| Introducing Model Inversion Attacks on Automatic Speaker Recognition | Jan 9, 2023 | model, Speaker Recognition | Unverified | 0 |
| SLUE Phase-2: A Benchmark Suite of Diverse Spoken Language Understanding Tasks | Dec 20, 2022 | Dialog Act Classification, Question Answering | Unverified | 0 |
| Probing Deep Speaker Embeddings for Speaker-related Tasks | Dec 14, 2022 | Speaker Recognition, Speaker Verification | Unverified | 0 |
| A Novel Speech Feature Fusion Algorithm for Text-Independent Speaker Recognition | Dec 1, 2022 | Speaker Recognition, Text-Independent Speaker Recognition | Unverified | 0 |
| Inconsistency Ranking-based Noisy Label Detection for High-quality Data | Dec 1, 2022 | Metric Learning, Speaker Recognition | Code Available | 0 |
| A new Speech Feature Fusion method with cross gate parallel CNN for Speaker Recognition | Nov 24, 2022 | Speaker Recognition | Unverified | 0 |
| Multi-source Domain Adaptation for Text-independent Forensic Speaker Recognition | Nov 17, 2022 | Domain Adaptation, Speaker Recognition | Unverified | 0 |
| Is Style All You Need? Dependencies Between Emotion and GST-based Speaker Recognition | Nov 15, 2022 | All, Emotion Classification | Code Available | 0 |
| Late Audio-Visual Fusion for In-The-Wild Speaker Diarization | Nov 2, 2022 | Speaker Diarization | Unverified | 0 |
| I4U System Description for NIST SRE'20 CTS Challenge | Nov 2, 2022 | Speaker Recognition | Unverified | 0 |
| Disentangled representation learning for multilingual speaker recognition | Nov 1, 2022 | Disentanglement, Metric Learning | Unverified | 0 |
| Universal speaker recognition encoders for different speech segments duration | Oct 28, 2022 | Speaker Recognition, Speaker Verification | Unverified | 0 |
| Self-Supervised Training of Speaker Encoder with Multi-Modal Diverse Positive Pairs | Oct 27, 2022 | Contrastive Learning, Self-Supervised Learning | Unverified | 0 |
| Improving Speech Representation Learning via Speech-level and Phoneme-level Masking Approach | Oct 25, 2022 | Representation Learning, Speaker Recognition | Unverified | 0 |
| Large-scale learning of generalised representations for speaker recognition | Oct 20, 2022 | Inductive Bias, Speaker Recognition | Unverified | 0 |
| Risk of re-identification for shared clinical speech recordings | Oct 18, 2022 | Speaker Recognition | Code Available | 0 |
| SUPERB @ SLT 2022: Challenge on Generalization and Efficiency of Self-Supervised Speech Representation Learning | Oct 16, 2022 | Audio Generation, Representation Learning | Unverified | 0 |
| THUEE system description for NIST 2020 SRE CTS challenge | Oct 12, 2022 | Speaker Recognition | Unverified | 0 |
| The DKU-DukeECE Diarization System for the VoxCeleb Speaker Recognition Challenge 2022 | Oct 4, 2022 | Action Detection, Activity Detection | Unverified | 0 |
| The Kriston AI System for the VoxCeleb Speaker Recognition Challenge 2022 | Sep 23, 2022 | Action Detection, Activity Detection | Unverified | 0 |
| The SpeakIn System Description for CNSRC2022 | Sep 22, 2022 | Retrieval, Speaker Recognition | Unverified | 0 |
| The ReturnZero System for VoxCeleb Speaker Recognition Challenge 2022 | Sep 21, 2022 | Speaker Recognition, Speaker Verification | Unverified | 0 |
| GIST-AiTeR System for the Diarization Task of the 2022 VoxCeleb Speaker Recognition Challenge | Sep 21, 2022 | Action Detection, Activity Detection | Unverified | 0 |
| The Royalflush System for VoxCeleb Speaker Recognition Challenge 2022 | Sep 19, 2022 | Clustering, Domain Adaptation | Unverified | 0 |
| A Benchmark for Understanding and Generating Dialogue between Characters in Stories | Sep 18, 2022 | Dialogue Generation, Speaker Recognition | Unverified | 0 |
| Disentangled Speaker Representation Learning via Mutual Information Minimization | Aug 17, 2022 | Disentanglement, Representation Learning | Unverified | 0 |
| Attention and DCT based Global Context Modeling for Text-independent Speaker Recognition | Aug 4, 2022 | Speaker Recognition, Speaker Verification | Unverified | 0 |
| Perception-Aware Attack: Creating Adversarial Music via Reverse-Engineering Human Perception | Jul 26, 2022 | Adversarial Attack, Speaker Recognition | Unverified | 0 |
| Graph-based Multi-View Fusion and Local Adaptation: Mitigating Within-Household Confusability for Speaker Identification | Jul 8, 2022 | Fairness, Speaker Identification | Unverified | 0 |
| A Hierarchical Speaker Representation Framework for One-shot Singing Voice Conversion | Jun 28, 2022 | Speaker Recognition, Voice Conversion | Unverified | 0 |
| Towards End-to-End Private Automatic Speaker Recognition | Jun 23, 2022 | Privacy Preserving, Speaker Recognition | Unverified | 0 |
| AS2T: Arbitrary Source-To-Target Adversarial Attack on Speaker Recognition Systems | Jun 7, 2022 | Adversarial Attack, Speaker Recognition | Unverified | 0 |
| WeCanTalk: A New Multi-language, Multi-modal Resource for Speaker Recognition | Jun 1, 2022 | Speaker Recognition | Unverified | 0 |
| Far-Field Speaker Recognition Benchmark Derived From The DiPCo Corpus | Jun 1, 2022 | Denoising, Speaker Recognition | Unverified | 0 |
| Dynamic Recognition of Speakers for Consent Management by Contrastive Embedding Replay | May 17, 2022 | Contrastive Learning, Inductive Bias | Unverified | 0 |
| Baselines and Protocols for Household Speaker Recognition | Apr 30, 2022 | Speaker Recognition | Code Available | 0 |
| Why does Self-Supervised Learning for Speech Recognition Benefit Speaker Recognition? | Apr 27, 2022 | Self-Supervised Learning, Speaker Recognition | Unverified | 0 |
| Graph Convolutional Network Based Semi-Supervised Learning on Multi-Speaker Meeting Data | Apr 25, 2022 | Clustering, Speaker Recognition | Unverified | 0 |
| The 2021 NIST Speaker Recognition Evaluation | Apr 21, 2022 | Data Augmentation, Face Recognition | Unverified | 0 |
| The NIST CTS Speaker Recognition Challenge | Apr 21, 2022 | Data Augmentation, Speaker Recognition | Unverified | 0 |
| Disentangled Speech Representation Learning Based on Factorized Hierarchical Variational Autoencoder with Self-Supervised Objective | Apr 5, 2022 | Disentanglement, Representation Learning | Unverified | 0 |
| Robust Speaker Recognition with Transformers Using wav2vec 2.0 | Mar 28, 2022 | Data Augmentation, Representation Learning | Unverified | 0 |
| Curriculum learning for self-supervised speaker verification | Mar 28, 2022 | Self-Supervised Learning, Speaker Recognition | Unverified | 0 |
| To train or not to train adversarially: A study of bias mitigation strategies for speaker recognition | Mar 17, 2022 | Face Recognition, Fairness | Code Available | 0 |