| Speaker and Language Change Detection using Wav2vec2 and Whisper | Feb 18, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Masking Kernel for Learning Energy-Efficient Representations for Speaker Recognition and Mobile Health | Feb 8, 2023 | Speaker Recognition | CodeCode Available | 0 |
| Audio Representation Learning by Distilling Video as Privileged Information | Feb 6, 2023 | Emotion RecognitionKnowledge Distillation | —Unverified | 0 |
| Leveraging Speaker Embeddings with Adversarial Multi-task Learning for Age Group Classification | Jan 22, 2023 | Domain AdaptationMulti-Task Learning | —Unverified | 0 |
| A Multi-Purpose Audio-Visual Corpus for Multi-Modal Persian Speech Recognition: the Arman-AV Dataset | Jan 21, 2023 | Audio-Visual Speech RecognitionAutomatic Speech Recognition | —Unverified | 0 |
| The Newsbridge -Telecom SudParis VoxCeleb Speaker Recognition Challenge 2022 System Description | Jan 17, 2023 | Action DetectionActivity Detection | —Unverified | 0 |
| Introducing Model Inversion Attacks on Automatic Speaker Recognition | Jan 9, 2023 | modelSpeaker Recognition | —Unverified | 0 |
| SLUE Phase-2: A Benchmark Suite of Diverse Spoken Language Understanding Tasks | Dec 20, 2022 | Dialog Act ClassificationQuestion Answering | —Unverified | 0 |
| Probing Deep Speaker Embeddings for Speaker-related Tasks | Dec 14, 2022 | Speaker RecognitionSpeaker Verification | —Unverified | 0 |
| Inconsistency Ranking-based Noisy Label Detection for High-quality Data | Dec 1, 2022 | Metric LearningSpeaker Recognition | CodeCode Available | 0 |
| A Novel Speech Feature Fusion Algorithm for Text-Independent Speaker Recognition | Dec 1, 2022 | Speaker RecognitionText-Independent Speaker Recognition | —Unverified | 0 |
| A new Speech Feature Fusion method with cross gate parallel CNN for Speaker Recognition | Nov 24, 2022 | Speaker Recognition | —Unverified | 0 |
| Multi-source Domain Adaptation for Text-independent Forensic Speaker Recognition | Nov 17, 2022 | Domain AdaptationSpeaker Recognition | —Unverified | 0 |
| Is Style All You Need? Dependencies Between Emotion and GST-based Speaker Recognition | Nov 15, 2022 | AllEmotion Classification | CodeCode Available | 0 |
| Late Audio-Visual Fusion for In-The-Wild Speaker Diarization | Nov 2, 2022 | speaker-diarizationSpeaker Diarization | —Unverified | 0 |
| I4U System Description for NIST SRE'20 CTS Challenge | Nov 2, 2022 | Speaker Recognition | —Unverified | 0 |
| Disentangled representation learning for multilingual speaker recognition | Nov 1, 2022 | DisentanglementMetric Learning | —Unverified | 0 |
| Universal speaker recognition encoders for different speech segments duration | Oct 28, 2022 | Speaker RecognitionSpeaker Verification | —Unverified | 0 |
| Self-Supervised Training of Speaker Encoder with Multi-Modal Diverse Positive Pairs | Oct 27, 2022 | Contrastive LearningSelf-Supervised Learning | —Unverified | 0 |
| Improving Speech Representation Learning via Speech-level and Phoneme-level Masking Approach | Oct 25, 2022 | Representation LearningSpeaker Recognition | —Unverified | 0 |
| Large-scale learning of generalised representations for speaker recognition | Oct 20, 2022 | Inductive BiasSpeaker Recognition | —Unverified | 0 |
| Risk of re-identification for shared clinical speech recordings | Oct 18, 2022 | Speaker Recognition | CodeCode Available | 0 |
| SUPERB @ SLT 2022: Challenge on Generalization and Efficiency of Self-Supervised Speech Representation Learning | Oct 16, 2022 | Audio GenerationRepresentation Learning | —Unverified | 0 |
| THUEE system description for NIST 2020 SRE CTS challenge | Oct 12, 2022 | Speaker Recognition | —Unverified | 0 |
| The DKU-DukeECE Diarization System for the VoxCeleb Speaker Recognition Challenge 2022 | Oct 4, 2022 | Action DetectionActivity Detection | —Unverified | 0 |