| PaddleSpeech: An Easy-to-Use All-in-One Speech Toolkit | May 20, 2022 | AllAutomatic Speech Recognition (ASR) | CodeCode Available | 6 |
| VoxBlink2: A 100K+ Speaker Recognition Corpus and the Open-Set Speaker-Identification Benchmark | Jul 16, 2024 | DiversitySpeaker Identification | CodeCode Available | 5 |
| Magnitude-aware Probabilistic Speaker Embeddings | Feb 28, 2022 | Out-of-Distribution DetectionSpeaker Verification | CodeCode Available | 3 |
| Pushing the limits of raw waveform speaker recognition | Mar 16, 2022 | Self-Supervised LearningSpeaker Recognition | CodeCode Available | 3 |
| Ludwig: a type-based declarative deep learning toolbox | Sep 17, 2019 | DecoderDeep Learning | CodeCode Available | 3 |
| ESPnet-SPK: full pipeline speaker embedding toolkit with reproducible recipes, self-supervised front-ends, and off-the-shelf models | Jan 30, 2024 | Self-Supervised LearningSpeaker Recognition | CodeCode Available | 3 |
| SALMONN: Towards Generic Hearing Abilities for Large Language Models | Oct 20, 2023 | Audio captioningAutomatic Speech Recognition | CodeCode Available | 3 |
| Golden Gemini is All You Need: Finding the Sweet Spots for Speaker Verification | Dec 6, 2023 | AllSpeaker Verification | CodeCode Available | 3 |
| Towards A Unified Conformer Structure: from ASR to ASV Task | Nov 14, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 2 |
| Singer Identity Representation Learning using Self-Supervised Techniques | Jan 10, 2024 | Domain GeneralizationRepresentation Learning | CodeCode Available | 2 |
| Learning Lip-Based Audio-Visual Speaker Embeddings with AV-HuBERT | May 15, 2022 | Representation LearningSpeaker Verification | CodeCode Available | 2 |
| u-HuBERT: Unified Mixed-Modal Speech Pretraining And Zero-Shot Transfer to Unlabeled Modality | Jul 14, 2022 | Speaker Verificationspeech-recognition | CodeCode Available | 2 |
| Crossed-Time Delay Neural Network for Speaker Recognition | May 31, 2020 | Speaker RecognitionSpeaker Verification | CodeCode Available | 1 |
| A^3T: Alignment-Aware Acoustic and Text Pretraining for Speech Synthesis and Editing | Mar 18, 2022 | Representation LearningSpeaker Verification | CodeCode Available | 1 |
| Cross-modal Audio-visual Co-learning for Text-independent Speaker Verification | Feb 22, 2023 | Speaker VerificationText-Independent Speaker Verification | CodeCode Available | 1 |
| Channel-wise Gated Res2Net: Towards Robust Detection of Synthetic Speech Attacks | Jul 19, 2021 | Speaker Verification | CodeCode Available | 1 |
| Bts-e: Audio deepfake detection using breathing-talking-silence encoder | May 5, 2023 | Audio Deepfake DetectionDeepFake Detection | CodeCode Available | 1 |
| Cross-Age Speaker Verification: Learning Age-Invariant Speaker Embeddings | Jul 13, 2022 | Age EstimationSpeaker Verification | CodeCode Available | 1 |
| Cross-modal information fusion for voice spoofing detection | Feb 1, 2023 | Automatic Speech Recognitionfake voice detection | CodeCode Available | 1 |
| Attention Back-end for Automatic Speaker Verification with Multiple Enrollment Utterances | Apr 4, 2021 | Speaker Verification | CodeCode Available | 1 |
| Attack on practical speaker verification system using universal adversarial perturbations | May 19, 2021 | Real-World Adversarial AttackRoom Impulse Response (RIR) | CodeCode Available | 1 |
| Audio Spoofing Verification using Deep Convolutional Neural Networks by Transfer Learning | Aug 8, 2020 | Speaker VerificationTransfer Learning | CodeCode Available | 1 |
| Backdoor Attack against Speaker Verification | Oct 22, 2020 | Backdoor AttackClustering | CodeCode Available | 1 |
| Bias in Automated Speaker Recognition | Jan 24, 2022 | BIG-bench Machine LearningFace Recognition | CodeCode Available | 1 |
| ASVspoof 2019: Future Horizons in Spoofed and Fake Audio Detection | Apr 14, 2019 | Speaker Verification | CodeCode Available | 1 |