| PaddleSpeech: An Easy-to-Use All-in-One Speech Toolkit | May 20, 2022 | AllAutomatic Speech Recognition (ASR) | CodeCode Available | 6 | 5 |
| VoxBlink2: A 100K+ Speaker Recognition Corpus and the Open-Set Speaker-Identification Benchmark | Jul 16, 2024 | DiversitySpeaker Identification | CodeCode Available | 5 | 5 |
| Pushing the limits of raw waveform speaker recognition | Mar 16, 2022 | Self-Supervised LearningSpeaker Recognition | CodeCode Available | 3 | 5 |
| Ludwig: a type-based declarative deep learning toolbox | Sep 17, 2019 | DecoderDeep Learning | CodeCode Available | 3 | 5 |
| ESPnet-SPK: full pipeline speaker embedding toolkit with reproducible recipes, self-supervised front-ends, and off-the-shelf models | Jan 30, 2024 | Self-Supervised LearningSpeaker Recognition | CodeCode Available | 3 | 5 |
| SALMONN: Towards Generic Hearing Abilities for Large Language Models | Oct 20, 2023 | Audio captioningAutomatic Speech Recognition | CodeCode Available | 3 | 5 |
| Magnitude-aware Probabilistic Speaker Embeddings | Feb 28, 2022 | Out-of-Distribution DetectionSpeaker Verification | CodeCode Available | 3 | 5 |
| Golden Gemini is All You Need: Finding the Sweet Spots for Speaker Verification | Dec 6, 2023 | AllSpeaker Verification | CodeCode Available | 3 | 5 |
| u-HuBERT: Unified Mixed-Modal Speech Pretraining And Zero-Shot Transfer to Unlabeled Modality | Jul 14, 2022 | Speaker Verificationspeech-recognition | CodeCode Available | 2 | 5 |
| Towards A Unified Conformer Structure: from ASR to ASV Task | Nov 14, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 2 | 5 |
| Learning Lip-Based Audio-Visual Speaker Embeddings with AV-HuBERT | May 15, 2022 | Representation LearningSpeaker Verification | CodeCode Available | 2 | 5 |
| Singer Identity Representation Learning using Self-Supervised Techniques | Jan 10, 2024 | Domain GeneralizationRepresentation Learning | CodeCode Available | 2 | 5 |
| Generalized End-to-End Loss for Speaker Verification | Oct 28, 2017 | Domain AdaptationSpeaker Verification | CodeCode Available | 1 | 5 |
| A^3T: Alignment-Aware Acoustic and Text Pretraining for Speech Synthesis and Editing | Mar 18, 2022 | Representation LearningSpeaker Verification | CodeCode Available | 1 | 5 |
| FragmentVC: Any-to-Any Voice Conversion by End-to-End Extracting and Fusing Fine-Grained Voice Fragments With Attention | Oct 27, 2020 | DisentanglementSpeaker Verification | CodeCode Available | 1 | 5 |
| FastAudio: A Learnable Audio Front-End for Spoof Speech Detection | Sep 6, 2021 | Speaker IdentificationSpeaker Verification | CodeCode Available | 1 | 5 |
| ExPO: Explainable Phonetic Trait-Oriented Network for Speaker Verification | Jan 10, 2025 | Speaker Verification | CodeCode Available | 1 | 5 |
| FilterAugment: An Acoustic Environmental Data Augmentation Method | Oct 7, 2021 | Data AugmentationEvent Detection | CodeCode Available | 1 | 5 |
| Improved RawNet with Feature Map Scaling for Text-independent Speaker Verification using Raw Waveforms | Apr 1, 2020 | Speaker VerificationText-Independent Speaker Verification | CodeCode Available | 1 | 5 |
| End-to-End Spectro-Temporal Graph Attention Networks for Speaker Verification Anti-Spoofing and Speech Deepfake Detection | Jul 27, 2021 | Audio Deepfake DetectionDeepFake Detection | CodeCode Available | 1 | 5 |
| Attack on practical speaker verification system using universal adversarial perturbations | May 19, 2021 | Real-World Adversarial AttackRoom Impulse Response (RIR) | CodeCode Available | 1 | 5 |
| Audio Spoofing Verification using Deep Convolutional Neural Networks by Transfer Learning | Aug 8, 2020 | Speaker VerificationTransfer Learning | CodeCode Available | 1 | 5 |
| Cross-modal Audio-visual Co-learning for Text-independent Speaker Verification | Feb 22, 2023 | Speaker VerificationText-Independent Speaker Verification | CodeCode Available | 1 | 5 |
| Exploring Binary Classification Loss For Speaker Verification | Jul 17, 2023 | Binary ClassificationClassification | CodeCode Available | 1 | 5 |
| Extended U-Net for Speaker Verification in Noisy Environments | Jun 27, 2022 | DenoisingSpeaker Identification | CodeCode Available | 1 | 5 |
| Channel-wise Gated Res2Net: Towards Robust Detection of Synthetic Speech Attacks | Jul 19, 2021 | Speaker Verification | CodeCode Available | 1 | 5 |
| DS-TDNN: Dual-stream Time-delay Neural Network with Global-aware Filter for Speaker Verification | Mar 20, 2023 | Speaker VerificationText-Independent Speaker Verification | CodeCode Available | 1 | 5 |
| Crossed-Time Delay Neural Network for Speaker Recognition | May 31, 2020 | Speaker RecognitionSpeaker Verification | CodeCode Available | 1 | 5 |
| Cross-Age Speaker Verification: Learning Age-Invariant Speaker Embeddings | Jul 13, 2022 | Age EstimationSpeaker Verification | CodeCode Available | 1 | 5 |
| From Speaker Verification to Multispeaker Speech Synthesis, Deep Transfer with Feedback Constraint | May 10, 2020 | Speaker VerificationSpeech Synthesis | CodeCode Available | 1 | 5 |
| Automatic speaker verification spoofing and deepfake detection using wav2vec 2.0 and data augmentation | Feb 24, 2022 | Audio Deepfake DetectionData Augmentation | CodeCode Available | 1 | 5 |
| Dynamically Mitigating Data Discrepancy with Balanced Focal Loss for Replay Attack Detection | Jun 25, 2020 | Binary ClassificationSpeaker Verification | CodeCode Available | 1 | 5 |
| Evaluation of Speech Representations for MOS prediction | Jun 16, 2023 | PredictionSelf-Supervised Learning | CodeCode Available | 1 | 5 |
| Deep multi-metric learning for text-independent speaker verification | Jul 17, 2020 | Metric LearningSpeaker Verification | CodeCode Available | 1 | 5 |
| DeID-VC: Speaker De-identification via Zero-shot Pseudo Voice Conversion | Sep 9, 2022 | De-identificationSpeaker Verification | CodeCode Available | 1 | 5 |
| An initial investigation on optimizing tandem speaker verification and countermeasure systems using reinforcement learning | Feb 6, 2020 | Reinforcement LearningSpeaker Verification | CodeCode Available | 1 | 5 |
| An Unsupervised Autoregressive Model for Speech Representation Learning | Apr 5, 2019 | General Classificationmodel | CodeCode Available | 1 | 5 |
| A Fully Tensorized Recurrent Neural Network | Oct 8, 2020 | image-classificationImage Classification | CodeCode Available | 1 | 5 |
| A Speaker Verification Backend with Robust Performance across Conditions | Feb 2, 2021 | Speaker Verification | CodeCode Available | 1 | 5 |
| ASVspoof 2021: Automatic Speaker Verification Spoofing and Countermeasures Challenge Evaluation Plan | Sep 1, 2021 | Face SwappingSpeaker Verification | CodeCode Available | 1 | 5 |
| DropClass and DropAdapt: Dropping classes for deep speaker representation learning | Feb 2, 2020 | General ClassificationRepresentation Learning | CodeCode Available | 1 | 5 |
| ASVspoof 2019: Future Horizons in Spoofed and Fake Audio Detection | Apr 14, 2019 | Speaker Verification | CodeCode Available | 1 | 5 |
| Diff-SV: A Unified Hierarchical Framework for Noise-Robust Speaker Verification Using Score-Based Diffusion Probabilistic Models | Sep 14, 2023 | Speaker VerificationSpeech Enhancement | CodeCode Available | 1 | 5 |
| Attention Back-end for Automatic Speaker Verification with Multiple Enrollment Utterances | Apr 4, 2021 | Speaker Verification | CodeCode Available | 1 | 5 |
| Efficient Attention Branch Network with Combined Loss Function for Automatic Speaker Verification Spoof Detection | Sep 5, 2021 | Speaker VerificationSpoof Detection | CodeCode Available | 1 | 5 |
| End-to-end anti-spoofing with RawNet2 | Nov 2, 2020 | Audio Deepfake DetectionSpeaker Verification | CodeCode Available | 1 | 5 |
| Bias in Automated Speaker Recognition | Jan 24, 2022 | BIG-bench Machine LearningFace Recognition | CodeCode Available | 1 | 5 |
| Backdoor Attack against Speaker Verification | Oct 22, 2020 | Backdoor AttackClustering | CodeCode Available | 1 | 5 |
| Bts-e: Audio deepfake detection using breathing-talking-silence encoder | May 5, 2023 | Audio Deepfake DetectionDeepFake Detection | CodeCode Available | 1 | 5 |
| An Empirical Study on Channel Effects for Synthetic Voice Spoofing Countermeasure Systems | Apr 3, 2021 | Data AugmentationMulti-Task Learning | CodeCode Available | 1 | 5 |