| PaddleSpeech: An Easy-to-Use All-in-One Speech Toolkit | May 20, 2022 | All, Automatic Speech Recognition (ASR) | Code Available | 6 | 5 |
| VoxBlink2: A 100K+ Speaker Recognition Corpus and the Open-Set Speaker-Identification Benchmark | Jul 16, 2024 | Diversity, Speaker Identification | Code Available | 5 | 5 |
| Golden Gemini is All You Need: Finding the Sweet Spots for Speaker Verification | Dec 6, 2023 | All, Speaker Verification | Code Available | 3 | 5 |
| Magnitude-aware Probabilistic Speaker Embeddings | Feb 28, 2022 | Out-of-Distribution Detection, Speaker Verification | Code Available | 3 | 5 |
| Pushing the limits of raw waveform speaker recognition | Mar 16, 2022 | Self-Supervised Learning, Speaker Recognition | Code Available | 3 | 5 |
| ESPnet-SPK: full pipeline speaker embedding toolkit with reproducible recipes, self-supervised front-ends, and off-the-shelf models | Jan 30, 2024 | Self-Supervised Learning, Speaker Recognition | Code Available | 3 | 5 |
| SALMONN: Towards Generic Hearing Abilities for Large Language Models | Oct 20, 2023 | Audio captioning, Automatic Speech Recognition | Code Available | 3 | 5 |
| Ludwig: a type-based declarative deep learning toolbox | Sep 17, 2019 | Decoder, Deep Learning | Code Available | 3 | 5 |
| u-HuBERT: Unified Mixed-Modal Speech Pretraining And Zero-Shot Transfer to Unlabeled Modality | Jul 14, 2022 | Speaker Verification, speech-recognition | Code Available | 2 | 5 |
| Towards A Unified Conformer Structure: from ASR to ASV Task | Nov 14, 2022 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 2 | 5 |
| Singer Identity Representation Learning using Self-Supervised Techniques | Jan 10, 2024 | Domain Generalization, Representation Learning | Code Available | 2 | 5 |
| Learning Lip-Based Audio-Visual Speaker Embeddings with AV-HuBERT | May 15, 2022 | Representation Learning, Speaker Verification | Code Available | 2 | 5 |
| A^3T: Alignment-Aware Acoustic and Text Pretraining for Speech Synthesis and Editing | Mar 18, 2022 | Representation Learning, Speaker Verification | Code Available | 1 | 5 |
| Cross-modal Audio-visual Co-learning for Text-independent Speaker Verification | Feb 22, 2023 | Speaker Verification, Text-Independent Speaker Verification | Code Available | 1 | 5 |
| An Unsupervised Autoregressive Model for Speech Representation Learning | Apr 5, 2019 | General Classification, model | Code Available | 1 | 5 |
| Cross-Age Speaker Verification: Learning Age-Invariant Speaker Embeddings | Jul 13, 2022 | Age Estimation, Speaker Verification | Code Available | 1 | 5 |
| Crossed-Time Delay Neural Network for Speaker Recognition | May 31, 2020 | Speaker Recognition, Speaker Verification | Code Available | 1 | 5 |
| Cross-modal information fusion for voice spoofing detection | Feb 1, 2023 | Automatic Speech Recognition, fake voice detection | Code Available | 1 | 5 |
| AutoSpeech: Neural Architecture Search for Speaker Recognition | May 7, 2020 | image-classification, Image Classification | Code Available | 1 | 5 |
| Bias in Automated Speaker Recognition | Jan 24, 2022 | BIG-bench Machine Learning, Face Recognition | Code Available | 1 | 5 |
| Audio Spoofing Verification using Deep Convolutional Neural Networks by Transfer Learning | Aug 8, 2020 | Speaker Verification, Transfer Learning | Code Available | 1 | 5 |
| Backdoor Attack against Speaker Verification | Oct 22, 2020 | Backdoor Attack, Clustering | Code Available | 1 | 5 |
| An initial investigation on optimizing tandem speaker verification and countermeasure systems using reinforcement learning | Feb 6, 2020 | Reinforcement Learning, Speaker Verification | Code Available | 1 | 5 |
| A Probabilistic Fusion Framework for Spoofing Aware Speaker Verification | Feb 10, 2022 | Speaker Verification | Code Available | 1 | 5 |
| An Empirical Study on Channel Effects for Synthetic Voice Spoofing Countermeasure Systems | Apr 3, 2021 | Data Augmentation, Multi-Task Learning | Code Available | 1 | 5 |