| iEmoTTS: Toward Robust Cross-Speaker Emotion Transfer and Control for Speech Synthesis based on Disentanglement between Prosody and Timbre | Jun 29, 2022 | DisentanglementSpeaker Identification | —Unverified | 0 |
| Extended U-Net for Speaker Verification in Noisy Environments | Jun 27, 2022 | DenoisingSpeaker Identification | CodeCode Available | 1 |
| Identifying Source Speakers for Voice Conversion based Spoofing Attacks on Speaker Verification Systems | Jun 18, 2022 | Speaker IdentificationSpeaker Verification | —Unverified | 0 |
| Strategies to Improve Robustness of Target Speech Extraction to Enrollment Variations | Jun 16, 2022 | Speaker IdentificationSpeech Extraction | —Unverified | 0 |
| Speaker Identification using Speech Recognition | May 29, 2022 | Speaker Identificationspeech-recognition | —Unverified | 0 |
| PaddleSpeech: An Easy-to-Use All-in-One Speech Toolkit | May 20, 2022 | AllAutomatic Speech Recognition (ASR) | CodeCode Available | 6 |
| Silence is Sweeter Than Speech: Self-Supervised Model Using Silence to Store Speaker Information | May 8, 2022 | Self-Supervised LearningSpeaker Identification | —Unverified | 0 |
| VFHQ: A High-Quality Dataset and Benchmark for Video Face Super-Resolution | May 6, 2022 | BenchmarkingSpeaker Identification | —Unverified | 0 |
| EVI: Multilingual Spoken Dialogue Tasks and Dataset for Knowledge-Based Enrolment, Verification, and Identification | Apr 28, 2022 | Speaker IdentificationSpeaker Verification | CodeCode Available | 0 |
| ATST: Audio Representation Learning with Teacher-Student Transformer | Apr 26, 2022 | Audio ClassificationInstrument Recognition | CodeCode Available | 1 |