| Ordered and Binary Speaker Embedding | May 25, 2023 | Clustering, Retrieval | Unverified | 0 |
| On the Transferability of Whisper-based Representations for "In-the-Wild" Cross-Task Downstream Speech Applications | May 23, 2023 | Automatic Speech Recognition (ASR) | Unverified | 0 |
| Security and Privacy Problems in Voice Assistant Applications: A Survey | Apr 19, 2023 | Automatic Speech Recognition (ASR) | Unverified | 0 |
| Unsupervised Speech Representation Pooling Using Vector Quantization | Apr 8, 2023 | Emotion Recognition, Intent Classification | Code Available | 0 |
| HiSSNet: Sound Event Detection and Speaker Identification via Hierarchical Prototypical Networks for Low-Resource Headphones | Mar 13, 2023 | Event Detection, Sound Event Detection | Unverified | 0 |
| Ensemble knowledge distillation of self-supervised speech models | Feb 24, 2023 | Automatic Speech Recognition (ASR) | Unverified | 0 |
| ExARN: self-attending RNN for target speaker extraction | Dec 2, 2022 | Speaker Identification, Target Speaker Extraction | Unverified | 0 |
| Multi-Label Training for Text-Independent Speaker Identification | Nov 14, 2022 | Ensemble Learning, Speaker Identification | Unverified | 0 |
| Privacy-Utility Balanced Voice De-Identification Using Adversarial Examples | Nov 10, 2022 | De-identification, Speaker Identification | Unverified | 0 |
| Symmetric Saliency-based Adversarial Attack To Speaker Identification | Oct 30, 2022 | Adversarial Attack, Decoder | Unverified | 0 |
| Masked Modeling Duo: Learning Representations by Encouraging Both Networks to Model the Input | Oct 26, 2022 | Audio Classification, Audio Tagging | Code Available | 0 |
| Quantitative Evidence on Overlooked Aspects of Enrollment Speaker Embeddings for Target Speaker Separation | Oct 23, 2022 | Speaker Identification, Speaker Separation | Unverified | 0 |
| Speaker Identification from emotional and noisy speech data using learned voice segregation and Speech VGG | Oct 23, 2022 | Speaker Identification | Unverified | 0 |
| Cross-Lingual Speaker Identification Using Distant Supervision | Oct 11, 2022 | Language Modeling | Code Available | 0 |
| Text Independent Speaker Identification System for Access Control | Sep 26, 2022 | Speaker Identification | Unverified | 0 |
| Computing with Hypervectors for Efficient Speaker Identification | Aug 28, 2022 | CPU, Quantization | Unverified | 0 |
| Deep versus Wide: An Analysis of Student Architectures for Task-Agnostic Knowledge Distillation of Self-Supervised Speech Models | Jul 14, 2022 | Automatic Speech Recognition (ASR) | Unverified | 0 |
| Graph-based Multi-View Fusion and Local Adaptation: Mitigating Within-Household Confusability for Speaker Identification | Jul 8, 2022 | Fairness, Speaker Identification | Unverified | 0 |
| Speaker Diarization and Identification from Single-Channel Classroom Audio Recording Using Virtual Microphones | Jul 1, 2022 | Speaker Diarization | Unverified | 0 |
| iEmoTTS: Toward Robust Cross-Speaker Emotion Transfer and Control for Speech Synthesis based on Disentanglement between Prosody and Timbre | Jun 29, 2022 | Disentanglement, Speaker Identification | Unverified | 0 |
| Identifying Source Speakers for Voice Conversion based Spoofing Attacks on Speaker Verification Systems | Jun 18, 2022 | Speaker Identification, Speaker Verification | Unverified | 0 |
| Strategies to Improve Robustness of Target Speech Extraction to Enrollment Variations | Jun 16, 2022 | Speaker Identification, Speech Extraction | Unverified | 0 |
| Speaker Identification using Speech Recognition | May 29, 2022 | Speaker Identification, Speech Recognition | Unverified | 0 |
| Silence is Sweeter Than Speech: Self-Supervised Model Using Silence to Store Speaker Information | May 8, 2022 | Self-Supervised Learning, Speaker Identification | Unverified | 0 |
| VFHQ: A High-Quality Dataset and Benchmark for Video Face Super-Resolution | May 6, 2022 | Benchmarking, Speaker Identification | Unverified | 0 |
| EVI: Multilingual Spoken Dialogue Tasks and Dataset for Knowledge-Based Enrolment, Verification, and Identification | Apr 28, 2022 | Speaker Identification, Speaker Verification | Code Available | 0 |
| Few-Shot Speaker Identification Using Depthwise Separable Convolutional Network with Channel Attention | Apr 24, 2022 | Audio Classification, Few-Shot Learning | Unverified | 0 |
| WaBERT: A Low-resource End-to-end Model for Spoken Language Understanding and Speech-to-BERT Alignment | Apr 22, 2022 | Automatic Speech Recognition (ASR) | Unverified | 0 |
| Listen only to me! How well can target speech extraction handle false alarms? | Apr 11, 2022 | Speaker Identification, Speaker Verification | Unverified | 0 |
| Karaoker: Alignment-free singing voice synthesis with speech training data | Apr 8, 2022 | Singing Voice Synthesis, Speaker Identification | Unverified | 0 |
| AdvEst: Adversarial Perturbation Estimation to Classify and Detect Adversarial Attacks against Speaker Identification | Apr 8, 2022 | Representation Learning, Speaker Identification | Unverified | 0 |
| Improved Relation Networks for End-to-End Speaker Verification and Identification | Mar 31, 2022 | Meta-Learning, Relation | Unverified | 0 |
| NeuraGen-A Low-Resource Neural Network based approach for Gender Classification | Mar 29, 2022 | Gender Classification, Speaker Identification | Unverified | 0 |
| Speaker Identification Experiments Under Gender De-Identification | Mar 9, 2022 | De-identification, Speaker Identification | Unverified | 0 |
| On the relevance of bandwidth extension for speaker identification | Feb 24, 2022 | Bandwidth Extension, Speaker Identification | Unverified | 0 |
| openFEAT: Improving Speaker Identification by Open-set Few-shot Embedding Adaptation with Transformer | Feb 24, 2022 | Open Set Learning, Speaker Identification | Unverified | 0 |
| Speech watermarking: an approach for the forensic analysis of digital telephonic recordings | Feb 23, 2022 | Articles, Speaker Identification | Unverified | 0 |
| Tubes Among Us: Analog Attack on Automatic Speaker Identification | Feb 6, 2022 | BIG-bench Machine Learning, Speaker Identification | Unverified | 0 |
| Cross-Lingual Speaker Identification from Weak Local Evidence | Jan 16, 2022 | Language Modeling | Unverified | 0 |
| The exploitation of Multiple Feature Extraction Techniques for Speaker Identification in Emotional States under Disguised Voices | Dec 15, 2021 | Speaker Identification, Voice Conversion | Unverified | 0 |
| Target Speech Extraction: Independent Vector Extraction Guided by Supervised Speaker Identification | Nov 5, 2021 | Speaker Identification, Speech Extraction | Unverified | 0 |
| A Study of Acoustic Features in Arabic Speaker Identification under Noisy Environmental Conditions | Oct 23, 2021 | Speaker Identification | Unverified | 0 |
| Transcribe-to-Diarize: Neural Speaker Diarization for Unlimited Number of Speakers using End-to-End Speaker-Attributed ASR | Oct 7, 2021 | Action Detection, Activity Detection | Unverified | 0 |
| PEAF: Learnable Power Efficient Analog Acoustic Features for Audio Recognition | Oct 7, 2021 | Action Detection, Activity Detection | Unverified | 0 |
| PL-EESR: Perceptual Loss Based END-TO-END Robust Speaker Representation Extraction | Oct 3, 2021 | Speaker Identification, Speaker Verification | Code Available | 0 |
| Compositional Clustering: Applications to Multi-Label Object Recognition and Speaker Identification | Sep 9, 2021 | Clustering, Few-Shot Learning | Code Available | 0 |
| Improving Speaker Identification for Shared Devices by Adapting Embeddings to Speaker Subsets | Sep 6, 2021 | Speaker Identification | Unverified | 0 |
| Towards Making the Most of Dialogue Characteristics for Neural Chat Translation | Sep 2, 2021 | Machine Translation, Response Generation | Code Available | 0 |
| QASR: QCRI Aljazeera Speech Resource A Large Scale Annotated Arabic Speech Corpus | Aug 1, 2021 | Automatic Speech Recognition (ASR) | Unverified | 0 |
| A Real-time Speaker Diarization System Based on Spatial Spectrum | Jul 20, 2021 | Speaker Diarization | Unverified | 0 |