| Few-Shot Speaker Identification Using Depthwise Separable Convolutional Network with Channel Attention | Apr 24, 2022 | Audio ClassificationFew-Shot Learning | —Unverified | 0 |
| WaBERT: A Low-resource End-to-end Model for Spoken Language Understanding and Speech-to-BERT Alignment | Apr 22, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Listen only to me! How well can target speech extraction handle false alarms? | Apr 11, 2022 | Speaker IdentificationSpeaker Verification | —Unverified | 0 |
| AdvEst: Adversarial Perturbation Estimation to Classify and Detect Adversarial Attacks against Speaker Identification | Apr 8, 2022 | Representation LearningSpeaker Identification | —Unverified | 0 |
| Karaoker: Alignment-free singing voice synthesis with speech training data | Apr 8, 2022 | Singing Voice SynthesisSpeaker Identification | —Unverified | 0 |
| Improved Relation Networks for End-to-End Speaker Verification and Identification | Mar 31, 2022 | Meta-LearningRelation | —Unverified | 0 |
| Streaming Speaker-Attributed ASR with Token-Level Speaker Embeddings | Mar 30, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |
| NeuraGen-A Low-Resource Neural Network based approach for Gender Classification | Mar 29, 2022 | Gender ClassificationSpeaker Identification | —Unverified | 0 |
| Speaker Identification Experiments Under Gender De-Identification | Mar 9, 2022 | De-identificationSpeaker Identification | —Unverified | 0 |
| On the relevance of bandwidth extension for speaker identification | Feb 24, 2022 | Bandwidth ExtensionSpeaker Identification | —Unverified | 0 |
| openFEAT: Improving Speaker Identification by Open-set Few-shot Embedding Adaptation with Transformer | Feb 24, 2022 | Open Set LearningSpeaker Identification | —Unverified | 0 |
| Speech watermarking: an approach for the forensic analysis of digital telephonic recordings | Feb 23, 2022 | ArticlesSpeaker Identification | —Unverified | 0 |
| Tubes Among Us: Analog Attack on Automatic Speaker Identification | Feb 6, 2022 | BIG-bench Machine LearningSpeaker Identification | —Unverified | 0 |
| Cross-Lingual Speaker Identification from Weak Local Evidence | Jan 16, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| The exploitation of Multiple Feature Extraction Techniques for Speaker Identification in Emotional States under Disguised Voices | Dec 15, 2021 | Speaker IdentificationVoice Conversion | —Unverified | 0 |
| SLUE: New Benchmark Tasks for Spoken Language Understanding Evaluation on Natural Speech | Nov 19, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |
| Target Speech Extraction: Independent Vector Extraction Guided by Supervised Speaker Identification | Nov 5, 2021 | Speaker IdentificationSpeech Extraction | —Unverified | 0 |
| A Study of Acoustic Features in Arabic Speaker Identification under Noisy Environmental Conditions | Oct 23, 2021 | Speaker Identification | —Unverified | 0 |
| SSAST: Self-Supervised Audio Spectrogram Transformer | Oct 19, 2021 | Audio ClassificationClassification | CodeCode Available | 2 |
| SpeechT5: Unified-Modal Encoder-Decoder Pre-Training for Spoken Language Processing | Oct 14, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |
| UniSpeech-SAT: Universal Speech Representation Learning with Speaker Aware Pre-Training | Oct 12, 2021 | Data AugmentationMulti-Task Learning | CodeCode Available | 1 |
| PEAF: Learnable Power Efficient Analog Acoustic Features for Audio Recognition | Oct 7, 2021 | Action DetectionActivity Detection | —Unverified | 0 |
| Transcribe-to-Diarize: Neural Speaker Diarization for Unlimited Number of Speakers using End-to-End Speaker-Attributed ASR | Oct 7, 2021 | Action DetectionActivity Detection | —Unverified | 0 |
| PL-EESR: Perceptual Loss Based END-TO-END Robust Speaker Representation Extraction | Oct 3, 2021 | Speaker IdentificationSpeaker Verification | CodeCode Available | 0 |
| Compositional Clustering: Applications to Multi-Label Object Recognition and Speaker Identification | Sep 9, 2021 | ClusteringFew-Shot Learning | CodeCode Available | 0 |