| Representation Learning to Classify and Detect Adversarial Attacks against Speaker and Speech Recognition Systems | Jul 9, 2021 | Representation LearningSpeaker Identification | —Unverified | 0 |
| QASR: QCRI Aljazeera Speech Resource -- A Large Scale Annotated Arabic Speech Corpus | Jun 24, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Fusion of Embeddings Networks for Robust Combination of Text Dependent and Independent Speaker Recognition | Jun 18, 2021 | Speaker IdentificationSpeaker Recognition | —Unverified | 0 |
| Graph-based Label Propagation for Semi-Supervised Speaker Identification | Jun 15, 2021 | Speaker IdentificationSpeaker Recognition | —Unverified | 0 |
| PF-Net: Personalized Filter for Speaker Recognition from Raw Waveform | May 31, 2021 | Speaker IdentificationSpeaker Recognition | CodeCode Available | 0 |
| End-to-End Diarization for Variable Number of Speakers with Local-Global Networks and Discriminative Speaker Embeddings | May 5, 2021 | ClusteringSpeaker Identification | —Unverified | 0 |
| Streaming Multi-talker Speech Recognition with Joint Speaker Identification | Apr 5, 2021 | Speaker Identificationspeech-recognition | —Unverified | 0 |
| End-to-End Speaker-Attributed ASR with Transformer | Apr 5, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| A Survey on Paralinguistics in Tamil Speech Processing | Apr 1, 2021 | Emotion RecognitionSpeaker Identification | —Unverified | 0 |
| Voice Privacy with Smart Digital Assistants in Educational Settings | Mar 24, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Triplet loss based embeddings for forensic speaker identification in Spanish | Feb 24, 2021 | Speaker IdentificationTriplet | —Unverified | 0 |
| CASA-Based Speaker Identification Using Cascaded GMM-CNN Classifier in Noisy and Emotional Talking Conditions | Feb 11, 2021 | Emotion RecognitionSpeaker Identification | —Unverified | 0 |
| Speaker attribution with voice profiles by graph-based semi-supervised learning | Feb 6, 2021 | Speaker Identification | —Unverified | 0 |
| Attention-based multi-task learning for speech-enhancement and speaker-identification in multi-speaker dialogue scenario | Jan 7, 2021 | Multi-Task LearningSpeaker Identification | CodeCode Available | 0 |
| Hypothesis Stitcher for End-to-End Speaker-attributed ASR on Long-form Multi-talker Recordings | Jan 6, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| A Study of Few-Shot Audio Classification | Dec 2, 2020 | Audio ClassificationBIG-bench Machine Learning | —Unverified | 0 |
| How Far Are We from Robust Voice Conversion: A Survey | Nov 24, 2020 | Speaker IdentificationSurvey | —Unverified | 0 |
| Multi-Modal Emotion Detection with Transfer Learning | Nov 13, 2020 | Speaker IdentificationTransfer Learning | —Unverified | 0 |
| T-vectors: Weakly Supervised Speaker Identification Using Hierarchical Transformer Model | Oct 29, 2020 | Speaker Identification | —Unverified | 0 |
| Compositional embedding models for speaker identification and diarization with simultaneous speech from 2+ speakers | Oct 22, 2020 | speaker-diarizationSpeaker Diarization | CodeCode Available | 0 |
| Contrastive Learning of General-Purpose Audio Representations | Oct 21, 2020 | CoLAContrastive Learning | CodeCode Available | 0 |
| A Lightweight Speaker Recognition System Using Timbre Properties | Oct 12, 2020 | GPUSpeaker Identification | —Unverified | 0 |
| Remarks on Optimal Scores for Speaker Recognition | Oct 10, 2020 | Speaker IdentificationSpeaker Recognition | —Unverified | 0 |
| SoK: The Faults in our ASRs: An Overview of Attacks against Automatic Speech Recognition and Speaker Identification Systems | Jul 13, 2020 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Joint Speaker Counting, Speech Recognition, and Speaker Identification for Overlapped Speech of Any Number of Speakers | Jun 19, 2020 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Integrated Replay Spoofing-aware Text-independent Speaker Verification | Jun 10, 2020 | Multi-Task LearningSpeaker Identification | —Unverified | 0 |
| Speaker and Posture Classification using Instantaneous Intraspeech Breathing Features | May 25, 2020 | Action DetectionActivity Detection | —Unverified | 0 |
| Identify Speakers in Cocktail Parties with End-to-End Attention | May 22, 2020 | Speaker IdentificationSpeech Separation | CodeCode Available | 0 |
| Audio ALBERT: A Lite BERT for Self-supervised Learning of Audio Representation | May 18, 2020 | Self-Supervised LearningSpeaker Identification | CodeCode Available | 0 |
| Weakly Supervised Training of Hierarchical Attention Networks for Speaker Identification | May 15, 2020 | Speaker Identification | —Unverified | 0 |
| Speaker Recognition in Bengali Language from Nonlinear Features | Apr 15, 2020 | Speaker IdentificationSpeaker Recognition | —Unverified | 0 |
| End-to-end Recurrent Denoising Autoencoder Embeddings for Speaker Identification | Mar 13, 2020 | Data AugmentationDenoising | —Unverified | 0 |
| Deep Neural Networks for Automatic Speech Processing: A Survey from Large Corpora to Limited Data | Mar 9, 2020 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Speaker Identification using EEG | Mar 7, 2020 | EEGElectroencephalogram (EEG) | —Unverified | 0 |
| Multi-Task Learning with Auxiliary Speaker Identification for Conversational Emotion Recognition | Mar 3, 2020 | Emotion Recognition in ConversationMulti-Task Learning | —Unverified | 0 |
| Speech Enhancement using Self-Adaptation and Multi-Head Self-Attention | Feb 14, 2020 | Multi-Task LearningSpeaker Identification | —Unverified | 0 |
| Supervised Speaker Embedding De-Mixing in Two-Speaker Environment | Jan 14, 2020 | Speaker IdentificationVocal Bursts Valence Prediction | —Unverified | 0 |
| Robust Speaker Recognition Using Speech Enhancement And Attention Model | Jan 14, 2020 | Speaker IdentificationSpeaker Recognition | —Unverified | 0 |
| The Deterministic plus Stochastic Model of the Residual Signal and its Applications | Dec 29, 2019 | Speaker IdentificationSpeech Synthesis | —Unverified | 0 |
| Advances in Online Audio-Visual Meeting Transcription | Dec 10, 2019 | Sound Source Localizationspeaker-diarization | —Unverified | 0 |
| Privacy-Preserving Adversarial Representation Learning in ASR: Reality or Illusion? | Nov 12, 2019 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Supervised Initialization of LSTM Networks for Fundamental Frequency Detection in Noisy Speech Signals | Nov 11, 2019 | Speaker Identification | —Unverified | 0 |
| Reducing audio membership inference attack accuracy to chance: 4 defenses | Oct 31, 2019 | Inference AttackMembership Inference Attack | —Unverified | 0 |
| Adaptive blind audio source extraction supervised by dominant speaker identification using x-vectors | Oct 25, 2019 | Speaker Identification | —Unverified | 0 |
| Delving into VoxCeleb: environment invariant speaker recognition | Oct 24, 2019 | Speaker IdentificationSpeaker Recognition | CodeCode Available | 0 |
| Word-level Embeddings for Cross-Task Transfer Learning in Speech Processing | Oct 22, 2019 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| H-VECTORS: Utterance-level Speaker Embedding Using A Hierarchical Attention Model | Oct 17, 2019 | Speaker Identification | —Unverified | 0 |
| Latent space representation for multi-target speaker detection and identification with a sparse dataset using Triplet neural networks | Oct 1, 2019 | Speaker IdentificationSpeaker Recognition | CodeCode Available | 0 |
| Emirati-Accented Speaker Identification in Stressful Talking Conditions | Sep 28, 2019 | Speaker Identification | —Unverified | 0 |
| Improving Noise Robustness In Speaker Identification Using A Two-Stage Attention Model | Sep 24, 2019 | Speaker IdentificationSpeaker Recognition | —Unverified | 0 |