| 基於聽覺感知模型之類神經網路及其在語者識別上之應用 (Two-stage Attentional Auditory Model Inspired Neural Network and Its Application to Speaker Identification) [In Chinese] | Nov 1, 2017 | Speaker Identification | Unverified | 0 |
| A Preliminary Exploration with GPT-4o Voice Mode | Feb 14, 2025 | Age Classification, Audio Deepfake Detection | Unverified | 0 |
| A Real-time Speaker Diarization System Based on Spatial Spectrum | Jul 20, 2021 | speaker-diarization, Speaker Diarization | Unverified | 0 |
| A Study of Acoustic Features in Arabic Speaker Identification under Noisy Environmental Conditions | Oct 23, 2021 | Speaker Identification | Unverified | 0 |
| A Study of Few-Shot Audio Classification | Dec 2, 2020 | Audio Classification, BIG-bench Machine Learning | Unverified | 0 |
| A Survey on Paralinguistics in Tamil Speech Processing | Apr 1, 2021 | Emotion Recognition, Speaker Identification | Unverified | 0 |
| A Toolkit for Joint Speaker Diarization and Identification with Application to Speaker-Attributed ASR | Sep 9, 2024 | Automatic Speech Recognition, speaker-diarization | Unverified | 0 |
| A user study to compare two conversational assistants designed for people with hearing impairments | Jun 1, 2019 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Target Speech Extraction: Independent Vector Extraction Guided by Supervised Speaker Identification | Nov 5, 2021 | Speaker Identification, Speech Extraction | Unverified | 0 |
| Can Musical Emotion Be Quantified With Neural Jitter Or Shimmer? A Novel EEG Based Study With Hindustani Classical Music | Apr 29, 2017 | EEG, Electroencephalogram (EEG) | Unverified | 0 |
| CASA-Based Speaker Identification Using Cascaded GMM-CNN Classifier in Noisy and Emotional Talking Conditions | Feb 11, 2021 | Emotion Recognition, Speaker Identification | Unverified | 0 |
| Characteristic-Specific Partial Fine-Tuning for Efficient Emotion and Speaker Adaptation in Codec Language Text-to-Speech Models | Jan 24, 2025 | Emotion Classification, Speaker Identification | Unverified | 0 |
| Comparison of Gender- and Speaker-adaptive Emotion Recognition | May 1, 2014 | Attribute, Emotion Classification | Unverified | 0 |
| Comparison of Multiple Features and Modeling Methods for Text-dependent Speaker Verification | Jul 14, 2017 | Speaker Identification, Speaker Recognition | Unverified | 0 |
| Computer-assisted Speaker Diarization: How to Evaluate Human Corrections | May 1, 2018 | Active Learning, Face Recognition | Unverified | 0 |
| Computing with Hypervectors for Efficient Speaker Identification | Aug 28, 2022 | CPU, Quantization | Unverified | 0 |
| Cosine similarity-based adversarial process | Jul 1, 2019 | Speaker Identification | Unverified | 0 |
| Cross-Lingual Speaker Identification from Weak Local Evidence | Jan 16, 2022 | Language Modeling, Language Modelling | Unverified | 0 |
| Curie: A method for protecting SVM Classifier from Poisoning Attack | Jun 5, 2016 | BIG-bench Machine Learning, Speaker Identification | Unverified | 0 |
| DASB -- Discrete Audio and Speech Benchmark | Jun 20, 2024 | Benchmarking, Emotion Recognition | Unverified | 0 |
| Deep Neural Networks for Automatic Speech Processing: A Survey from Large Corpora to Limited Data | Mar 9, 2020 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Deep versus Wide: An Analysis of Student Architectures for Task-Agnostic Knowledge Distillation of Self-Supervised Speech Models | Jul 14, 2022 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Delving into VoxCeleb: environment invariant speaker recognition | Oct 24, 2019 | Speaker Identification, Speaker Recognition | Unverified | 0 |
| Discrimination between Similar Languages, Varieties and Dialects using CNN- and LSTM-based Deep Neural Networks | Dec 1, 2016 | Dialect Identification, Information Retrieval | Unverified | 0 |
| Effect of utterance duration and phonetic content on speaker identification using second-order statistical methods | Feb 26, 2024 | Speaker Identification | Unverified | 0 |