| Whisper Speaker Identification: Leveraging Pre-Trained Multilingual Transformers for Robust Speaker Embeddings | Mar 13, 2025 | Speaker Identificationspeech-recognition | CodeCode Available | 1 |
| Speech-FT: Merging Pre-trained And Fine-Tuned Speech Representation Models For Cross-Task Generalization | Feb 18, 2025 | Automatic Speech RecognitionSpeaker Identification | —Unverified | 0 |
| A Preliminary Exploration with GPT-4o Voice Mode | Feb 14, 2025 | Age ClassificationAudio Deepfake Detection | —Unverified | 0 |
| SCDiar: a streaming diarization system based on speaker change detection and speech recognition | Jan 28, 2025 | Change Detectionspeaker-diarization | —Unverified | 0 |
| Characteristic-Specific Partial Fine-Tuning for Efficient Emotion and Speaker Adaptation in Codec Language Text-to-Speech Models | Jan 24, 2025 | Emotion ClassificationSpeaker Identification | —Unverified | 0 |
| PolInterviews -- A Dataset of German Politician Public Broadcast Interviews | Jan 8, 2025 | Speaker Identification | —Unverified | 0 |
| Friends-MMC: A Dataset for Multi-modal Multi-party Conversation Understanding | Dec 23, 2024 | Speaker Identification | CodeCode Available | 0 |
| Machine Unlearning reveals that the Gender-based Violence Victim Condition can be detected from Speech in a Speaker-Agnostic Setting | Nov 27, 2024 | Machine UnlearningSpeaker Identification | —Unverified | 0 |
| Towards Speaker Identification with Minimal Dataset and Constrained Resources using 1D-Convolution Neural Network | Nov 22, 2024 | Data AugmentationSpeaker Identification | CodeCode Available | 0 |
| Towards Advanced Speech Signal Processing: A Statistical Perspective on Convolution-Based Architectures and its Applications | Nov 20, 2024 | Emotion RecognitionSpeaker Identification | —Unverified | 0 |