| The OCON model: an old but green solution for distributable supervised classification for acoustic monitoring in smart cities | Oct 5, 2024 | Automatic Speech Recognition (ASR) | Unverified | 0 |
| Enhancing Open-Set Speaker Identification through Rapid Tuning with Speaker Reciprocal Points and Negative Sample | Sep 24, 2024 | Speaker Identification, Speaker Recognition | Unverified | 0 |
| Avengers Assemble: Amalgamation of Non-Semantic Features for Depression Detection | Sep 22, 2024 | Depression Detection, Emotion Recognition | Unverified | 0 |
| Are Music Foundation Models Better at Singing Voice Deepfake Detection? Far-Better Fuse them with Speech Foundation Models | Sep 21, 2024 | DeepFake Detection, Face Swapping | Unverified | 0 |
| Speaker-IPL: Unsupervised Learning of Speaker Characteristics with i-Vector based Pseudo-Labels | Sep 16, 2024 | Speaker Recognition, Speaker Verification | Unverified | 0 |
| RoboVox Far Field Speaker Recognition: A Novel Data Augmentation Approach with Pretrained Models | Sep 16, 2024 | Data Augmentation, Speaker Recognition | Unverified | 0 |
| Text-To-Speech Synthesis In The Wild | Sep 13, 2024 | Benchmarking, Speaker Recognition | Unverified | 0 |
| Recursive Attentive Pooling for Extracting Speaker Embeddings from Multi-Speaker Recordings | Aug 30, 2024 | Speaker Diarization | Unverified | 0 |
| The VoxCeleb Speaker Recognition Challenge: A Retrospective | Aug 27, 2024 | Domain Adaptation, Speaker Recognition | Unverified | 0 |
| Convexity-based Pruning of Speech Representation Models | Aug 16, 2024 | Keyword Spotting, Self-Supervised Learning | Unverified | 0 |
| Long-Term Conversation Analysis: Privacy-Utility Trade-off under Noise and Reverberation | Aug 1, 2024 | Action Detection, Activity Detection | Unverified | 0 |
| Overview of Speaker Modeling and Its Applications: From the Lens of Deep Speaker Representation Learning | Jul 21, 2024 | Representation Learning, Self-Supervised Learning | Unverified | 0 |
| Team HYU ASML ROBOVOX SP Cup 2024 System Description | Jul 16, 2024 | Data Augmentation, Speaker Recognition | Unverified | 0 |
| Phonetic Richness for Improved Automatic Speaker Verification | Jul 10, 2024 | Speaker Recognition, Speaker Verification | Unverified | 0 |
| A voice and speech corpus of patients who underwent upper airway surgery in pre- and post-operative states | Jul 9, 2024 | Articles, Classification | Code Available | 0 |
| Analyzing Speech Unit Selection for Textless Speech-to-Speech Translation | Jul 8, 2024 | Automatic Speech Recognition, Emotion Recognition | Unverified | 0 |
| We Need Variations in Speech Generation: Sub-center Modelling for Speaker Embeddings | Jul 5, 2024 | Speaker Recognition, Speech Synthesis | Unverified | 0 |
| Prosody-Driven Privacy-Preserving Dementia Detection | Jul 3, 2024 | Attribute, Diagnostic | Code Available | 0 |
| Open-Source Conversational AI with SpeechBrain 1.0 | Jun 29, 2024 | Language Modeling | Unverified | 0 |
| CEC: A Noisy Label Detection Method for Speaker Recognition | Jun 19, 2024 | Speaker Recognition, Speaker Verification | Unverified | 0 |
| Challenging margin-based speaker embedding extractors by using the variational information bottleneck | Jun 18, 2024 | Speaker Recognition | Unverified | 0 |
| PERSONA: An Application for Emotion Recognition, Gender Recognition and Age Estimation | Jun 10, 2024 | Age Estimation, Emotion Recognition | Unverified | 0 |
| The Reasonable Effectiveness of Speaker Embeddings for Violence Detection | Jun 10, 2024 | Speaker Recognition | Unverified | 0 |
| Fill in the Gap! Combining Self-supervised Representation Learning with Neural Audio Synthesis for Speech Inpainting | May 30, 2024 | Audio Synthesis, Representation Learning | Unverified | 0 |
| Speaker Characterization by means of Attention Pooling | May 7, 2024 | Emotion Recognition, Speaker Recognition | Unverified | 0 |