Toeplitz Inverse Covariance based Robust Speaker Clustering for Naturalistic Audio Streams Jul 12, 2019 Clustering speaker-diarization
— Unverified 0TouchTTS: An Embarrassingly Simple TTS Framework that Everyone Can Touch Dec 11, 2024 Denoising speaker-diarization
— Unverified 0Towards end-2-end learning for predicting behavior codes from spoken utterances in psychotherapy conversations Jul 1, 2020 Action Detection Activity Detection
— Unverified 0Late Audio-Visual Fusion for In-The-Wild Speaker Diarization Nov 2, 2022 speaker-diarization Speaker Diarization
— Unverified 0Towards Measuring and Scoring Speaker Diarization Fairness Feb 20, 2023 Fairness Sentence
— Unverified 0Towards Robust Family-Infant Audio Analysis Based on Unsupervised Pretraining of Wav2vec 2.0 on Large-Scale Unlabeled Family Audio May 21, 2023 speaker-diarization Speaker Diarization
— Unverified 0Towards Unsupervised Speaker Diarization System for Multilingual Telephone Calls Using Pre-trained Whisper Model and Mixture of Sparse Autoencoders Jul 2, 2024 Clustering speaker-diarization
— Unverified 0Towards Word-Level End-to-End Neural Speaker Diarization with Auxiliary Network Sep 15, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Training Speaker Embedding Extractors Using Multi-Speaker Audio with Unknown Speaker Boundaries Mar 29, 2022 speaker-diarization Speaker Diarization
— Unverified 0Transcribe-to-Diarize: Neural Speaker Diarization for Unlimited Number of Speakers using End-to-End Speaker-Attributed ASR Oct 7, 2021 Action Detection Activity Detection
— Unverified 0Triplet Network with Attention for Speaker Diarization Aug 4, 2018 Metric Learning speaker-diarization
— Unverified 0TSUP Speaker Diarization System for Conversational Short-phrase Speaker Diarization Challenge Oct 26, 2022 Action Detection Activity Detection
— Unverified 0Uncertainty Quantification in Machine Learning for Joint Speaker Diarization and Identification Dec 28, 2023 speaker-diarization Speaker Diarization
— Unverified 0Unified Audio Event Detection Sep 13, 2024 Event Detection Sound Event Detection
— Unverified 0Universal Speaker Embedding Free Target Speaker Extraction and Personal Voice Activity Detection Jan 7, 2025 Action Detection Activity Detection
— Unverified 0UniX-Encoder: A Universal X-Channel Speech Encoder for Ad-Hoc Microphone Array Speech Processing Oct 25, 2023 speaker-diarization Speaker Diarization
— Unverified 0Unsupervised Adaptation of SPLDA Nov 20, 2015 speaker-diarization Speaker Diarization
— Unverified 0Unsupervised Speaker Diarization in Distributed IoT Networks Using Federated Learning Apr 16, 2024 Change Detection Federated Learning
— Unverified 0Unsupervised Speaker Diarization that is Agnostic to Language, Overlap-Aware, and Tuning Free Jul 25, 2022 speaker-diarization Speaker Diarization
— Unverified 0Using Active Speaker Faces for Diarization in TV shows Mar 30, 2022 Face Clustering Face Detection
— Unverified 0Utterance-Wise Meeting Transcription System Using Asynchronous Distributed Microphones Jul 31, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0UWB-NTIS Speaker Diarization System for the DIHARD II 2019 Challenge May 27, 2019 Clustering speaker-diarization
— Unverified 0VOXLINGUA107: A DATASET FOR SPOKEN LANGUAGE RECOGNITION Nov 25, 2020 Action Detection Activity Detection
— Unverified 0VoxRAG: A Step Toward Transcription-Free RAG Systems in Spoken Question Answering May 22, 2025 Question Answering RAG
— Unverified 0Weakly Supervised Training of Speaker Identification Models Jun 22, 2018 speaker-diarization Speaker Diarization
— Unverified 0An approach to optimize inference of the DIART speaker diarization pipeline Aug 5, 2024 Inference Optimization Knowledge Distillation
— Unverified 0Polish Read Speech Corpus for Speech Tools and Services Jun 1, 2017 Action Detection Activity Detection
— Unverified 0Preparation of Bangla Speech Corpus from Publicly Available Audio \& Text May 1, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Pretraining Multi-Speaker Identification for Neural Speaker Diarization May 30, 2025 speaker-diarization Speaker Diarization
— Unverified 0Privacy-preserving Automatic Speaker Diarization Oct 26, 2022 Privacy Preserving speaker-diarization
— Unverified 0Property-Aware Multi-Speaker Data Simulation: A Probabilistic Modelling Technique for Synthetic Data Generation Oct 18, 2023 Action Detection Activity Detection
— Unverified 0psifx -- Psychological and Social Interactions Feature Extraction Package Jul 14, 2024 Pose Estimation speaker-diarization
— Unverified 0Recursive Attentive Pooling for Extracting Speaker Embeddings from Multi-Speaker Recordings Aug 30, 2024 speaker-diarization Speaker Diarization
— Unverified 0Reformulating Speaker Diarization as Community Detection With Emphasis On Topological Structure Apr 26, 2022 Clustering Community Detection
— Unverified 0Role-specific Language Models for Processing Recorded Neuropsychological Exams Jun 1, 2018 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Royalflush Speaker Diarization System for ICASSP 2022 Multi-channel Multi-party Meeting Transcription Challenge Feb 10, 2022 speaker-diarization Speaker Diarization
— Unverified 0SCDiar: a streaming diarization system based on speaker change detection and speech recognition Jan 28, 2025 Change Detection speaker-diarization
— Unverified 0SC-SOT: Conditioning the Decoder on Diarized Speaker Information for End-to-End Overlapped Speech Recognition Jun 15, 2025 Decoder speaker-diarization
— Unverified 0SEAL: Speaker Error Correction using Acoustic-conditioned Large Language Models Jan 14, 2025 speaker-diarization Speaker Diarization
— Unverified 0Seewo's Submission to MLC-SLM: Lessons learned from Speech Reasoning Language Models Jun 16, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Segmentation et Regroupement en Locuteurs d'une collection de documents audio (Cross-show speaker diarization) [in French] Jun 1, 2012 speaker-diarization Speaker Diarization
— Unverified 0Self-supervised learning for audio-visual speaker diarization Feb 13, 2020 Self-Supervised Learning speaker-diarization
— Unverified 0Self-supervised Speaker Diarization Apr 8, 2022 speaker-diarization Speaker Diarization
— Unverified 0Semi-supervised acoustic modelling for five-lingual code-switched ASR using automatically-segmented soap opera speech Apr 8, 2020 Acoustic Modelling Action Detection
— Unverified 0Semi-supervised Acoustic Modelling for Five-lingual Code-switched ASR using Automatically-segmented Soap Opera Speech May 1, 2020 Acoustic Modelling Action Detection
— Unverified 0Semi-supervised acoustic model training for speech with code-switching Oct 23, 2018 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Semi-supervised multi-channel speaker diarization with cross-channel attention Jul 17, 2023 speaker-diarization Speaker Diarization
— Unverified 0SeniorTalk: A Chinese Conversation Dataset with Rich Annotations for Super-Aged Seniors Mar 20, 2025 speaker-diarization Speaker Diarization
— Unverified 0Separation Guided Speaker Diarization in Realistic Mismatched Conditions Jul 6, 2021 Clustering speaker-diarization
— Unverified 0Sequence-to-Sequence Neural Diarization with Automatic Speaker Detection and Representation Nov 21, 2024 Action Detection Activity Detection
— Unverified 0