Property-Aware Multi-Speaker Data Simulation: A Probabilistic Modelling Technique for Synthetic Data Generation Oct 18, 2023 Action Detection Activity Detection
— Unverified 0End-to-end Online Speaker Diarization with Target Speaker Tracking Oct 12, 2023 Action Detection Activity Detection
— Unverified 0One model to rule them all ? Towards End-to-End Joint Speaker Diarization and Speech Recognition Oct 2, 2023 All Automatic Speech Recognition
— Unverified 0Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based attractors Sep 25, 2023 Decoder speaker-diarization
Code Code Available 1NTT speaker diarization system for CHiME-7: multi-domain, multi-microphone End-to-end and vector clustering diarization Sep 22, 2023 Automatic Speech Recognition speaker-diarization
— Unverified 0Improving Speaker Diarization using Semantic Information: Joint Pairwise Constraints Propagation Sep 19, 2023 speaker-diarization Speaker Diarization
— Unverified 0Neural Speaker Diarization Using Memory-Aware Multi-Speaker Embedding with Sequence-to-Sequence Architecture Sep 17, 2023 speaker-diarization Speaker Diarization
Code Code Available 1Towards Word-Level End-to-End Neural Speaker Diarization with Auxiliary Network Sep 15, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0DiaCorrect: Error Correction Back-end For Speaker Diarization Sep 15, 2023 Automatic Speech Recognition Decoder
Code Code Available 1Aligning Speakers: Evaluating and Visualizing Text-based Diarization Using Efficient Multiple Sequence Alignment (Extended Version) Sep 14, 2023 Multiple Sequence Alignment speaker-diarization
— Unverified 0DiariST: Streaming Speech Translation with Speaker Diarization Sep 14, 2023 speaker-diarization Speaker Diarization
Code Code Available 1Enhancing Child Vocalization Classification with Phonetically-Tuned Embeddings for Assisting Autism Diagnosis Sep 13, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Enhancing Speaker Diarization with Large Language Models: A Contextual Beam Search Approach Sep 11, 2023 speaker-diarization Speaker Diarization
Code Code Available 1The USTC-NERCSLIP Systems for the CHiME-7 DASR Challenge Aug 28, 2023 speaker-diarization Speaker Diarization
— Unverified 0Implicit Self-supervised Language Representation for Spoken Language Diarization Aug 21, 2023 speaker-diarization Speaker Diarization
— Unverified 0Home monitoring for frailty detection through sound and speaker diarization analysis Aug 17, 2023 Privacy Preserving speaker-diarization
— Unverified 0GIST-AiTeR Speaker Diarization System for VoxCeleb Speaker Recognition Challenge (VoxSRC) 2023 Aug 15, 2023 speaker-diarization Speaker Diarization
— Unverified 0Speaker Diarization of Scripted Audiovisual Content Aug 4, 2023 speaker-diarization Speaker Diarization
— Unverified 0Joint speech and overlap detection: a benchmark over multiple audio setup and speech domains Jul 24, 2023 Multi-class Classification speaker-diarization
— Unverified 0Semi-supervised multi-channel speaker diarization with cross-channel attention Jul 17, 2023 speaker-diarization Speaker Diarization
— Unverified 0Long-term Conversation Analysis: Exploring Utility and Privacy Jun 28, 2023 Action Detection Activity Detection
Code Code Available 0Community Detection Graph Convolutional Network for Overlap-Aware Speaker Diarization Jun 26, 2023 Clustering Community Detection
— Unverified 0Implicit spoken language diarization Jun 22, 2023 Language Modeling Language Modelling
— Unverified 0Speech Emotion Diarization: Which Emotion Appears When? Jun 22, 2023 Emotion Recognition speaker-diarization
Code Code Available 1Lexical Speaker Error Correction: Leveraging Language Models for Speaker Diarization Error Correction Jun 15, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Multi-microphone Automatic Speech Segmentation in Meetings Based on Circular Harmonics Features Jun 7, 2023 Action Detection Activity Detection
— Unverified 0Self-supervised Audio Teacher-Student Transformer for Both Clip-level and Frame-level Tasks Jun 7, 2023 Audio Classification Audio Tagging
Code Code Available 1An Experimental Review of Speaker Diarization methods with application to Two-Speaker Conversational Telephone Speech recordings May 29, 2023 Clustering speaker-diarization
— Unverified 0Multi-Stream Extension of Variational Bayesian HMM Clustering (MS-VBx) for Combined End-to-End and Vector Clustering-based Diarization May 23, 2023 Clustering speaker-diarization
— Unverified 0Exploring Speaker-Related Information in Spoken Language Understanding for Better Speaker Diarization May 22, 2023 speaker-diarization Speaker Diarization
— Unverified 0Towards Robust Family-Infant Audio Analysis Based on Unsupervised Pretraining of Wav2vec 2.0 on Large-Scale Unlabeled Family Audio May 21, 2023 speaker-diarization Speaker Diarization
— Unverified 0Neural Diarization with Non-autoregressive Intermediate Attractors Mar 13, 2023 Decoder speaker-diarization
Code Code Available 0TOLD: A Novel Two-Stage Overlap-Aware Framework for Speaker Diarization Mar 8, 2023 speaker-diarization Speaker Diarization
Code Code Available 0A Light Weight Model for Active Speaker Detection Mar 8, 2023 Active Speaker Detection Audio-Visual Active Speaker Detection
Code Code Available 1Improving Transformer-based End-to-End Speaker Diarization by Assigning Auxiliary Losses to Attention Heads Mar 2, 2023 Action Detection Activity Detection
— Unverified 0DISPLACE Challenge: DIarization of SPeaker and LAnguage in Conversational Environments Mar 1, 2023 speaker-diarization Speaker Diarization
— Unverified 0Supervised Hierarchical Clustering using Graph Neural Networks for Speaker Diarization Feb 24, 2023 Clustering Graph Clustering
Code Code Available 0A Reinforcement Learning Framework for Online Speaker Diarization Feb 21, 2023 Decision Making Domain Adaptation
— Unverified 0VoxSRC 2022: The Fourth VoxCeleb Speaker Recognition Challenge Feb 20, 2023 Speaker Diarization Speaker Recognition
Code Code Available 1Towards Measuring and Scoring Speaker Diarization Fairness Feb 20, 2023 Fairness Sentence
— Unverified 0The Newsbridge -Telecom SudParis VoxCeleb Speaker Recognition Challenge 2022 System Description Jan 17, 2023 Action Detection Activity Detection
— Unverified 0BER: Balanced Error Rate For Speaker Diarization Nov 8, 2022 speaker-diarization Speaker Diarization
Code Code Available 1Late Audio-Visual Fusion for In-The-Wild Speaker Diarization Nov 2, 2022 speaker-diarization Speaker Diarization
— Unverified 0A Comparative Study on Multichannel Speaker-Attributed Automatic Speech Recognition in Multi-party Meetings Nov 1, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0DiaCorrect: End-to-end error correction for speaker diarization Oct 31, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Target-Speaker Voice Activity Detection via Sequence-to-Sequence Prediction Oct 28, 2022 Action Detection Activity Detection
— Unverified 0On Out-of-Distribution Detection for Audio with Deep Nearest Neighbors Oct 27, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Privacy-preserving Automatic Speaker Diarization Oct 26, 2022 Privacy Preserving speaker-diarization
— Unverified 0TSUP Speaker Diarization System for Conversational Short-phrase Speaker Diarization Challenge Oct 26, 2022 Action Detection Activity Detection
— Unverified 0Highly Efficient Real-Time Streaming and Fully On-Device Speaker Diarization with Multi-Stage Clustering Oct 25, 2022 Clustering CPU
Code Code Available 2