Speaker Diarization with LSTM Oct 28, 2017 Clustering speaker-diarization
Code Code Available 15 The Conversational Short-phrase Speaker Diarization (CSSD) Task: Dataset, Evaluation Metric and Baselines Aug 17, 2022 Machine Translation speaker-diarization
Code Code Available 15 Learning Disentangled Phone and Speaker Representations in a Semi-Supervised VQ-VAE Paradigm Oct 21, 2020 speaker-diarization Speaker Diarization
Code Code Available 15 End-to-End Speaker Diarization for an Unknown Number of Speakers with Encoder-Decoder Based Attractors May 20, 2020 Clustering Decoder
Code Code Available 15 DiaPer: End-to-End Neural Diarization with Perceiver-Based Attractors Dec 7, 2023 Decoder speaker-diarization
Code Code Available 15 Enhancing Speaker Diarization with Large Language Models: A Contextual Beam Search Approach Sep 11, 2023 speaker-diarization Speaker Diarization
Code Code Available 15 CountNet: Estimating the Number of Concurrent Speakers Using Supervised Learning Speaker Count Estimation Oct 28, 2018 blind source separation speaker-diarization
Code Code Available 05 Visualizations of Complex Sequences of Family-Infant Vocalizations Using Bag-of-Audio-Words Approach Based on Wav2vec 2.0 Features Mar 29, 2022 speaker-diarization Speaker Diarization
Code Code Available 05 Ultrasound tongue imaging for diarization and alignment of child speech therapy sessions Jul 1, 2019 speaker-diarization Speaker Diarization
Code Code Available 05 A Comprehensive Evaluation of Incremental Speech Recognition and Diarization for Conversational AI Dec 1, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 05 Compositional embedding models for speaker identification and diarization with simultaneous speech from 2+ speakers Oct 22, 2020 speaker-diarization Speaker Diarization
Code Code Available 05 TOLD: A Novel Two-Stage Overlap-Aware Framework for Speaker Diarization Mar 8, 2023 speaker-diarization Speaker Diarization
Code Code Available 05 Compositional Clustering: Applications to Multi-Label Object Recognition and Speaker Identification Sep 9, 2021 Clustering Few-Shot Learning
Code Code Available 05 The EURECOM Submission to the First DIHARD Challenge Sep 6, 2018 Clustering speaker-diarization
Code Code Available 05 The Second DIHARD Diarization Challenge: Dataset, task, and baselines Jun 18, 2019 Action Detection Activity Detection
Code Code Available 05 Supervised online diarization with sample mean loss for multi-domain data Nov 4, 2019 Clustering speaker-diarization
Code Code Available 05 Speaker Embedding-aware Neural Diarization for Flexible Number of Speakers with Textual Information Nov 28, 2021 Action Detection Activity Detection
Code Code Available 05 3D-Speaker-Toolkit: An Open-Source Toolkit for Multimodal Speaker Verification and Diarization Mar 29, 2024 Self-Supervised Learning speaker-diarization
Code Code Available 05 Speaker Embedding-aware Neural Diarization: an Efficient Framework for Overlapping Speech Diarization in Meeting Scenarios Mar 18, 2022 Action Detection Activity Detection
Code Code Available 05 Supervised Hierarchical Clustering using Graph Neural Networks for Speaker Diarization Feb 24, 2023 Clustering Graph Clustering
Code Code Available 05 Self-Tuning Spectral Clustering for Speaker Diarization Sep 16, 2024 Clustering speaker-diarization
Code Code Available 05 Sortformer: Seamless Integration of Speaker Diarization and ASR by Bridging Timestamps and Tokens Sep 10, 2024 speaker-diarization Speaker Diarization
Code Code Available 05 End-to-End Neural Speaker Diarization with Permutation-Free Objectives Sep 12, 2019 Clustering Domain Adaptation
Code Code Available 05 Self-Supervised Metric Learning With Graph Clustering For Speaker Diarization Sep 14, 2021 Clustering Graph Clustering
Code Code Available 05 Self-supervised Representation Learning With Path Integral Clustering For Speaker Diarization Apr 19, 2021 Clustering Representation Learning
Code Code Available 05 Probabilistic embeddings for speaker diarization Apr 6, 2020 Clustering speaker-diarization
Code Code Available 05 Robust speaker recognition using unsupervised adversarial invariance Nov 3, 2019 speaker-diarization Speaker Diarization
Code Code Available 05 EEND-SS: Joint End-to-End Neural Speaker Diarization and Speech Separation for Flexible Number of Speakers Mar 31, 2022 Decoder speaker-diarization
Code Code Available 05 Automating Feedback Analysis in Surgical Training: Detection, Categorization, and Assessment Dec 1, 2024 Action Detection Activity Detection
Code Code Available 05 Powerset multi-class cross entropy loss for neural speaker diarization Oct 19, 2023 Multi-class Classification Multi-Label Classification
Code Code Available 05 Scalable Adaptation of State Complexity for Nonparametric Hidden Markov Models Dec 1, 2015 speaker-diarization Speaker Diarization
Code Code Available 05 Speaker Diarization using Two-pass Leave-One-Out Gaussian PLDA Clustering of DNN Embeddings Apr 6, 2021 Clustering speaker-diarization
Code Code Available 05 Neural Speaker Diarization with Speaker-Wise Chain Rule Jun 2, 2020 speaker-diarization Speaker Diarization
Code Code Available 05 On Out-of-Distribution Detection for Audio with Deep Nearest Neighbors Oct 27, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 05 Neural Diarization with Non-autoregressive Intermediate Attractors Mar 13, 2023 Decoder speaker-diarization
Code Code Available 05 On the calibration of powerset speaker diarization models Sep 24, 2024 speaker-diarization Speaker Diarization
Code Code Available 05 LSTM based Similarity Measurement with Spectral Clustering for Speaker Diarization Jul 23, 2019 Change Detection Clustering
Code Code Available 05 Multichannel AV-wav2vec2: A Framework for Learning Multichannel Multi-Modal Speech Representation Jan 7, 2024 Audio-Visual Speech Recognition Automatic Speech Recognition
Code Code Available 05 Multi-Stage Speaker Diarization for Noisy Classrooms May 16, 2025 Action Detection Activity Detection
Code Code Available 05 End-to-End Supervised Hierarchical Graph Clustering for Speaker Diarization Jan 23, 2024 Clustering Graph Clustering
Code Code Available 05 DiaCorrect: End-to-end error correction for speaker diarization Oct 31, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 05 Fully Supervised Speaker Diarization Oct 10, 2018 Clustering speaker-diarization
Code Code Available 05 Data Fusion for Audiovisual Speaker Localization: Extending Dynamic Stream Weights to the Spatial Domain Feb 23, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 05 Long-term Conversation Analysis: Exploring Utility and Privacy Jun 28, 2023 Action Detection Activity Detection
Code Code Available 05 Cross-Channel Attention-Based Target Speaker Voice Activity Detection: Experimental Results for M2MeT Challenge Feb 6, 2022 Action Detection Activity Detection
— Unverified 00 A sticky HDP-HMM with application to speaker diarization May 15, 2009 speaker-diarization Speaker Diarization
— Unverified 00 Constrained speaker diarization of TV series based on visual patterns Dec 18, 2018 Clustering speaker-diarization
— Unverified 00 Computer-assisted Speaker Diarization: How to Evaluate Human Corrections May 1, 2018 Active Learning Face Recognition
— Unverified 00 Assessing the Robustness of Spectral Clustering for Deep Speaker Diarization Mar 21, 2024 Clustering speaker-diarization
— Unverified 00 All-neural online source separation, counting, and diarization for meeting analysis Feb 21, 2019 All Automatic Speech Recognition
— Unverified 00