Self-Supervised Metric Learning With Graph Clustering For Speaker Diarization Sep 14, 2021 Clustering Graph Clustering
Code Code Available 0Overlap-aware low-latency online speaker diarization based on end-to-end local segmentation Sep 14, 2021 Clustering Segmentation
Code Code Available 2Compositional Clustering: Applications to Multi-Label Object Recognition and Speaker Identification Sep 9, 2021 Clustering Few-Shot Learning
Code Code Available 0The DKU-DukeECE-Lenovo System for the Diarization Task of the 2021 VoxCeleb Speaker Recognition Challenge Sep 5, 2021 Action Detection Activity Detection
— Unverified 0The HW-TSC's Offline Speech Translation Systems for IWSLT 2021 Evaluation Aug 9, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Target-speaker Voice Activity Detection with Improved I-Vector Estimation for Unknown Number of Speaker Aug 7, 2021 Action Detection Activity Detection
— Unverified 0A Real-time Speaker Diarization System Based on Spatial Spectrum Jul 20, 2021 speaker-diarization Speaker Diarization
— Unverified 0A Comparative Study of Modular and Joint Approaches for Speaker-Attributed ASR on Monaural Long-Form Audio Jul 6, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Separation Guided Speaker Diarization in Realistic Mismatched Conditions Jul 6, 2021 Clustering speaker-diarization
— Unverified 0Development of a Conversation State Prediction System Jul 3, 2021 Prediction speaker-diarization
— Unverified 0Encoder-Decoder Based Attractors for End-to-End Neural Diarization Jun 20, 2021 Decoder speaker-diarization
Code Code Available 1Speaker-conversation factorial designs for diarization error analysis Jun 10, 2021 Clustering speaker-diarization
— Unverified 0End-to-End Speaker Diarization Conditioned on Speech Activity and Overlap Detection Jun 8, 2021 Clustering speaker-diarization
— Unverified 0DIVE: End-to-end Speech Diarization via Iterative Speaker Embedding May 28, 2021 speaker-diarization Speaker Diarization
— Unverified 0Advances in integration of end-to-end neural and clustering-based diarization for real conversational speech May 19, 2021 Clustering Constrained Clustering
Code Code Available 1X-Vectors with Multi-Scale Aggregation for Speaker Diarization May 16, 2021 speaker-diarization Speaker Diarization
— Unverified 0Self-supervised Representation Learning With Path Integral Clustering For Speaker Diarization Apr 19, 2021 Clustering Representation Learning
Code Code Available 0End-to-end speaker segmentation for overlap-aware resegmentation Apr 8, 2021 Action Detection Activity Detection
Code Code Available 1Three-class Overlapped Speech Detection using a Convolutional Recurrent Neural Network Apr 7, 2021 Binary Classification speaker-diarization
— Unverified 0Speaker Diarization using Two-pass Leave-One-Out Gaussian PLDA Clustering of DNN Embeddings Apr 6, 2021 Clustering speaker-diarization
Code Code Available 0LEAP Submission for the Third DIHARD Diarization Challenge Apr 6, 2021 Clustering speaker-diarization
— Unverified 0Speaker conditioned acoustic modeling for multi-speaker conversational ASR Apr 5, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Reformulating DOVER-Lap Label Mapping as a Graph Partitioning Problem Apr 5, 2021 graph partitioning speaker-diarization
Code Code Available 1ECAPA-TDNN Embeddings for Speaker Diarization Apr 3, 2021 speaker-diarization Speaker Diarization
— Unverified 0Data Fusion for Audiovisual Speaker Localization: Extending Dynamic Stream Weights to the Spatial Domain Feb 23, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Domain-Dependent Speaker Diarization for the Third DIHARD Challenge Jan 25, 2021 Clustering Dimensionality Reduction
— Unverified 0A Review of Speaker Diarization: Recent Advances with Deep Learning Jan 24, 2021 Deep Learning Retrieval
— Unverified 0End-to-End Speaker Diarization as Post-Processing Dec 18, 2020 Clustering Multi-Label Classification
— Unverified 0Speaker Recognition Based on Deep Learning: An Overview Dec 2, 2020 Deep Learning Domain Adaptation
— Unverified 0The Third DIHARD Diarization Challenge Dec 2, 2020 speaker-diarization Speaker Diarization
Code Code Available 1A Comprehensive Evaluation of Incremental Speech Recognition and Diarization for Conversational AI Dec 1, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0VoxLingua107: a Dataset for Spoken Language Recognition Nov 25, 2020 Action Detection Activity Detection
Code Code Available 1VOXLINGUA107: A DATASET FOR SPOKEN LANGUAGE RECOGNITION Nov 25, 2020 Action Detection Activity Detection
— Unverified 0BW-EDA-EEND: Streaming End-to-End Neural Speaker Diarization for a Variable Number of Speakers Nov 5, 2020 Clustering Decoder
— Unverified 0Integration of speech separation, diarization, and recognition for multi-speaker meetings: System description, comparison, and analysis Nov 3, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Third DIHARD Challenge Evaluation Plan Oct 30, 2020 speaker-diarization Speaker Diarization
— Unverified 0EML System Description for VoxCeleb Speaker Diarization Challenge 2020 Oct 23, 2020 CPU speaker-diarization
— Unverified 0Compositional embedding models for speaker identification and diarization with simultaneous speech from 2+ speakers Oct 22, 2020 speaker-diarization Speaker Diarization
Code Code Available 0Learning Disentangled Phone and Speaker Representations in a Semi-Supervised VQ-VAE Paradigm Oct 21, 2020 speaker-diarization Speaker Diarization
Code Code Available 1Novel Architectures for Unsupervised Information Bottleneck based Speaker Diarization of Meetings Oct 13, 2020 Clustering speaker-diarization
— Unverified 0Utterance Clustering Using Stereo Audio Channels Sep 10, 2020 Audio Signal Processing Clustering
— Unverified 0asya: Mindful verbal communication using deep learning Aug 20, 2020 Deep Learning speaker-diarization
— Unverified 0"This is Houston. Say again, please". The Behavox system for the Apollo-11 Fearless Steps Challenge (phase II) Aug 4, 2020 Action Detection Activity Detection
— Unverified 0Utterance-Wise Meeting Transcription System Using Asynchronous Distributed Microphones Jul 31, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Towards end-2-end learning for predicting behavior codes from spoken utterances in psychotherapy conversations Jul 1, 2020 Action Detection Activity Detection
— Unverified 0Speaker Diarization: Using Recurrent Neural Networks Jun 10, 2020 speaker-diarization Speaker Diarization
Code Code Available 1Speaker Diarization as a Fully Online Learning Problem in MiniVox Jun 8, 2020 Self-Supervised Learning speaker-diarization
Code Code Available 1Online End-to-End Neural Diarization with Speaker-Tracing Buffer Jun 4, 2020 speaker-diarization Speaker Diarization
— Unverified 0Neural Speaker Diarization with Speaker-Wise Chain Rule Jun 2, 2020 speaker-diarization Speaker Diarization
Code Code Available 0Speaker diarization with session-level speaker embedding refinement using graph neural networks May 22, 2020 Clustering speaker-diarization
— Unverified 0