Simultaneous Speech Extraction for Multiple Target Speakers under the Meeting Scenarios Jun 17, 2022 Action Detection Activity Detection
— Unverified 0Simultaneous Speech Recognition and Speaker Diarization for Monaural Dialogue Recordings with Target-Speaker Acoustic Models Sep 17, 2019 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Spatial-aware Speaker Diarization for Multi-channel Multi-party Meeting Sep 24, 2022 speaker-diarization Speaker Diarization
— Unverified 0Spatial-Temporal Activity-Informed Diarization and Separation Jan 30, 2024 speaker-diarization Speaker Diarization
— Unverified 0Speaker-conversation factorial designs for diarization error analysis Jun 10, 2021 Clustering speaker-diarization
— Unverified 0Speaker Diarization and Identification from Single-Channel Classroom Audio Recording Using Virtual Microphones Jul 1, 2022 speaker-diarization Speaker Diarization
— Unverified 0Speaker conditioned acoustic modeling for multi-speaker conversational ASR Apr 5, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Speaker Diarization for Low-Resource Languages Through Wav2vec Fine-Tuning Apr 23, 2025 Self-Supervised Learning speaker-diarization
— Unverified 0Speaker Diarization of Scripted Audiovisual Content Aug 4, 2023 speaker-diarization Speaker Diarization
— Unverified 0Speaker Diarization using Deep Recurrent Convolutional Neural Networks for Speaker Embeddings Aug 9, 2017 speaker-diarization Speaker Diarization
— Unverified 0Speaker diarization using latent space clustering in generative adversarial network Oct 24, 2019 Clustering Diagnostic
— Unverified 0Utterance Clustering Using Stereo Audio Channels Sep 10, 2020 Audio Signal Processing Clustering
— Unverified 0Speaker Diarization With Lexical Information Nov 27, 2018 Clustering speaker-diarization
— Unverified 0Speaker Diarization with Lexical Information Apr 13, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Speaker diarization with session-level speaker embedding refinement using graph neural networks May 22, 2020 Clustering speaker-diarization
— Unverified 0Speaker Embeddings With Weakly Supervised Voice Activity Detection For Efficient Speaker Diarization May 15, 2024 Action Detection Activity Detection
— Unverified 0Speaker Mask Transformer for Multi-talker Overlapped Speech Recognition Dec 18, 2023 speaker-diarization Speaker Diarization
— Unverified 0Speaker Recognition Based on Deep Learning: An Overview Dec 2, 2020 Deep Learning Domain Adaptation
— Unverified 0Speakers Unembedded: Embedding-free Approach to Long-form Neural Diarization Jun 26, 2024 Clustering Form
— Unverified 0Speaker Tagging Correction With Non-Autoregressive Language Models Aug 30, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Speech Trax: A Bottom to the Top Approach for Speaker Tracking and Indexing in an Archiving Context May 1, 2016 speaker-diarization Speaker Diarization
— Unverified 0Summary of the DISPLACE Challenge 2023 -- DIarization of SPeaker and LAnguage in Conversational Environments Nov 21, 2023 speaker-diarization Speaker Diarization
— Unverified 0Systematic Evaluation of Online Speaker Diarization Systems Regarding their Latency Jul 5, 2024 Online Clustering Segmentation
— Unverified 0System Description for the Displace Speaker Diarization Challenge 2023 Jun 20, 2024 Clustering speaker-diarization
— Unverified 0Tackling real noisy reverberant meetings with all-neural source separation, counting, and diarization system Mar 9, 2020 All speaker-diarization
— Unverified 0TalTech-IRIT-LIS Speaker and Language Diarization Systems for DISPLACE 2024 Jul 17, 2024 speaker-diarization Speaker Diarization
— Unverified 0Target-Speaker Voice Activity Detection: a Novel Approach for Multi-Speaker Diarization in a Dinner Party Scenario May 14, 2020 Action Detection Activity Detection
— Unverified 0Target-Speaker Voice Activity Detection via Sequence-to-Sequence Prediction Oct 28, 2022 Action Detection Activity Detection
— Unverified 0Target-speaker Voice Activity Detection with Improved I-Vector Estimation for Unknown Number of Speaker Aug 7, 2021 Action Detection Activity Detection
— Unverified 0Target Speaker Voice Activity Detection with Transformers and Its Integration with End-to-End Neural Diarization Aug 27, 2022 Action Detection Activity Detection
— Unverified 0Target Speech Diarization with Multimodal Prompts Jun 11, 2024 speaker-diarization Speaker Diarization
— Unverified 0TCG CREST System Description for the Second DISPLACE Challenge Sep 16, 2024 Action Detection Activity Detection
— Unverified 0The BabyView dataset: High-resolution egocentric videos of infants' and young children's everyday experiences Jun 14, 2024 Depth Estimation Image Segmentation
— Unverified 0The CHiME-7 Challenge: System Description and Performance of NeMo Team's DASR System Oct 18, 2023 Automatic Speech Recognition speaker-diarization
— Unverified 0The CUHK-TENCENT speaker diarization system for the ICASSP 2022 multi-channel multi-party meeting transcription challenge Feb 4, 2022 Action Detection Activity Detection
— Unverified 0The DKU-DukeECE-Lenovo System for the Diarization Task of the 2021 VoxCeleb Speaker Recognition Challenge Sep 5, 2021 Action Detection Activity Detection
— Unverified 0The ETAPE speech processing evaluation May 1, 2014 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0The HW-TSC's Offline Speech Translation Systems for IWSLT 2021 Evaluation Aug 9, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0The Multimodal Information Based Speech Processing (MISP) 2025 Challenge: Audio-Visual Diarization and Recognition May 20, 2025 Audio-Visual Speech Recognition speaker-diarization
— Unverified 0The Newsbridge -Telecom SudParis VoxCeleb Speaker Recognition Challenge 2022 System Description Jan 17, 2023 Action Detection Activity Detection
— Unverified 0End-to-End Supervised Hierarchical Graph Clustering for Speaker Diarization Jan 23, 2024 Clustering Graph Clustering
Code Code Available 0End-to-End Neural Speaker Diarization with Permutation-Free Objectives Sep 12, 2019 Clustering Domain Adaptation
Code Code Available 0Powerset multi-class cross entropy loss for neural speaker diarization Oct 19, 2023 Multi-class Classification Multi-Label Classification
Code Code Available 0EEND-SS: Joint End-to-End Neural Speaker Diarization and Speech Separation for Flexible Number of Speakers Mar 31, 2022 Decoder speaker-diarization
Code Code Available 0CountNet: Estimating the Number of Concurrent Speakers Using Supervised Learning Speaker Count Estimation Oct 28, 2018 blind source separation speaker-diarization
Code Code Available 0DiaCorrect: End-to-end error correction for speaker diarization Oct 31, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Probabilistic embeddings for speaker diarization Apr 6, 2020 Clustering speaker-diarization
Code Code Available 0Neural Diarization with Non-autoregressive Intermediate Attractors Mar 13, 2023 Decoder speaker-diarization
Code Code Available 0Data Fusion for Audiovisual Speaker Localization: Extending Dynamic Stream Weights to the Spatial Domain Feb 23, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0On the calibration of powerset speaker diarization models Sep 24, 2024 speaker-diarization Speaker Diarization
Code Code Available 0