Open Source MagicData-RAMC: A Rich Annotated Mandarin Conversational(RAMC) Speech Dataset Mar 31, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Multi-scale Speaker Diarization with Dynamic Scale Weighting Mar 30, 2022 Decoder speaker-diarization
— Unverified 0Using Active Speaker Faces for Diarization in TV shows Mar 30, 2022 Face Clustering Face Detection
— Unverified 0Generation of Speaker Representations Using Heterogeneous Training Batch Assembly Mar 30, 2022 speaker-diarization Speaker Diarization
— Unverified 0Training Speaker Embedding Extractors Using Multi-Speaker Audio with Unknown Speaker Boundaries Mar 29, 2022 speaker-diarization Speaker Diarization
— Unverified 0Visualizations of Complex Sequences of Family-Infant Vocalizations Using Bag-of-Audio-Words Approach Based on Wav2vec 2.0 Features Mar 29, 2022 speaker-diarization Speaker Diarization
Code Code Available 0Speaker Embedding-aware Neural Diarization: an Efficient Framework for Overlapping Speech Diarization in Meeting Scenarios Mar 18, 2022 Action Detection Activity Detection
Code Code Available 0Tight integration of neural- and clustering-based diarization through deep unfolding of infinite Gaussian mixture model Feb 14, 2022 Clustering speaker-diarization
— Unverified 0The xmuspeech system for multi-channel multi-party meeting transcription challenge Feb 11, 2022 speaker-diarization Speaker Diarization
— Unverified 0The USTC-Ximalaya system for the ICASSP 2022 multi-channel multi-party meeting transcription (M2MeT) challenge Feb 10, 2022 Action Detection Activity Detection
— Unverified 0Royalflush Speaker Diarization System for ICASSP 2022 Multi-channel Multi-party Meeting Transcription Challenge Feb 10, 2022 speaker-diarization Speaker Diarization
— Unverified 0The Volcspeech system for the ICASSP 2022 multi-channel multi-party meeting transcription challenge Feb 9, 2022 Data Augmentation Language Modelling
— Unverified 0Cross-Channel Attention-Based Target Speaker Voice Activity Detection: Experimental Results for M2MeT Challenge Feb 6, 2022 Action Detection Activity Detection
— Unverified 0The CUHK-TENCENT speaker diarization system for the ICASSP 2022 multi-channel multi-party meeting transcription challenge Feb 4, 2022 Action Detection Activity Detection
— Unverified 0Speaker Embedding-aware Neural Diarization for Flexible Number of Speakers with Textual Information Nov 28, 2021 Action Detection Activity Detection
Code Code Available 0Low-Latency Online Speaker Diarization with Graph-Based Label Generation Nov 27, 2021 Clustering speaker-diarization
— Unverified 0Auxiliary Loss of Transformer with Residual Connection for End-to-End Speaker Diarization Oct 14, 2021 speaker-diarization Speaker Diarization
— Unverified 0Multi-Channel End-to-End Neural Diarization with Distributed Microphones Oct 10, 2021 speaker-diarization Speaker Diarization
— Unverified 0Transcribe-to-Diarize: Neural Speaker Diarization for Unlimited Number of Speakers using End-to-End Speaker-Attributed ASR Oct 7, 2021 Action Detection Activity Detection
— Unverified 0North America Bixby Speaker Diarization System for the VoxCeleb Speaker Recognition Challenge 2021 Sep 28, 2021 Clustering speaker-diarization
— Unverified 0Self-Supervised Metric Learning With Graph Clustering For Speaker Diarization Sep 14, 2021 Clustering Graph Clustering
Code Code Available 0Compositional Clustering: Applications to Multi-Label Object Recognition and Speaker Identification Sep 9, 2021 Clustering Few-Shot Learning
Code Code Available 0The DKU-DukeECE-Lenovo System for the Diarization Task of the 2021 VoxCeleb Speaker Recognition Challenge Sep 5, 2021 Action Detection Activity Detection
— Unverified 0The HW-TSC's Offline Speech Translation Systems for IWSLT 2021 Evaluation Aug 9, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Target-speaker Voice Activity Detection with Improved I-Vector Estimation for Unknown Number of Speaker Aug 7, 2021 Action Detection Activity Detection
— Unverified 0A Real-time Speaker Diarization System Based on Spatial Spectrum Jul 20, 2021 speaker-diarization Speaker Diarization
— Unverified 0A Comparative Study of Modular and Joint Approaches for Speaker-Attributed ASR on Monaural Long-Form Audio Jul 6, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Separation Guided Speaker Diarization in Realistic Mismatched Conditions Jul 6, 2021 Clustering speaker-diarization
— Unverified 0Development of a Conversation State Prediction System Jul 3, 2021 Prediction speaker-diarization
— Unverified 0Speaker-conversation factorial designs for diarization error analysis Jun 10, 2021 Clustering speaker-diarization
— Unverified 0End-to-End Speaker Diarization Conditioned on Speech Activity and Overlap Detection Jun 8, 2021 Clustering speaker-diarization
— Unverified 0DIVE: End-to-end Speech Diarization via Iterative Speaker Embedding May 28, 2021 speaker-diarization Speaker Diarization
— Unverified 0X-Vectors with Multi-Scale Aggregation for Speaker Diarization May 16, 2021 speaker-diarization Speaker Diarization
— Unverified 0Self-supervised Representation Learning With Path Integral Clustering For Speaker Diarization Apr 19, 2021 Clustering Representation Learning
Code Code Available 0Three-class Overlapped Speech Detection using a Convolutional Recurrent Neural Network Apr 7, 2021 Binary Classification speaker-diarization
— Unverified 0LEAP Submission for the Third DIHARD Diarization Challenge Apr 6, 2021 Clustering speaker-diarization
— Unverified 0Speaker Diarization using Two-pass Leave-One-Out Gaussian PLDA Clustering of DNN Embeddings Apr 6, 2021 Clustering speaker-diarization
Code Code Available 0Speaker conditioned acoustic modeling for multi-speaker conversational ASR Apr 5, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0ECAPA-TDNN Embeddings for Speaker Diarization Apr 3, 2021 speaker-diarization Speaker Diarization
— Unverified 0Data Fusion for Audiovisual Speaker Localization: Extending Dynamic Stream Weights to the Spatial Domain Feb 23, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Domain-Dependent Speaker Diarization for the Third DIHARD Challenge Jan 25, 2021 Clustering Dimensionality Reduction
— Unverified 0A Review of Speaker Diarization: Recent Advances with Deep Learning Jan 24, 2021 Deep Learning Retrieval
— Unverified 0End-to-End Speaker Diarization as Post-Processing Dec 18, 2020 Clustering Multi-Label Classification
— Unverified 0Speaker Recognition Based on Deep Learning: An Overview Dec 2, 2020 Deep Learning Domain Adaptation
— Unverified 0A Comprehensive Evaluation of Incremental Speech Recognition and Diarization for Conversational AI Dec 1, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0VOXLINGUA107: A DATASET FOR SPOKEN LANGUAGE RECOGNITION Nov 25, 2020 Action Detection Activity Detection
— Unverified 0BW-EDA-EEND: Streaming End-to-End Neural Speaker Diarization for a Variable Number of Speakers Nov 5, 2020 Clustering Decoder
— Unverified 0Integration of speech separation, diarization, and recognition for multi-speaker meetings: System description, comparison, and analysis Nov 3, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Third DIHARD Challenge Evaluation Plan Oct 30, 2020 speaker-diarization Speaker Diarization
— Unverified 0EML System Description for VoxCeleb Speaker Diarization Challenge 2020 Oct 23, 2020 CPU speaker-diarization
— Unverified 0