The USTC-NERCSLIP Systems for The ICMC-ASR Challenge Jul 2, 2024 Automatic Speech Recognition Pseudo Label
— Unverified 00 The USTC-Ximalaya system for the ICASSP 2022 multi-channel multi-party meeting transcription (M2MeT) challenge Feb 10, 2022 Action Detection Activity Detection
— Unverified 00 The Volcspeech system for the ICASSP 2022 multi-channel multi-party meeting transcription challenge Feb 9, 2022 Data Augmentation Language Modelling
— Unverified 00 The xmuspeech system for multi-channel multi-party meeting transcription challenge Feb 11, 2022 speaker-diarization Speaker Diarization
— Unverified 00 Third DIHARD Challenge Evaluation Plan Oct 30, 2020 speaker-diarization Speaker Diarization
— Unverified 00 "This is Houston. Say again, please". The Behavox system for the Apollo-11 Fearless Steps Challenge (phase II) Aug 4, 2020 Action Detection Activity Detection
— Unverified 00 Three-class Overlapped Speech Detection using a Convolutional Recurrent Neural Network Apr 7, 2021 Binary Classification speaker-diarization
— Unverified 00 Tight integration of neural- and clustering-based diarization through deep unfolding of infinite Gaussian mixture model Feb 14, 2022 Clustering speaker-diarization
— Unverified 00 Toeplitz Inverse Covariance based Robust Speaker Clustering for Naturalistic Audio Streams Jul 12, 2019 Clustering speaker-diarization
— Unverified 00 TouchTTS: An Embarrassingly Simple TTS Framework that Everyone Can Touch Dec 11, 2024 Denoising speaker-diarization
— Unverified 00 Towards end-2-end learning for predicting behavior codes from spoken utterances in psychotherapy conversations Jul 1, 2020 Action Detection Activity Detection
— Unverified 00 Late Audio-Visual Fusion for In-The-Wild Speaker Diarization Nov 2, 2022 speaker-diarization Speaker Diarization
— Unverified 00 Towards Measuring and Scoring Speaker Diarization Fairness Feb 20, 2023 Fairness Sentence
— Unverified 00 Towards Robust Family-Infant Audio Analysis Based on Unsupervised Pretraining of Wav2vec 2.0 on Large-Scale Unlabeled Family Audio May 21, 2023 speaker-diarization Speaker Diarization
— Unverified 00 Towards Unsupervised Speaker Diarization System for Multilingual Telephone Calls Using Pre-trained Whisper Model and Mixture of Sparse Autoencoders Jul 2, 2024 Clustering speaker-diarization
— Unverified 00 Towards Word-Level End-to-End Neural Speaker Diarization with Auxiliary Network Sep 15, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 00 Training Speaker Embedding Extractors Using Multi-Speaker Audio with Unknown Speaker Boundaries Mar 29, 2022 speaker-diarization Speaker Diarization
— Unverified 00 Transcribe-to-Diarize: Neural Speaker Diarization for Unlimited Number of Speakers using End-to-End Speaker-Attributed ASR Oct 7, 2021 Action Detection Activity Detection
— Unverified 00 Triplet Network with Attention for Speaker Diarization Aug 4, 2018 Metric Learning speaker-diarization
— Unverified 00 TSUP Speaker Diarization System for Conversational Short-phrase Speaker Diarization Challenge Oct 26, 2022 Action Detection Activity Detection
— Unverified 00 Uncertainty Quantification in Machine Learning for Joint Speaker Diarization and Identification Dec 28, 2023 speaker-diarization Speaker Diarization
— Unverified 00 Unified Audio Event Detection Sep 13, 2024 Event Detection Sound Event Detection
— Unverified 00 Universal Speaker Embedding Free Target Speaker Extraction and Personal Voice Activity Detection Jan 7, 2025 Action Detection Activity Detection
— Unverified 00 UniX-Encoder: A Universal X-Channel Speech Encoder for Ad-Hoc Microphone Array Speech Processing Oct 25, 2023 speaker-diarization Speaker Diarization
— Unverified 00 Unsupervised Adaptation of SPLDA Nov 20, 2015 speaker-diarization Speaker Diarization
— Unverified 00 Unsupervised Speaker Diarization in Distributed IoT Networks Using Federated Learning Apr 16, 2024 Change Detection Federated Learning
— Unverified 00 Unsupervised Speaker Diarization that is Agnostic to Language, Overlap-Aware, and Tuning Free Jul 25, 2022 speaker-diarization Speaker Diarization
— Unverified 00 Using Active Speaker Faces for Diarization in TV shows Mar 30, 2022 Face Clustering Face Detection
— Unverified 00 Utterance-Wise Meeting Transcription System Using Asynchronous Distributed Microphones Jul 31, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 00 UWB-NTIS Speaker Diarization System for the DIHARD II 2019 Challenge May 27, 2019 Clustering speaker-diarization
— Unverified 00 VOXLINGUA107: A DATASET FOR SPOKEN LANGUAGE RECOGNITION Nov 25, 2020 Action Detection Activity Detection
— Unverified 00 VoxRAG: A Step Toward Transcription-Free RAG Systems in Spoken Question Answering May 22, 2025 Question Answering RAG
— Unverified 00 Weakly Supervised Training of Speaker Identification Models Jun 22, 2018 speaker-diarization Speaker Diarization
— Unverified 00 An approach to optimize inference of the DIART speaker diarization pipeline Aug 5, 2024 Inference Optimization Knowledge Distillation
— Unverified 00 X-Vectors with Multi-Scale Aggregation for Speaker Diarization May 16, 2021 speaker-diarization Speaker Diarization
— Unverified 00 A Benchmark for Multi-speaker Anonymization Jul 8, 2024 Benchmarking Disentanglement
— Unverified 00 A Comparative Study of Modular and Joint Approaches for Speaker-Attributed ASR on Monaural Long-Form Audio Jul 6, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 00 A Comparative Study on Multichannel Speaker-Attributed Automatic Speech Recognition in Multi-party Meetings Nov 1, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 00 Advances in Online Audio-Visual Meeting Transcription Dec 10, 2019 Sound Source Localization speaker-diarization
— Unverified 00 A framework for the automatic inference of stochastic turn-taking styles Sep 1, 2016 Speaker Diarization
— Unverified 00 Afrispeech-Dialog: A Benchmark Dataset for Spontaneous English Conversations in Healthcare and Beyond Feb 6, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 00 AG-LSEC: Audio Grounded Lexical Speaker Error Correction Jun 25, 2024 Language Modeling Language Modelling
— Unverified 00 Aligning Speakers: Evaluating and Visualizing Text-based Diarization Using Efficient Multiple Sequence Alignment (Extended Version) Sep 14, 2023 Multiple Sequence Alignment speaker-diarization
— Unverified 00 All-neural online source separation, counting, and diarization for meeting analysis Feb 21, 2019 All Automatic Speech Recognition
— Unverified 00 An Alternative to Low-level-Sychrony-Based Methods for Speech Detection Dec 1, 2010 Facial Expression Recognition Facial Expression Recognition (FER)
— Unverified 00 An automated medical scribe for documenting clinical encounters Jun 1, 2018 speaker-diarization Speaker Diarization
— Unverified 00 An Effortless Way To Create Large-Scale Datasets For Famous Speakers May 1, 2014 Person Identification Speaker Diarization
— Unverified 00 An Experimental Review of Speaker Diarization methods with application to Two-Speaker Conversational Telephone Speech recordings May 29, 2023 Clustering speaker-diarization
— Unverified 00 An Infinite Hidden Markov Model With Similarity-Biased Transitions Jul 21, 2017 speaker-diarization Speaker Diarization
— Unverified 00 基於i-vector與PLDA並使用GMM-HMM強制對位之自動語者分段標記系統 (Speaker Diarization based on I-vector PLDA Scoring and using GMM-HMM Forced Alignment) [In Chinese] Nov 1, 2017 speaker-diarization Speaker Diarization
— Unverified 00