TalTech Systems for the Interspeech 2025 ML-SUPERB 2.0 Challenge Jun 2, 2025 Language Identification speech-recognition
— Unverified 0Self-Supervised Speech Quality Assessment (S3QA): Leveraging Speech Foundation Models for a Scalable Speech Quality Metric Jun 2, 2025 Automatic Speech Recognition speech-recognition
— Unverified 0What do self-supervised speech models know about Dutch? Analyzing advantages of language-specific pre-training Jun 1, 2025 Automatic Speech Recognition speech-recognition
Code Code Available 0Enhancing Speech Instruction Understanding and Disambiguation in Robotics via Speech Prosody Jun 1, 2025 In-Context Learning speech-recognition
— Unverified 0Towards Temporally Explainable Dysarthric Speech Clarity Assessment May 31, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0DYNAC: Dynamic Vocabulary based Non-Autoregressive Contextualization for Speech Recognition May 31, 2025 Automatic Speech Recognition speech-recognition
— Unverified 0Causal Structure Discovery for Error Diagnostics of Children's ASR May 31, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0No Audiogram: Leveraging Existing Scores for Personalized Speech Intelligibility Prediction May 31, 2025 Prediction speech-recognition
— Unverified 0Chain-of-Thought Training for Open E2E Spoken Dialogue Systems May 31, 2025 Language Modeling Language Modelling
— Unverified 0Improving Multilingual Speech Models on ML-SUPERB 2.0: Fine-tuning with Data Augmentation and LID-Aware CTC May 30, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Dynamic Context-Aware Streaming Pretrained Language Model For Inverse Text Normalization May 30, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0MSDA: Combining Pseudo-labeling and Self-Supervision for Unsupervised Domain Adaptation in ASR May 30, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0MOPSA: Mixture of Prompt-Experts Based Speaker Adaptation for Elderly Speech Recognition May 30, 2025 Decoder speech-recognition
— Unverified 0Running Conventional Automatic Speech Recognition on Memristor Hardware: A Simulated Approach May 30, 2025 Automatic Speech Recognition Quantization
— Unverified 0Fewer Hallucinations, More Verification: A Three-Stage LLM-Based Framework for ASR Error Correction May 30, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Vedavani: A Benchmark Corpus for ASR on Vedic Sanskrit Poetry May 30, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0BeaverTalk: Oregon State University's IWSLT 2025 Simultaneous Speech Translation System May 29, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Prompting Whisper for Improved Verbatim Transcription and End-to-end Miscue Detection May 29, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Contextualized Automatic Speech Recognition with Dynamic Vocabulary Prediction and Activation May 29, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Evaluation of LLMs in Speech is Often Flawed: Test Set Contamination in Large Language Models for Speech Recognition May 28, 2025 speech-recognition Speech Recognition
— Unverified 0NGPU-LM: GPU-Accelerated N-Gram Language Model for Context-Biasing in Greedy ASR Decoding May 28, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Advancing Hearing Assessment: An ASR-Based Frequency-Specific Speech Test for Diagnosing Presbycusis May 28, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0CNVSRC 2024: The Second Chinese Continuous Visual Speech Recognition Challenge May 27, 2025 Diversity speech-recognition
— Unverified 0Loquacious Set: 25,000 Hours of Transcribed and Diverse English Speech Recognition Data for Research and Commercial Use May 27, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Topological Deep Learning for Speech Data May 27, 2025 Deep Learning Phoneme Recognition
— Unverified 0Leveraging Large Language Models in Visual Speech Recognition: Model Scaling, Context-Aware Decoding, and Iterative Polishing May 27, 2025 speech-recognition Speech Recognition
— Unverified 0Towards Pretraining Robust ASR Foundation Model with Acoustic-Aware Data Augmentation May 27, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Leveraging LLM and Self-Supervised Training Models for Speech Recognition in Chinese Dialects: A Comparative Analysis May 27, 2025 Accented Speech Recognition Self-Supervised Learning
— Unverified 0PSRB: A Comprehensive Benchmark for Evaluating Persian ASR Systems May 27, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Beyond Manual Transcripts: The Potential of Automated Speech Recognition Errors in Improving Alzheimer's Disease Detection May 26, 2025 Alzheimer's Disease Detection Automatic Speech Recognition
— Unverified 0Robust fine-tuning of speech recognition models via model merging: application to disordered speech May 26, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Mixture of LoRA Experts for Low-Resourced Multi-Accent Automatic Speech Recognition May 26, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Exploring Generative Error Correction for Dysarthric Speech Recognition May 26, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Novel Loss-Enhanced Universal Adversarial Patches for Sustainable Speaker Privacy May 26, 2025 Speaker anonymization speech-recognition
— Unverified 0KIT's Low-resource Speech Translation Systems for IWSLT2025: System Enhancement with Synthetic Data and Model Regularization May 26, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Languages in Multilingual Speech Foundation Models Align Both Phonetically and Semantically May 26, 2025 Retrieval speech-recognition
— Unverified 0In-context Language Learning for Endangered Languages in Speech Recognition May 26, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0The NaijaVoices Dataset: Cultivating Large-Scale, High-Quality, Culturally-Rich Speech Data for African Languages May 26, 2025 Automatic Speech Recognition Diversity
— Unverified 0Continuous Learning for Children's ASR: Overcoming Catastrophic Forgetting with Elastic Weight Consolidation and Synaptic Intelligence May 26, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0WhisperD: Dementia Speech Recognition and Filler Word Detection with Whisper May 25, 2025 speech-recognition Speech Recognition
— Unverified 0CHSER: A Dataset and Case Study on Generative Speech Error Correction for Child ASR May 24, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Building a Functional Machine Translation Corpus for Kpelle May 24, 2025 Data Augmentation Language Modelling
— Unverified 0StandUp4AI: A New Multilingual Dataset for Humor Detection in Stand-up Comedy Videos May 24, 2025 Humor Detection speech-recognition
— Unverified 0Swedish Whispers; Leveraging a Massive Speech Corpus for Swedish Speech Recognition May 23, 2025 speech-recognition Speech Recognition
— Unverified 0LLM-based Generative Error Correction for Rare Words with Synthetic Data and Phonetic Context May 23, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0VietASR: Achieving Industry-level Vietnamese ASR with 50-hour labeled data and Large-Scale Speech Pretraining May 23, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0An Effective Training Framework for Light-Weight Automatic Speech Recognition Models May 22, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0SoccerChat: Integrating Multimodal Data for Enhanced Soccer Game Understanding May 22, 2025 Action Classification Automatic Speech Recognition
Code Code Available 0Large Language Models based ASR Error Correction for Child Conversations May 22, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Word Level Timestamp Generation for Automatic Speech Recognition and Translation May 21, 2025 Automatic Speech Recognition automatic-speech-translation
— Unverified 0