PSRB: A Comprehensive Benchmark for Evaluating Persian ASR Systems | May 27, 2025 | Automatic Speech Recognition; Automatic Speech Recognition (ASR) | Unverified | 0
Loquacious Set: 25,000 Hours of Transcribed and Diverse English Speech Recognition Data for Research and Commercial Use | May 27, 2025 | Automatic Speech Recognition; Automatic Speech Recognition (ASR) | Unverified | 0
In-context Language Learning for Endangered Languages in Speech Recognition | May 26, 2025 | Automatic Speech Recognition; Automatic Speech Recognition (ASR) | Unverified | 0
Robust fine-tuning of speech recognition models via model merging: application to disordered speech | May 26, 2025 | Automatic Speech Recognition; Automatic Speech Recognition (ASR) | Unverified | 0
Beyond Manual Transcripts: The Potential of Automated Speech Recognition Errors in Improving Alzheimer's Disease Detection | May 26, 2025 | Alzheimer's Disease Detection; Automatic Speech Recognition | Unverified | 0
Continuous Learning for Children's ASR: Overcoming Catastrophic Forgetting with Elastic Weight Consolidation and Synaptic Intelligence | May 26, 2025 | Automatic Speech Recognition; Automatic Speech Recognition (ASR) | Unverified | 0
Exploring Generative Error Correction for Dysarthric Speech Recognition | May 26, 2025 | Automatic Speech Recognition; Automatic Speech Recognition (ASR) | Code Available | 0
KIT's Low-resource Speech Translation Systems for IWSLT2025: System Enhancement with Synthetic Data and Model Regularization | May 26, 2025 | Automatic Speech Recognition; Automatic Speech Recognition (ASR) | Unverified | 0
Mixture of LoRA Experts for Low-Resourced Multi-Accent Automatic Speech Recognition | May 26, 2025 | Automatic Speech Recognition; Automatic Speech Recognition (ASR) | Unverified | 0
CHSER: A Dataset and Case Study on Generative Speech Error Correction for Child ASR | May 24, 2025 | Automatic Speech Recognition; Automatic Speech Recognition (ASR) | Code Available | 0
VietASR: Achieving Industry-level Vietnamese ASR with 50-hour labeled data and Large-Scale Speech Pretraining | May 23, 2025 | Automatic Speech Recognition; Automatic Speech Recognition (ASR) | Unverified | 0
LLM-based Generative Error Correction for Rare Words with Synthetic Data and Phonetic Context | May 23, 2025 | Automatic Speech Recognition; Automatic Speech Recognition (ASR) | Code Available | 0
Daily-Omni: Towards Audio-Visual Reasoning with Temporal Alignment across Modalities | May 23, 2025 | Automatic Speech Recognition; Automatic Speech Recognition (ASR) | Code Available | 1
SoccerChat: Integrating Multimodal Data for Enhanced Soccer Game Understanding | May 22, 2025 | Action Classification; Automatic Speech Recognition | Code Available | 0
An Effective Training Framework for Light-Weight Automatic Speech Recognition Models | May 22, 2025 | Automatic Speech Recognition; Automatic Speech Recognition (ASR) | Unverified | 0
From Tens of Hours to Tens of Thousands: Scaling Back-Translation for Speech Recognition | May 22, 2025 | Automatic Speech Recognition; Automatic Speech Recognition (ASR) | Code Available | 1
Large Language Models based ASR Error Correction for Child Conversations | May 22, 2025 | Automatic Speech Recognition; Automatic Speech Recognition (ASR) | Unverified | 0
From Weak Labels to Strong Results: Utilizing 5,000 Hours of Noisy Classroom Transcripts with Minimal Accurate Data | May 20, 2025 | Automatic Speech Recognition; Automatic Speech Recognition (ASR) | Unverified | 0
In-Context Learning Boosts Speech Recognition via Human-like Adaptation to Speakers and Language Varieties | May 20, 2025 | Automatic Speech Recognition; Automatic Speech Recognition (ASR) | Unverified | 0
Towards Inclusive ASR: Investigating Voice Conversion for Dysarthric Speech Recognition in Low-Resource Languages | May 20, 2025 | Automatic Speech Recognition; Automatic Speech Recognition (ASR) | Code Available | 0
PersonaTAB: Predicting Personality Traits using Textual, Acoustic, and Behavioral Cues in Fully-Duplex Speech Dialogs | May 20, 2025 | Automatic Speech Recognition; Automatic Speech Recognition (ASR) | Code Available | 0
ASR-FAIRBENCH: Measuring and Benchmarking Equity Across Speech Recognition Systems | May 16, 2025 | Automatic Speech Recognition; Automatic Speech Recognition (ASR) | Unverified | 0
LipDiffuser: Lip-to-Speech Generation with Conditional Diffusion Models | May 16, 2025 | Automatic Speech Recognition; Automatic Speech Recognition (ASR) | Unverified | 0
Automatic Speech Recognition for African Low-Resource Languages: Challenges and Future Directions | May 16, 2025 | Automatic Speech Recognition; Automatic Speech Recognition (ASR) | Unverified | 0
LegoSLM: Connecting LLM with Speech Encoder using CTC Posteriors | May 16, 2025 | Automatic Speech Recognition; Automatic Speech Recognition (ASR) | Unverified | 0
Survey of End-to-End Multi-Speaker Automatic Speech Recognition for Monaural Audio | May 16, 2025 | Automatic Speech Recognition; Automatic Speech Recognition (ASR) | Unverified | 0
Multi-Stage Speaker Diarization for Noisy Classrooms | May 16, 2025 | Action Detection; Activity Detection | Code Available | 0
Remote Rowhammer Attack using Adversarial Observations on Federated Learning Clients | May 9, 2025 | Automatic Speech Recognition; Automatic Speech Recognition (ASR) | Unverified | 0
Teochew-Wild: The First In-the-wild Teochew Dataset with Orthographic Annotations | May 8, 2025 | Automatic Speech Recognition; Automatic Speech Recognition (ASR) | Unverified | 0
Fairness of Automatic Speech Recognition in Cleft Lip and Palate Speech | May 6, 2025 | Automatic Speech Recognition; Automatic Speech Recognition (ASR) | Unverified | 0
SepALM: Audio Language Models Are Error Correctors for Robust Speech Separation | May 6, 2025 | Automatic Speech Recognition; Automatic Speech Recognition (ASR) | Unverified | 0
VITA-Audio: Fast Interleaved Cross-Modal Token Generation for Efficient Large Speech-Language Model | May 6, 2025 | Automatic Speech Recognition; Automatic Speech Recognition (ASR) | Code Available | 4
Voila: Voice-Language Foundation Models for Real-Time Autonomous Interaction and Voice Role-Play | May 5, 2025 | AI Agent; Automatic Speech Recognition | Code Available | 3
Transfer Learning-Based Deep Residual Learning for Speech Recognition in Clean and Noisy Environments | May 2, 2025 | Automatic Speech Recognition; Automatic Speech Recognition (ASR) | Unverified | 0
Retrieval-Enhanced Few-Shot Prompting for Speech Event Extraction | Apr 30, 2025 | Automatic Speech Recognition; Automatic Speech Recognition (ASR) | Unverified | 0
BERSting at the Screams: A Benchmark for Distanced, Emotional and Shouted Speech Recognition | Apr 30, 2025 | Automatic Speech Recognition; Automatic Speech Recognition (ASR) | Code Available | 0
Chinese-LiPS: A Chinese audio-visual speech recognition dataset with Lip-reading and Presentation Slides | Apr 21, 2025 | Audio-Visual Speech Recognition; Automatic Speech Recognition | Unverified | 0
StableQuant: Layer Adaptive Post-Training Quantization for Speech Foundation Models | Apr 21, 2025 | Automatic Speech Recognition; Automatic Speech Recognition (ASR) | Unverified | 0
Acoustic to Articulatory Inversion of Speech; Data Driven Approaches, Challenges, Applications, and Future Scope | Apr 17, 2025 | Automatic Speech Recognition; Automatic Speech Recognition (ASR) | Unverified | 0
Advancing Arabic Speech Recognition Through Large-Scale Weakly Supervised Learning | Apr 16, 2025 | Arabic Speech Recognition; Automatic Speech Recognition | Unverified | 0
Spatial Audio Processing with Large Language Model on Wearable Devices | Apr 11, 2025 | Automatic Speech Recognition; Automatic Speech Recognition (ASR) | Unverified | 0
Visual-Aware Speech Recognition for Noisy Scenarios | Apr 9, 2025 | Audio-Visual Speech Recognition; Automatic Speech Recognition | Unverified | 0
DoCIA: An Online Document-Level Context Incorporation Agent for Speech Translation | Apr 7, 2025 | Automatic Speech Recognition; Automatic Speech Recognition (ASR) | Code Available | 0
LinTO Audio and Textual Datasets to Train and Evaluate Automatic Speech Recognition in Tunisian Arabic Dialect | Apr 3, 2025 | Automatic Speech Recognition; Automatic Speech Recognition (ASR) | Unverified | 0
Chain of Correction for Full-text Speech Recognition with Large Language Models | Apr 2, 2025 | Automatic Speech Recognition; Automatic Speech Recognition (ASR) | Unverified | 0
Whispering Under the Eaves: Protecting User Privacy Against Commercial and LLM-powered Automatic Speech Recognition Systems | Apr 1, 2025 | Automatic Speech Recognition; Automatic Speech Recognition (ASR) | Code Available | 0
The Impact of Code-switched Synthetic Data Quality is Task Dependent: Insights from MT and ASR | Mar 30, 2025 | Automatic Speech Recognition; Automatic Speech Recognition (ASR) | Unverified | 0
FinAudio: A Benchmark for Audio Large Language Models in Financial Applications | Mar 26, 2025 | Automatic Speech Recognition; Automatic Speech Recognition (ASR) | Unverified | 0
Dolphin: A Large-Scale Automatic Speech Recognition Model for Eastern Languages | Mar 26, 2025 | Automatic Speech Recognition; Automatic Speech Recognition (ASR) | Code Available | 4
Qwen2.5-Omni Technical Report | Mar 26, 2025 | Automatic Speech Recognition (ASR); GSM8K | Code Available | 7