AS-ASR: A Lightweight Framework for Aphasia-Specific Automatic Speech Recognition | Jun 6, 2025 | Automatic Speech Recognition, speech-recognition | Unverified | 0
Bridging the Modality Gap: Softly Discretizing Audio Representation for LLM-based Automatic Speech Recognition | Jun 6, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
Low-Resource Domain Adaptation for Speech LLMs via Text-Only Fine-Tuning | Jun 6, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
Lightweight Prompt Biasing for Contextualized End-to-End ASR Systems | Jun 6, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
Better Pseudo-labeling with Multi-ASR Fusion and Error Correction by SpeechLLM | Jun 5, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
Customizing Speech Recognition Model with Large Language Model Feedback | Jun 5, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
ViCocktail: Automated Multi-Modal Data Collection for Vietnamese Audio-Visual Speech Recognition | Jun 5, 2025 | Audio-Visual Speech Recognition, speech-recognition | Unverified | 0
LLM-based phoneme-to-grapheme for phoneme-based speech recognition | Jun 5, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
LESS: Large Language Model Enhanced Semi-Supervised Learning for Speech Foundational Models | Jun 5, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
Improving Child Speech Recognition and Reading Mistake Detection by Using Prompts | Jun 4, 2025 | Mistake Detection, speech-recognition | Unverified | 0
Structured State Space Model Dynamics and Parametrization for Spiking Neural Networks | Jun 4, 2025 | speech-recognition, Speech Recognition | Code Available | 0
Effects of Speaker Count, Duration, and Accent Diversity on Zero-Shot Accent Robustness in Low-Resource ASR | Jun 4, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
MFLA: Monotonic Finite Look-ahead Attention for Streaming Speech Recognition | Jun 4, 2025 | speech-recognition, Speech Recognition | Unverified | 0
A Multi-Dialectal Dataset for German Dialect ASR and Dialect-to-Standard Speech Translation | Jun 3, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
Overcoming Data Scarcity in Multi-Dialectal Arabic ASR via Whisper Fine-Tuning | Jun 3, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
Enhancing Lyrics Transcription on Music Mixtures with Consistency Loss | Jun 3, 2025 | Automatic Lyrics Transcription, Automatic Speech Recognition | Unverified | 0
Whale: Large-Scale multilingual ASR model with w2v-BERT and E-Branchformer with large speech data | Jun 2, 2025 | Decoder, speech-recognition | Unverified | 0
TalTech Systems for the Interspeech 2025 ML-SUPERB 2.0 Challenge | Jun 2, 2025 | Language Identification, speech-recognition | Unverified | 0
HENT-SRT: Hierarchical Efficient Neural Transducer with Self-Distillation for Joint Speech Recognition and Translation | Jun 2, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
DNCASR: End-to-End Training for Speaker-Attributed ASR | Jun 2, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
Riemannian Time Warping: Multiple Sequence Alignment in Curved Spaces | Jun 2, 2025 | Multiple Sequence Alignment, speech-recognition | Unverified | 0
Cocktail-Party Audio-Visual Speech Recognition | Jun 2, 2025 | Audio-Visual Speech Recognition, speech-recognition | Unverified | 0
Self-Supervised Speech Quality Assessment (S3QA): Leveraging Speech Foundation Models for a Scalable Speech Quality Metric | Jun 2, 2025 | Automatic Speech Recognition, speech-recognition | Unverified | 0
Unsupervised Rhythm and Voice Conversion to Improve ASR on Dysarthric Speech | Jun 2, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0
Analyzing the Importance of Blank for CTC-Based Knowledge Distillation | Jun 2, 2025 | Automatic Speech Recognition, Knowledge Distillation | Unverified | 0
WCTC-Biasing: Retraining-free Contextual Biasing ASR with Wildcard CTC-based Keyword Spotting and Inter-layer Biasing | Jun 2, 2025 | Keyword Spotting, speech-recognition | Unverified | 0
Enhancing Speech Instruction Understanding and Disambiguation in Robotics via Speech Prosody | Jun 1, 2025 | In-Context Learning, speech-recognition | Unverified | 0
What do self-supervised speech models know about Dutch? Analyzing advantages of language-specific pre-training | Jun 1, 2025 | Automatic Speech Recognition, speech-recognition | Code Available | 0
GigaAM: Efficient Self-Supervised Learner for Speech Recognition | Jun 1, 2025 | Automatic Speech Recognition, Language Modeling | Code Available | 4
No Audiogram: Leveraging Existing Scores for Personalized Speech Intelligibility Prediction | May 31, 2025 | Prediction, speech-recognition | Unverified | 0
DYNAC: Dynamic Vocabulary based Non-Autoregressive Contextualization for Speech Recognition | May 31, 2025 | Automatic Speech Recognition, speech-recognition | Unverified | 0
Chain-of-Thought Training for Open E2E Spoken Dialogue Systems | May 31, 2025 | Language Modeling, Language Modelling | Unverified | 0
Causal Structure Discovery for Error Diagnostics of Children's ASR | May 31, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
Towards Temporally Explainable Dysarthric Speech Clarity Assessment | May 31, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0
MOPSA: Mixture of Prompt-Experts Based Speaker Adaptation for Elderly Speech Recognition | May 30, 2025 | Decoder, speech-recognition | Unverified | 0
Vedavani: A Benchmark Corpus for ASR on Vedic Sanskrit Poetry | May 30, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0
Dynamic Context-Aware Streaming Pretrained Language Model For Inverse Text Normalization | May 30, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
MSDA: Combining Pseudo-labeling and Self-Supervision for Unsupervised Domain Adaptation in ASR | May 30, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
Improving Multilingual Speech Models on ML-SUPERB 2.0: Fine-tuning with Data Augmentation and LID-Aware CTC | May 30, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
Running Conventional Automatic Speech Recognition on Memristor Hardware: A Simulated Approach | May 30, 2025 | Automatic Speech Recognition, Quantization | Unverified | 0
Fewer Hallucinations, More Verification: A Three-Stage LLM-Based Framework for ASR Error Correction | May 30, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
BeaverTalk: Oregon State University's IWSLT 2025 Simultaneous Speech Translation System | May 29, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0
Prompting Whisper for Improved Verbatim Transcription and End-to-end Miscue Detection | May 29, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
Contextualized Automatic Speech Recognition with Dynamic Vocabulary Prediction and Activation | May 29, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
NGPU-LM: GPU-Accelerated N-Gram Language Model for Context-Biasing in Greedy ASR Decoding | May 28, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
Advancing Hearing Assessment: An ASR-Based Frequency-Specific Speech Test for Diagnosing Presbycusis | May 28, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
Evaluation of LLMs in Speech is Often Flawed: Test Set Contamination in Large Language Models for Speech Recognition | May 28, 2025 | speech-recognition, Speech Recognition | Unverified | 0
CNVSRC 2024: The Second Chinese Continuous Visual Speech Recognition Challenge | May 27, 2025 | Diversity, speech-recognition | Unverified | 0
Leveraging Large Language Models in Visual Speech Recognition: Model Scaling, Context-Aware Decoding, and Iterative Polishing | May 27, 2025 | speech-recognition, Speech Recognition | Unverified | 0
Towards Pretraining Robust ASR Foundation Model with Acoustic-Aware Data Augmentation | May 27, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0