Qwen vs. Gemma Integration with Whisper: A Comparative Study in Multilingual SpeechLLM Systems. Jun 16, 2025. Tags: Decoder, Language Modeling.
BUT System for the MLC-SLM Challenge. Jun 16, 2025. Tags: Automatic Speech Recognition, Automatic Speech Recognition (ASR). Code: unverified.
Bi-directional Context-Enhanced Speech Large Language Models for Multilingual Conversational ASR. Jun 16, 2025. Tags: Automatic Speech Recognition, Automatic Speech Recognition (ASR). Code: unverified.
A Silent Speech Decoding System from EEG and EMG with Heterogenous Electrode Configurations. Jun 16, 2025. Tags: EEG, speech-recognition. Code: unverified.
NTU Speechlab LLM-Based Multilingual ASR System for Interspeech MLC-SLM Challenge 2025. Jun 16, 2025. Tags: Automatic Speech Recognition, Language Modeling. Code: unverified.
SC-SOT: Conditioning the Decoder on Diarized Speaker Information for End-to-End Overlapped Speech Recognition. Jun 15, 2025. Tags: Decoder, speaker-diarization. Code: unverified.
Lightweight and Robust Multi-Channel End-to-End Speech Recognition with Spherical Harmonic Transform. Jun 13, 2025. Tags: Automatic Speech Recognition, Automatic Speech Recognition (ASR). Code: unverified.
(SimPhon Speech Test): A Data-Driven Method for In Silico Design and Validation of a Phonetically Balanced Speech Test. Jun 13, 2025. Tags: Automatic Speech Recognition, Automatic Speech Recognition (ASR). Code: unverified.
Enabling automatic transcription of child-centered audio recordings from real-world environments. Jun 13, 2025. Tags: Automatic Speech Recognition, Automatic Speech Recognition (ASR). Code: unverified.
FairASR: Fair Audio Contrastive Learning for Automatic Speech Recognition. Jun 12, 2025. Tags: Automatic Speech Recognition, Contrastive Learning. Code: unverified.
Joint ASR and Speaker Role Tagging with Serialized Output Training. Jun 12, 2025. Tags: Automatic Speech Recognition, speech-recognition. Code: unverified.
Improving Named Entity Transcription with Contextual LLM-based Revision. Jun 12, 2025. Tags: Automatic Speech Recognition, Automatic Speech Recognition (ASR). Code: unverified.
Advances in Small-Footprint Keyword Spotting: A Comprehensive Review of Efficient Models and Algorithms. Jun 12, 2025. Tags: Automatic Speech Recognition, Keyword Spotting. Code: unverified.
Regularizing Learnable Feature Extraction for Automatic Speech Recognition. Jun 11, 2025. Tags: Automatic Speech Recognition, Automatic Speech Recognition (ASR). Code: available.
OWSM-Biasing: Contextualizing Open Whisper-Style Speech Models for Automatic Speech Recognition with Dynamic Vocabulary. Jun 11, 2025. Tags: Automatic Speech Recognition, speech-recognition. Code: unverified.
Addressing Pitfalls in Auditing Practices of Automatic Speech Recognition Technologies: A Case Study of People with Aphasia. Jun 10, 2025. Tags: Automatic Speech Recognition, Automatic Speech Recognition (ASR). Code: unverified.
SimClass: A Classroom Speech Dataset Generated via Game Engine Simulation For Automatic Speech Recognition Research. Jun 10, 2025. Tags: Automatic Speech Recognition, Data Augmentation. Code: available.
Transcript-Prompted Whisper with Dictionary-Enhanced Decoding for Japanese Speech Annotation. Jun 9, 2025. Tags: Automatic Speech Recognition, Automatic Speech Recognition (ASR). Code: unverified.
Benchmarking Foundation Speech and Language Models for Alzheimer's Disease and Related Dementia Detection from Spontaneous Speech. Jun 9, 2025. Tags: Automatic Speech Recognition, Automatic Speech Recognition (ASR). Code: unverified.
Uncovering the Functional Roles of Nonlinearity in Memory. Jun 9, 2025. Tags: speech-recognition, Speech Recognition. Code: unverified.
Unified Semi-Supervised Pipeline for Automatic Speech Recognition. Jun 9, 2025. Tags: Automatic Speech Recognition, speech-recognition. Code: unverified.
Speaker-Distinguishable CTC: Learning Speaker Distinction Using CTC for Multi-Talker Speech Recognition. Jun 9, 2025. Tags: Automatic Speech Recognition, Multi-Task Learning. Code: unverified.
Speech Recognition on TV Series with Video-guided Post-Correction. Jun 8, 2025. Tags: Automatic Speech Recognition, Automatic Speech Recognition (ASR). Code: unverified.
Automatic Speech Recognition of African American English: Lexical and Contextual Effects. Jun 7, 2025. Tags: Automatic Speech Recognition, Automatic Speech Recognition (ASR). Code: unverified.
Beyond Classification: Towards Speech Emotion Reasoning with Multitask AudioLLMs. Jun 7, 2025. Tags: Emotion Recognition, speech-recognition. Code: unverified.
Lightweight Prompt Biasing for Contextualized End-to-End ASR Systems. Jun 6, 2025. Tags: Automatic Speech Recognition, Automatic Speech Recognition (ASR). Code: unverified.
Diarization-Aware Multi-Speaker Automatic Speech Recognition via Large Language Models. Jun 6, 2025. Tags: Automatic Speech Recognition, speaker-diarization. Code: unverified.
Low-Resource Domain Adaptation for Speech LLMs via Text-Only Fine-Tuning. Jun 6, 2025. Tags: Automatic Speech Recognition, Automatic Speech Recognition (ASR). Code: unverified.
AS-ASR: A Lightweight Framework for Aphasia-Specific Automatic Speech Recognition. Jun 6, 2025. Tags: Automatic Speech Recognition, speech-recognition. Code: unverified.
Bridging the Modality Gap: Softly Discretizing Audio Representation for LLM-based Automatic Speech Recognition. Jun 6, 2025. Tags: Automatic Speech Recognition, Automatic Speech Recognition (ASR). Code: unverified.
LESS: Large Language Model Enhanced Semi-Supervised Learning for Speech Foundational Models. Jun 5, 2025. Tags: Automatic Speech Recognition, Automatic Speech Recognition (ASR). Code: unverified.
Customizing Speech Recognition Model with Large Language Model Feedback. Jun 5, 2025. Tags: Automatic Speech Recognition, Automatic Speech Recognition (ASR). Code: unverified.
ViCocktail: Automated Multi-Modal Data Collection for Vietnamese Audio-Visual Speech Recognition. Jun 5, 2025. Tags: Audio-Visual Speech Recognition, speech-recognition. Code: unverified.
Better Pseudo-labeling with Multi-ASR Fusion and Error Correction by SpeechLLM. Jun 5, 2025. Tags: Automatic Speech Recognition, Automatic Speech Recognition (ASR). Code: unverified.
LLM-based phoneme-to-grapheme for phoneme-based speech recognition. Jun 5, 2025. Tags: Automatic Speech Recognition, Automatic Speech Recognition (ASR). Code: unverified.
Structured State Space Model Dynamics and Parametrization for Spiking Neural Networks. Jun 4, 2025. Tags: speech-recognition, Speech Recognition. Code: unverified.
Effects of Speaker Count, Duration, and Accent Diversity on Zero-Shot Accent Robustness in Low-Resource ASR. Jun 4, 2025. Tags: Automatic Speech Recognition, Automatic Speech Recognition (ASR). Code: available.
MFLA: Monotonic Finite Look-ahead Attention for Streaming Speech Recognition. Jun 4, 2025. Tags: speech-recognition, Speech Recognition. Code: unverified.
Improving Child Speech Recognition and Reading Mistake Detection by Using Prompts. Jun 4, 2025. Tags: Mistake Detection, speech-recognition. Code: unverified.
A Multi-Dialectal Dataset for German Dialect ASR and Dialect-to-Standard Speech Translation. Jun 3, 2025. Tags: Automatic Speech Recognition, Automatic Speech Recognition (ASR). Code: unverified.
Enhancing Lyrics Transcription on Music Mixtures with Consistency Loss. Jun 3, 2025. Tags: Automatic Lyrics Transcription, Automatic Speech Recognition. Code: unverified.
Overcoming Data Scarcity in Multi-Dialectal Arabic ASR via Whisper Fine-Tuning. Jun 3, 2025. Tags: Automatic Speech Recognition, Automatic Speech Recognition (ASR). Code: unverified.
TalTech Systems for the Interspeech 2025 ML-SUPERB 2.0 Challenge. Jun 2, 2025. Tags: Language Identification, speech-recognition. Code: unverified.
Unsupervised Rhythm and Voice Conversion to Improve ASR on Dysarthric Speech. Jun 2, 2025. Tags: Automatic Speech Recognition, Automatic Speech Recognition (ASR). Code: unverified.
Analyzing the Importance of Blank for CTC-Based Knowledge Distillation. Jun 2, 2025. Tags: Automatic Speech Recognition, Knowledge Distillation. Code: available.
Whale: Large-Scale multilingual ASR model with w2v-BERT and E-Branchformer with large speech data. Jun 2, 2025. Tags: Decoder, speech-recognition. Code: unverified.
Riemannian Time Warping: Multiple Sequence Alignment in Curved Spaces. Jun 2, 2025. Tags: Multiple Sequence Alignment, speech-recognition. Code: unverified.
WCTC-Biasing: Retraining-free Contextual Biasing ASR with Wildcard CTC-based Keyword Spotting and Inter-layer Biasing. Jun 2, 2025. Tags: Keyword Spotting, speech-recognition. Code: unverified.
Self-Supervised Speech Quality Assessment (S3QA): Leveraging Speech Foundation Models for a Scalable Speech Quality Metric. Jun 2, 2025. Tags: Automatic Speech Recognition, speech-recognition. Code: unverified.
HENT-SRT: Hierarchical Efficient Neural Transducer with Self-Distillation for Joint Speech Recognition and Translation. Jun 2, 2025. Tags: Automatic Speech Recognition, Automatic Speech Recognition (ASR). Code: unverified.