Boosting the Transferability of Audio Adversarial Examples with Acoustic Representation Optimization Mar 25, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Whispering in Amharic: Fine-tuning Whisper for Low-resource Language Mar 24, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Your voice is your voice: Supporting Self-expression through Speech Generation and LLMs in Augmented and Alternative Communication Mar 21, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Evaluating ASR Confidence Scores for Automated Error Detection in User-Assisted Correction Interfaces Mar 19, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0ValSub: Subsampling Validation Data to Mitigate Forgetting during ASR Personalization Mar 12, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Everything Can Be Described in Words: A Simple Unified Multi-Modal Framework with Semantic and Temporal Alignment Mar 12, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0An Exhaustive Evaluation of TTS- and VC-based Data Augmentation for ASR Mar 11, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Automatic Speech Recognition for Non-Native English: Accuracy and Disfluency Handling Mar 10, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Building English ASR model with regional language support Mar 10, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0From Voice to Safety: Language AI Powered Pilot-ATC Communication Understanding for Airport Surface Movement Collision Risk Assessment Mar 6, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Qieemo: Speech Is All You Need in the Emotion Recognition in Conversations Mar 5, 2025 All Automatic Speech Recognition
— Unverified 0Direct Speech to Speech Translation: A Review Mar 3, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Fine-Tuning Whisper for Inclusive Prosodic Stress Analysis Mar 3, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Unveiling Biases while Embracing Sustainability: Assessing the Dual Challenges of Automatic Speech Recognition Systems Mar 2, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0LiteASR: Efficient Automatic Speech Recognition with Low-Rank Approximation Feb 27, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 2Adapting Automatic Speech Recognition for Accented Air Traffic Control Communications Feb 27, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0CleanMel: Mel-Spectrogram Enhancement for Improving Both Speech Quality and ASR Feb 27, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 2Nexus: An Omni-Perceptive And -Interactive Model for Language, Audio, And Vision Feb 26, 2025 Audio Synthesis Automatic Speech Recognition
— Unverified 0CS-Dialogue: A 104-Hour Dataset of Spontaneous Mandarin-English Code-Switching Dialogues for Speech Recognition Feb 26, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Exploring Gender Disparities in Automatic Speech Recognition Technology Feb 25, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Improving the Inclusivity of Dutch Speech Recognition by Fine-tuning Whisper on the JASMIN-CGN Corpus Feb 24, 2025 Automatic Speech Recognition (ASR) speech-recognition
Code Code Available 0Understanding Zero-shot Rare Word Recognition Improvements Through LLM Integration Feb 22, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0The Esethu Framework: Reimagining Sustainable Dataset Governance and Curation for Low-Resource Languages Feb 21, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Enhancing Speech Large Language Models with Prompt-Aware Mixture of Audio Encoders Feb 21, 2025 Audio captioning Automatic Speech Recognition
— Unverified 0Adopting Whisper for Confidence Estimation Feb 19, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Lost in Transcription, Found in Distribution Shift: Demystifying Hallucination in Speech Foundation Models Feb 18, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Benchmarking Automatic Speech Recognition coupled LLM Modules for Medical Diagnostics Feb 18, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Gesture-Aware Zero-Shot Speech Recognition for Patients with Language Disorders Feb 18, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0DuplexMamba: Enhancing Real-time Speech Conversations with Duplex and Streaming Capabilities Feb 16, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1MTLM: Incorporating Bidirectional Text Information to Enhance Language Model Training in Speech Recognition Systems Feb 14, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Causal Analysis of ASR Errors for Children: Quantifying the Impact of Physiological, Cognitive, and Extrinsic Factors Feb 12, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0VINP: Variational Bayesian Inference with Neural Speech Prior for Joint ASR-Effective Speech Dereverberation and Blind RIR Identification Feb 11, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Audio-Visual Representation Learning via Knowledge Distillation from Speech Foundation Models Feb 9, 2025 Audio-Visual Speech Recognition Automatic Speech Recognition
Code Code Available 1Evaluating Standard and Dialectal Frisian ASR: Multilingual Fine-tuning and Language Identification for Improved Low-resource Performance Feb 7, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Afrispeech-Dialog: A Benchmark Dataset for Spontaneous English Conversations in Healthcare and Beyond Feb 6, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Leveraging Broadcast Media Subtitle Transcripts for Automatic Speech Recognition and Subtitling Feb 5, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition Feb 3, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0A Differentiable Alignment Framework for Sequence-to-Sequence Modeling via Optimal Transport Feb 3, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Data-Driven Mispronunciation Pattern Discovery for Robust Speech Recognition Feb 1, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Sagalee: an Open Source Automatic Speech Recognition Dataset for Oromo Language Feb 1, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1When End-to-End is Overkill: Rethinking Cascaded Speech-to-Text Translation Feb 1, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Language Bias in Self-Supervised Learning For Automatic Speech Recognition Jan 31, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0SELMA: A Speech-Enabled Language Model for Virtual Assistant Interactions Jan 31, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Cross-lingual Embedding Clustering for Hierarchical Softmax in Low-Resource Multilingual Speech Recognition Jan 29, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0SEAL: Speech Embedding Alignment Learning for Speech Large Language Model with Retrieval-Augmented Generation Jan 26, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0The Multicultural Medical Assistant: Can LLMs Improve Medical ASR Errors Across Borders? Jan 25, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Speech Translation Refinement using Large Language Models Jan 25, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0FireRedASR: Open-Source Industrial-Grade Mandarin Speech Recognition Models from Encoder-Decoder to LLM Integration Jan 24, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 5Predicting Compact Phrasal Rewrites with Large Language Models for ASR Post Editing Jan 23, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Let SSMs be ConvNets: State-space Modeling with Optimal Tensor Contractions Jan 22, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0