Let SSMs be ConvNets: State-space Modeling with Optimal Tensor Contractions Jan 22, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0A Domain Adaptation Framework for Speech Recognition Systems with Only Synthetic data Jan 21, 2025 Domain Adaptation speech-recognition
— Unverified 0Investigation of Whisper ASR Hallucinations Induced by Non-Speech Audio Jan 20, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Generative AI and Large Language Models in Language Preservation: Opportunities and Challenges Jan 20, 2025 Automatic Speech Recognition Diversity
— Unverified 0Enhancing Neural Spoken Language Recognition: An Exploration with Multilingual Datasets Jan 19, 2025 speech-recognition Speech Recognition
— Unverified 0A Benchmark of French ASR Systems Based on Error Severity Jan 18, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0GEC-RAG: Improving Generative Error Correction via Retrieval-Augmented Generation for Automatic Speech Recognition Systems Jan 18, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Automatic Speech Recognition for Sanskrit with Transfer Learning Jan 17, 2025 Automatic Speech Recognition speech-recognition
— Unverified 0Unsupervised Rhythm and Voice Conversion of Dysarthric to Healthy Speech for ASR Jan 17, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Delayed Fusion: Integrating Large Language Models into First-Pass Decoding in End-to-end Speech Recognition Jan 16, 2025 Automatic Speech Recognition speech-recognition
— Unverified 0PIER: A Novel Metric for Evaluating What Matters in Code-Switching Jan 16, 2025 Automatic Speech Recognition Decoder
Code Code Available 0Teaching Wav2Vec2 the Language of the Brain Jan 16, 2025 Brain Decoding speech-recognition
Code Code Available 0Adapting Whisper for Regional Dialects: Enhancing Public Services for Vulnerable Populations in the United Kingdom Jan 15, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0persoDA: Personalized Data Augmentation for Personalized ASR Jan 15, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0A Non-autoregressive Model for Joint STT and TTS Jan 15, 2025 Automatic Speech Recognition speech-recognition
— Unverified 0Selective Attention Merging for low resource tasks: A case study of Child ASR Jan 14, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Loudspeaker Beamforming to Enhance Speech Recognition Performance of Voice Driven Applications Jan 14, 2025 Automatic Speech Recognition speech-recognition
— Unverified 0Audio-CoT: Exploring Chain-of-Thought Reasoning in Large Audio Language Model Jan 13, 2025 Audio captioning Instruction Following
— Unverified 0AdaCS: Adaptive Normalization for Enhanced Code-Switching ASR Jan 13, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Joint Automatic Speech Recognition And Structure Learning For Better Speech Understanding Jan 13, 2025 Automatic Speech Recognition intent-classification
Code Code Available 0Speech Recognition for Automatically Assessing Afrikaans and isiXhosa Preschool Oral Narratives Jan 11, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0A Survey on Spoken Italian Datasets and Corpora Jan 11, 2025 Automatic Speech Recognition speech-recognition
— Unverified 0Discrete Speech Unit Extraction via Independent Component Analysis Jan 11, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Comparing Self-Supervised Learning Models Pre-Trained on Human Speech and Animal Vocalizations for Bioacoustics Processing Jan 10, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Benchmarking Rotary Position Embeddings for Automatic Speech Recognition Jan 10, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Universal-2-TF: Robust All-Neural Text Formatting for ASR Jan 10, 2025 All Automatic Speech Recognition
— Unverified 0Contextual ASR Error Handling with LLMs Augmentation for Goal-Oriented Conversational AI Jan 10, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0TTS-Transducer: End-to-End Speech Synthesis with Neural Transducer Jan 10, 2025 speech-recognition Speech Recognition
— Unverified 0Fleurs-SLU: A Massively Multilingual Benchmark for Spoken Language Understanding Jan 10, 2025 Automatic Speech Recognition Classification
Code Code Available 0LipGen: Viseme-Guided Lip Video Generation for Enhancing Visual Speech Recognition Jan 8, 2025 Lip Reading speech-recognition
— Unverified 0Methods to Increase the Amount of Data for Speech Recognition for Low Resource Languages Jan 8, 2025 speech-recognition Speech Recognition
— Unverified 0Universal Speaker Embedding Free Target Speaker Extraction and Personal Voice Activity Detection Jan 7, 2025 Action Detection Activity Detection
— Unverified 0Deep Learning for Pathological Speech: A Survey Jan 7, 2025 Automatic Speech Recognition Data Augmentation
— Unverified 0Towards a Generalizable Speech Marker for Parkinson's Disease Diagnosis Jan 7, 2025 Diagnostic Domain Adaptation
— Unverified 0Samba-ASR: State-Of-The-Art Speech Recognition Leveraging Structured State-Space Models Jan 6, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Listening and Seeing Again: Generative Error Correction for Audio-Visual Speech Recognition Jan 3, 2025 Audio-Visual Speech Recognition Automatic Speech Recognition
Code Code Available 0Improving Transducer-Based Spoken Language Understanding with Self-Conditioned CTC and Knowledge Transfer Jan 3, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Advancing Singlish Understanding: Bridging the Gap with Datasets and Multimodal Models Jan 2, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Automatic Text Pronunciation Correlation Generation and Application for Contextual Biasing Jan 1, 2025 Automatic Speech Recognition speech-recognition
— Unverified 0Breaking Through the Spike: Spike Window Decoding for Accelerated and Precise Automatic Speech Recognition Jan 1, 2025 Automatic Speech Recognition speech-recognition
— Unverified 0Incremental Dialogue Management: Survey, Discussion, and Implications for HRI Jan 1, 2025 Dialogue Management Management
— Unverified 0LiveCC: Learning Video LLM with Streaming Speech Transcription at Scale Jan 1, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Whisper Turns Stronger: Augmenting Wav2Vec 2.0 for Superior ASR in Low-Resource Languages Dec 31, 2024 Automatic Speech Recognition Data Augmentation
— Unverified 0Fotheidil: an Automatic Transcription System for the Irish Language Dec 31, 2024 Action Detection Activity Detection
— Unverified 0Enhancing Whisper's Accuracy and Speed for Indian Languages through Prompt-Tuning and Tokenization Dec 27, 2024 Automatic Speech Recognition speech-recognition
— Unverified 0Towards a Single ASR Model That Generalizes to Disordered Speech Dec 26, 2024 Fairness speech-recognition
— Unverified 0Enhancing Audiovisual Speech Recognition through Bifocal Preference Optimization Dec 26, 2024 Automatic Speech Recognition speech-recognition
— Unverified 0Structured Speaker-Deficiency Adaptation of Foundation Models for Dysarthric and Elderly Speech Recognition Dec 25, 2024 Attribute speech-recognition
— Unverified 0Speech Recognition With LLMs Adapted to Disordered Speech Using Reinforcement Learning Dec 25, 2024 Language Modeling Language Modelling
— Unverified 0Zero-resource Speech Translation and Recognition with LLMs Dec 24, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0