Predicting Compact Phrasal Rewrites with Large Language Models for ASR Post Editing | Jan 23, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
Development of an Inclusive Educational Platform Using Open Technologies and Machine Learning: A Case Study on Accessibility Enhancement | Jan 22, 2025 | Object Recognition, speech-recognition | Unverified | 0
Let SSMs be ConvNets: State-space Modeling with Optimal Tensor Contractions | Jan 22, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0
FlanEC: Exploring Flan-T5 for Post-ASR Error Correction | Jan 22, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1
A Domain Adaptation Framework for Speech Recognition Systems with Only Synthetic data | Jan 21, 2025 | Domain Adaptation, speech-recognition | Unverified | 0
Investigation of Whisper ASR Hallucinations Induced by Non-Speech Audio | Jan 20, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
Generative AI and Large Language Models in Language Preservation: Opportunities and Challenges | Jan 20, 2025 | Automatic Speech Recognition, Diversity | Unverified | 0
Enhancing Neural Spoken Language Recognition: An Exploration with Multilingual Datasets | Jan 19, 2025 | speech-recognition, Speech Recognition | Unverified | 0
A Benchmark of French ASR Systems Based on Error Severity | Jan 18, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
GEC-RAG: Improving Generative Error Correction via Retrieval-Augmented Generation for Automatic Speech Recognition Systems | Jan 18, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
Unsupervised Rhythm and Voice Conversion of Dysarthric to Healthy Speech for ASR | Jan 17, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
Automatic Speech Recognition for Sanskrit with Transfer Learning | Jan 17, 2025 | Automatic Speech Recognition, speech-recognition | Unverified | 0
Teaching Wav2Vec2 the Language of the Brain | Jan 16, 2025 | Brain Decoding, speech-recognition | Code Available | 0
PIER: A Novel Metric for Evaluating What Matters in Code-Switching | Jan 16, 2025 | Automatic Speech Recognition, Decoder | Code Available | 0
Delayed Fusion: Integrating Large Language Models into First-Pass Decoding in End-to-end Speech Recognition | Jan 16, 2025 | Automatic Speech Recognition, speech-recognition | Unverified | 0
A Non-autoregressive Model for Joint STT and TTS | Jan 15, 2025 | Automatic Speech Recognition, speech-recognition | Unverified | 0
Adapting Whisper for Regional Dialects: Enhancing Public Services for Vulnerable Populations in the United Kingdom | Jan 15, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
persoDA: Personalized Data Augmentation for Personalized ASR | Jan 15, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
Selective Attention Merging for low resource tasks: A case study of Child ASR | Jan 14, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0
Loudspeaker Beamforming to Enhance Speech Recognition Performance of Voice Driven Applications | Jan 14, 2025 | Automatic Speech Recognition, speech-recognition | Unverified | 0
Joint Automatic Speech Recognition And Structure Learning For Better Speech Understanding | Jan 13, 2025 | Automatic Speech Recognition, intent-classification | Code Available | 0
Audio-CoT: Exploring Chain-of-Thought Reasoning in Large Audio Language Model | Jan 13, 2025 | Audio captioning, Instruction Following | Unverified | 0
AdaCS: Adaptive Normalization for Enhanced Code-Switching ASR | Jan 13, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0
Speech Recognition for Automatically Assessing Afrikaans and isiXhosa Preschool Oral Narratives | Jan 11, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
Discrete Speech Unit Extraction via Independent Component Analysis | Jan 11, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0
A Survey on Spoken Italian Datasets and Corpora | Jan 11, 2025 | Automatic Speech Recognition, speech-recognition | Unverified | 0
TTS-Transducer: End-to-End Speech Synthesis with Neural Transducer | Jan 10, 2025 | speech-recognition, Speech Recognition | Unverified | 0
Fleurs-SLU: A Massively Multilingual Benchmark for Spoken Language Understanding | Jan 10, 2025 | Automatic Speech Recognition, Classification | Code Available | 0
Contextual ASR Error Handling with LLMs Augmentation for Goal-Oriented Conversational AI | Jan 10, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
Benchmarking Rotary Position Embeddings for Automatic Speech Recognition | Jan 10, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
Comparing Self-Supervised Learning Models Pre-Trained on Human Speech and Animal Vocalizations for Bioacoustics Processing | Jan 10, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0
Universal-2-TF: Robust All-Neural Text Formatting for ASR | Jan 10, 2025 | All, Automatic Speech Recognition | Unverified | 0
Methods to Increase the Amount of Data for Speech Recognition for Low Resource Languages | Jan 8, 2025 | speech-recognition, Speech Recognition | Unverified | 0
LipGen: Viseme-Guided Lip Video Generation for Enhancing Visual Speech Recognition | Jan 8, 2025 | Lip Reading, speech-recognition | Unverified | 0
Deep Learning for Pathological Speech: A Survey | Jan 7, 2025 | Automatic Speech Recognition, Data Augmentation | Unverified | 0
Towards a Generalizable Speech Marker for Parkinson's Disease Diagnosis | Jan 7, 2025 | Diagnostic, Domain Adaptation | Unverified | 0
Universal Speaker Embedding Free Target Speaker Extraction and Personal Voice Activity Detection | Jan 7, 2025 | Action Detection, Activity Detection | Unverified | 0
Samba-ASR: State-Of-The-Art Speech Recognition Leveraging Structured State-Space Models | Jan 6, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
Listening and Seeing Again: Generative Error Correction for Audio-Visual Speech Recognition | Jan 3, 2025 | Audio-Visual Speech Recognition, Automatic Speech Recognition | Code Available | 0
Improving Transducer-Based Spoken Language Understanding with Self-Conditioned CTC and Knowledge Transfer | Jan 3, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
Advancing Singlish Understanding: Bridging the Gap with Datasets and Multimodal Models | Jan 2, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0
LiveCC: Learning Video LLM with Streaming Speech Transcription at Scale | Jan 1, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
Breaking Through the Spike: Spike Window Decoding for Accelerated and Precise Automatic Speech Recognition | Jan 1, 2025 | Automatic Speech Recognition, speech-recognition | Unverified | 0
Automatic Text Pronunciation Correlation Generation and Application for Contextual Biasing | Jan 1, 2025 | Automatic Speech Recognition, speech-recognition | Unverified | 0
Large Language Models Are Read/Write Policy-Makers for Simultaneous Generation | Jan 1, 2025 | Automatic Speech Recognition, Decoder | Code Available | 1
Incremental Dialogue Management: Survey, Discussion, and Implications for HRI | Jan 1, 2025 | Dialogue Management, Management | Unverified | 0
Fotheidil: an Automatic Transcription System for the Irish Language | Dec 31, 2024 | Action Detection, Activity Detection | Unverified | 0
Whisper Turns Stronger: Augmenting Wav2Vec 2.0 for Superior ASR in Low-Resource Languages | Dec 31, 2024 | Automatic Speech Recognition, Data Augmentation | Unverified | 0
DiCoW: Diarization-Conditioned Whisper for Target Speaker Automatic Speech Recognition | Dec 30, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 2
Enhancing Whisper's Accuracy and Speed for Indian Languages through Prompt-Tuning and Tokenization | Dec 27, 2024 | Automatic Speech Recognition, speech-recognition | Unverified | 0