FlanEC: Exploring Flan-T5 for Post-ASR Error Correction Jan 22, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Investigation of Whisper ASR Hallucinations Induced by Non-Speech Audio Jan 20, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0GEC-RAG: Improving Generative Error Correction via Retrieval-Augmented Generation for Automatic Speech Recognition Systems Jan 18, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0A Benchmark of French ASR Systems Based on Error Severity Jan 18, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Unsupervised Rhythm and Voice Conversion of Dysarthric to Healthy Speech for ASR Jan 17, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Adapting Whisper for Regional Dialects: Enhancing Public Services for Vulnerable Populations in the United Kingdom Jan 15, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0persoDA: Personalized Data Augmentation for Personalized ASR Jan 15, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Selective Attention Merging for low resource tasks: A case study of Child ASR Jan 14, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0AdaCS: Adaptive Normalization for Enhanced Code-Switching ASR Jan 13, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Speech Recognition for Automatically Assessing Afrikaans and isiXhosa Preschool Oral Narratives Jan 11, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Discrete Speech Unit Extraction via Independent Component Analysis Jan 11, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Universal-2-TF: Robust All-Neural Text Formatting for ASR Jan 10, 2025 All Automatic Speech Recognition
— Unverified 0Comparing Self-Supervised Learning Models Pre-Trained on Human Speech and Animal Vocalizations for Bioacoustics Processing Jan 10, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Contextual ASR Error Handling with LLMs Augmentation for Goal-Oriented Conversational AI Jan 10, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Benchmarking Rotary Position Embeddings for Automatic Speech Recognition Jan 10, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Samba-ASR: State-Of-The-Art Speech Recognition Leveraging Structured State-Space Models Jan 6, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Listening and Seeing Again: Generative Error Correction for Audio-Visual Speech Recognition Jan 3, 2025 Audio-Visual Speech Recognition Automatic Speech Recognition
Code Code Available 0Improving Transducer-Based Spoken Language Understanding with Self-Conditioned CTC and Knowledge Transfer Jan 3, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Advancing Singlish Understanding: Bridging the Gap with Datasets and Multimodal Models Jan 2, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0LiveCC: Learning Video LLM with Streaming Speech Transcription at Scale Jan 1, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0DiCoW: Diarization-Conditioned Whisper for Target Speaker Automatic Speech Recognition Dec 30, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 2Zero-resource Speech Translation and Recognition with LLMs Dec 24, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0UME: Upcycling Mixture-of-Experts for Scalable and Efficient Automatic Speech Recognition Dec 23, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Transducer-Llama: Integrating LLMs into Streamable Transducer-based Speech Recognition Dec 21, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Enhancing Multilingual ASR for Unseen Languages via Language Embedding Modeling Dec 21, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Adapting Whisper for Code-Switching through Encoding Refining and Language-Aware Decoding Dec 21, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Speech Retrieval-Augmented Generation without Automatic Speech Recognition Dec 21, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0TouchASP: Elastic Automatic Speech Perception that Everyone Can Touch Dec 20, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0MathSpeech: Leveraging Small LMs for Accurate Conversion in Mathematical Speech-to-Formula Dec 20, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Transcribing and Translating, Fast and Slow: Joint Speech Translation and Recognition Dec 19, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0LAMA-UT: Language Agnostic Multilingual ASR through Orthography Unification and Language-Specific Transliteration Dec 19, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Streaming Keyword Spotting Boosted by Cross-layer Discrimination Consistency Dec 17, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 2Speak & Improve Corpus 2025: an L2 English Speech Corpus for Language Assessment and Feedback Dec 16, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Speak & Improve Challenge 2025: Tasks and Baseline Systems Dec 16, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Greek2MathTex: A Greek Speech-to-Text Framework for LaTeX Equations Generation Dec 11, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Harnessing Transfer Learning from Swahili: Advancing Solutions for Comorian Dialects Dec 9, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Leveraging Prompt Learning and Pause Encoding for Alzheimer's Disease Detection Dec 9, 2024 Alzheimer's Disease Detection Automatic Speech Recognition
— Unverified 0Effective Text Adaptation for LLM-based ASR through Soft Prompt Fine-Tuning Dec 9, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Not All Errors Are Equal: Investigation of Speech Recognition Errors in Alzheimer's Disease Detection Dec 9, 2024 All Alzheimer's Disease Detection
— Unverified 0Comprehensive Audio Query Handling System with Integrated Expert Models and Contextual Understanding Dec 5, 2024 Audio Generation Automatic Speech Recognition
— Unverified 0ASR-EC Benchmark: Evaluating Large Language Models on Chinese ASR Error Correction Dec 4, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0GLM-4-Voice: Towards Intelligent and Human-Like End-to-End Spoken Chatbot Dec 3, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 7Late fusion ensembles for speech recognition on diverse input audio representations Dec 1, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0A Comparative Study of LLM-based ASR and Whisper in Low Resource and Code Switching Scenario Dec 1, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Empowering the Deaf and Hard of Hearing Community: Enhancing Video Captions Using Large Language Models Nov 30, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0How to Learn a New Language? An Efficient Solution for Self-Supervised Learning Models Unseen Languages Adaption in Low-Resource Scenario Nov 27, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0AMPS: ASR with Multimodal Paraphrase Supervision Nov 27, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Continual Learning in Machine Speech Chain Using Gradient Episodic Memory Nov 27, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Aligning Pre-trained Models for Spoken Language Translation Nov 27, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Enhancing Code-Switching ASR Leveraging Non-Peaky CTC Loss and Deep Language Posterior Injection Nov 26, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0