Mamba for Streaming ASR Combined with Unimodal Aggregation Sep 30, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Alignment-Free Training for Transducer-based Multi-Talker ASR Sep 30, 2024 All Automatic Speech Recognition
— Unverified 0AfriHuBERT: A self-supervised speech representation model for African languages Sep 30, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Predictive Speech Recognition and End-of-Utterance Detection Towards Spoken Dialog Systems Sep 30, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Efficient Long-Form Speech Recognition for General Speech In-Context Learning Sep 29, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Fine-Tuning Automatic Speech Recognition for People with Parkinson's: An Effective Strategy for Enhancing Speech Technology Accessibility Sep 29, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0A GEN AI Framework for Medical Note Generation Sep 27, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Improving Multilingual ASR in the Wild Using Simple N-best Re-ranking Sep 27, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Are Transformers in Pre-trained LM A Good ASR Encoder? An Empirical Study Sep 26, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Deep CLAS: Deep Contextual Listen, Attend and Spell Sep 26, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0MT2KD: Towards A General-Purpose Encoder for Speech, Speaker, and Audio Events Sep 25, 2024 Audio Tagging Automatic Speech Recognition
— Unverified 0Speech Recognition Rescoring with Large Speech-Text Foundation Models Sep 25, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Weighted Cross-entropy for Low-Resource Languages in Multilingual Speech Recognition Sep 25, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Spelling Correction through Rewriting of Non-Autoregressive ASR Lattices Sep 24, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Revisiting Acoustic Features for Robust ASR Sep 24, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Bridging Speech and Text: Enhancing ASR with Pinyin-to-Character Pre-training in LLMs Sep 24, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Boosting Code-Switching ASR with Mixture of Experts Enhanced Speech-Conditioned LLM Sep 24, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Revise, Reason, and Recognize: LLM-Based Emotion Recognition via Emotion-Specific Prompts and ASR Error Correction Sep 23, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0MultiMed: Multilingual Medical Speech Recognition via Attention Encoder Decoder Sep 21, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Time and Tokens: Benchmarking End-to-End Speech Dysfluency Detection Sep 20, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Fast Streaming Transducer ASR Prototyping via Knowledge Distillation with Whisper Sep 20, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0A Multimodal Dense Retrieval Approach for Speech-Based Open-Domain Question Answering Sep 20, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Personalized Speech Recognition for Children with Test-Time Adaptation Sep 19, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Channel-Aware Domain-Adaptive Generative Adversarial Network for Robust Speech Recognition Sep 19, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0META-CAT: Speaker-Informed Speech Embeddings via Meta Information Concatenation for Multi-talker ASR Sep 18, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Large Language Models are Strong Audio-Visual Speech Recognition Learners Sep 18, 2024 Audio-Visual Speech Recognition Automatic Speech Recognition
Code Code Available 2ASR Benchmarking: Need for a More Representative Conversational Dataset Sep 18, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0M-BEST-RQ: A Multi-Channel Speech Foundation Model for Smart Glasses Sep 17, 2024 Action Detection Activity Detection
— Unverified 0Chain-of-Thought Prompting for Speech Translation Sep 17, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0WER We Stand: Benchmarking Urdu ASR Models Sep 17, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Ideal-LLM: Integrating Dual Encoders and Language-Adapted LLM for Multilingual Speech-to-Text Sep 17, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Zero Shot Text to Speech Augmentation for Automatic Speech Recognition on Low-Resource Accented Speech Corpora Sep 17, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0SMILE: Speech Meta In-Context Learning for Low-Resource Language Automatic Speech Recognition Sep 16, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0An Efficient Self-Learning Framework For Interactive Spoken Dialog Systems Sep 16, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Augmenting Automatic Speech Recognition Models with Disfluency Detection Sep 16, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Large Language Model Based Generative Error Correction: A Challenge and Baselines for Speech Recognition, Speaker Tagging, and Emotion Recognition Sep 15, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0ASR Error Correction using Large Language Models Sep 14, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0CPT-Boosted Wav2vec2.0: Towards Noise Robust Speech Recognition for Classroom Environments Sep 13, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Learnings from curating a trustworthy, well-annotated, and useful dataset of disordered English speech Sep 13, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Exploring SSL Discrete Tokens for Multilingual ASR Sep 13, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Exploring the Impact of Data Quantity on ASR in Extremely Low-resource Languages Sep 13, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0LA-RAG:Enhancing LLM-based ASR Accuracy with Retrieval-Augmented Generation Sep 13, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Large Language Model Can Transcribe Speech in Multi-Talker Scenarios with Versatile Instructions Sep 13, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 2NEST-RQ: Next Token Prediction for Speech Self-Supervised Pre-Training Sep 13, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Full-text Error Correction for Chinese Speech Recognition with Large Language Model Sep 12, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0WhisperNER: Unified Open Named Entity and Speech Recognition Sep 12, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 3Enhancing CTC-Based Visual Speech Recognition Sep 11, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Linear Time Complexity Conformers with SummaryMixing for Streaming Speech Recognition Sep 11, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0An Effective Context-Balanced Adaptation Approach for Long-Tailed Speech Recognition Sep 10, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Keyword-Aware ASR Error Augmentation for Robust Dialogue State Tracking Sep 10, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0