Towards a Single ASR Model That Generalizes to Disordered Speech Dec 26, 2024 Fairness speech-recognition
— Unverified 0 Enhancing Audiovisual Speech Recognition through Bifocal Preference Optimization Dec 26, 2024 Automatic Speech Recognition speech-recognition
— Unverified 0 Speech Recognition With LLMs Adapted to Disordered Speech Using Reinforcement Learning Dec 25, 2024 Language Modeling Language Modelling
— Unverified 0 Structured Speaker-Deficiency Adaptation of Foundation Models for Dysarthric and Elderly Speech Recognition Dec 25, 2024 Attribute speech-recognition
— Unverified 0 Zero-resource Speech Translation and Recognition with LLMs Dec 24, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0 Trading Devil RL: Backdoor attack via Stock market, Bayesian Optimization and Reinforcement Learning Dec 23, 2024 Backdoor Attack Bayesian Optimization
— Unverified 0 Deep Learning in Proteomics Informatics: Applications, Challenges, and Future Directions Dec 23, 2024 Deep Learning speech-recognition
— Unverified 0 Investigating Prosodic Signatures via Speech Pre-Trained Models for Audio Deepfake Source Attribution Dec 23, 2024 Audio Deepfake Detection DeepFake Detection
— Unverified 0 UME: Upcycling Mixture-of-Experts for Scalable and Efficient Automatic Speech Recognition Dec 23, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0 Uncovering the Visual Contribution in Audio-Visual Speech Recognition Dec 22, 2024 Audio-Visual Speech Recognition Informativeness
— Unverified 0 Enhancing Multilingual ASR for Unseen Languages via Language Embedding Modeling Dec 21, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0 Transducer-Llama: Integrating LLMs into Streamable Transducer-based Speech Recognition Dec 21, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0 Adapting Whisper for Code-Switching through Encoding Refining and Language-Aware Decoding Dec 21, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0 Speech Retrieval-Augmented Generation without Automatic Speech Recognition Dec 21, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0 TouchASP: Elastic Automatic Speech Perception that Everyone Can Touch Dec 20, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0 MathSpeech: Leveraging Small LMs for Accurate Conversion in Mathematical Speech-to-Formula Dec 20, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1 LAMA-UT: Language Agnostic Multilingual ASR through Orthography Unification and Language-Specific Transliteration Dec 19, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0 Transcribing and Translating, Fast and Slow: Joint Speech Translation and Recognition Dec 19, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0 Streaming Keyword Spotting Boosted by Cross-layer Discrimination Consistency Dec 17, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 2 Speak & Improve Corpus 2025: an L2 English Speech Corpus for Language Assessment and Feedback Dec 16, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0 Speak & Improve Challenge 2025: Tasks and Baseline Systems Dec 16, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0 MERaLiON-SpeechEncoder: Towards a Speech Foundation Model for Singapore and Beyond Dec 16, 2024 Language Modeling Language Modelling
— Unverified 0 Transliterated Zero-Shot Domain Adaptation for Automatic Speech Recognition Dec 15, 2024 Automatic Speech Recognition Domain Adaptation
— Unverified 0 Efficient Adaptation of Multilingual Models for Japanese ASR Dec 14, 2024 Automatic Speech Recognition speech-recognition
Code Code Available 0 Robust Persian Digit Recognition in Noisy Environments Using Hybrid CNN-BiGRU Model Dec 14, 2024 speech-recognition Speech Recognition
— Unverified 0 MERaLiON-AudioLLM: Bridging Audio and Language with Large Language Models Dec 13, 2024 speech-recognition Speech Recognition
— Unverified 0 Greek2MathTex: A Greek Speech-to-Text Framework for LaTeX Equations Generation Dec 11, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0 Bilevel Joint Unsupervised and Supervised Training for Automatic Speech Recognition Dec 11, 2024 Automatic Speech Recognition speech-recognition
— Unverified 0 Style-agnostic evaluation of ASR using multiple reference transcripts Dec 10, 2024 speech-recognition Speech Recognition
— Unverified 0 Ensemble Machine Learning Model for Inner Speech Recognition: A Subject-Specific Investigation Dec 9, 2024 EEG feature selection
— Unverified 0 Harnessing Transfer Learning from Swahili: Advancing Solutions for Comorian Dialects Dec 9, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0 Leveraging Prompt Learning and Pause Encoding for Alzheimer's Disease Detection Dec 9, 2024 Alzheimer's Disease Detection Automatic Speech Recognition
— Unverified 0 Effective Text Adaptation for LLM-based ASR through Soft Prompt Fine-Tuning Dec 9, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0 Not All Errors Are Equal: Investigation of Speech Recognition Errors in Alzheimer's Disease Detection Dec 9, 2024 All Alzheimer's Disease Detection
— Unverified 0 SQ-Whisper: Speaker-Querying based Whisper Model for Target-Speaker ASR Dec 7, 2024 Automatic Speech Recognition Data Augmentation
Code Code Available 0 Adaptive Dropout for Pruning Conformers Dec 6, 2024 speech-recognition Speech Recognition
— Unverified 0 Comprehensive Audio Query Handling System with Integrated Expert Models and Contextual Understanding Dec 5, 2024 Audio Generation Automatic Speech Recognition
— Unverified 0 Speech Recognition-based Feature Extraction for Enhanced Automatic Severity Classification in Dysarthric Speech Dec 5, 2024 severity prediction speech-recognition
— Unverified 0 ASR-EC Benchmark: Evaluating Large Language Models on Chinese ASR Error Correction Dec 4, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0 GLM-4-Voice: Towards Intelligent and Human-Like End-to-End Spoken Chatbot Dec 3, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 7 Late fusion ensembles for speech recognition on diverse input audio representations Dec 1, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0 Automating Feedback Analysis in Surgical Training: Detection, Categorization, and Assessment Dec 1, 2024 Action Detection Activity Detection
Code Code Available 0 A Comparative Study of LLM-based ASR and Whisper in Low Resource and Code Switching Scenario Dec 1, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0 Empowering the Deaf and Hard of Hearing Community: Enhancing Video Captions Using Large Language Models Nov 30, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0 ArEEG_Words: Dataset for Envisioned Speech Recognition using EEG for Arabic Words Nov 28, 2024 Brain Computer Interface EEG
— Unverified 0 MSA-ASR: Efficient Multilingual Speaker Attribution with frozen ASR Models Nov 27, 2024 Automatic Speech Recognition speech-recognition
— Unverified 0 How to Learn a New Language? An Efficient Solution for Self-Supervised Learning Models Unseen Languages Adaption in Low-Resource Scenario Nov 27, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0 SALMONN-omni: A Codec-free LLM for Full-duplex Speech Understanding and Generation Nov 27, 2024 Question Answering Speech Enhancement
— Unverified 0 Aligning Pre-trained Models for Spoken Language Translation Nov 27, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0 Continual Learning in Machine Speech Chain Using Gradient Episodic Memory Nov 27, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0