NonverbalTTS: A Public English Corpus of Text-Aligned Nonverbal Vocalizations with Emotion Annotations for Text-to-Speech Jul 17, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Task-Specific Audio Coding for Machines: Machine-Learned Latent Features Are Codes for That Machine Jul 17, 2025 Audio Classification Automatic Speech Recognition
— Unverified 0WhisperKit: On-device Real-time ASR with Billion-Scale Transformers Jul 14, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0VisualSpeaker: Visually-Guided 3D Avatar Lip Synthesis Jul 8, 2025 Automatic Speech Recognition Lip Reading
— Unverified 0A Hybrid Machine Learning Framework for Optimizing Crop Selection via Agronomic and Economic Forecasting Jul 6, 2025 Hybrid Machine Learning speech-recognition
— Unverified 0First Steps Towards Voice Anonymization for Code-Switching Speech Jul 2, 2025 speech-recognition Speech Recognition
Code Code Available 0MambAttention: Mamba with Multi-Head Attention for Generalizable Single-Channel Speech Enhancement Jul 1, 2025 Automatic Speech Recognition Mamba
Code Code Available 2VOICE CONTROL ROBOT USING ARDUINO MANAGEMENT SYSTEM PROJECT. Jun 25, 2025 Management speech-recognition
— Unverified 0Lightweight Target-Speaker-Based Overlap Transcription for Practical Streaming ASR Jun 25, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Multimodal Representation Learning and Fusion Jun 25, 2025 AutoML Representation Learning
— Unverified 0AUTOMATIC PRONUNCIATION MISTAKE DETECTOR PROJECT REPORT Jun 25, 2025 Mistake Detection speech-recognition
— Unverified 0End-to-End Spoken Grammatical Error Correction Jun 23, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0AI-Generated Song Detection via Lyrics Transcripts Jun 23, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Splitformer: An improved early-exit architecture for automatic speech recognition on edge devices Jun 22, 2025 Automatic Speech Recognition speech-recognition
Code Code Available 0OpusLM: A Family of Open Unified Speech Language Models Jun 21, 2025 Decoder speech-recognition
— Unverified 0State-Space Models in Efficient Whispered and Multi-dialect Speech Recognition Jun 20, 2025 Automatic Speech Recognition Diversity
— Unverified 0LM-SPT: LM-Aligned Semantic Distillation for Speech Tokenization Jun 20, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Breaking the Transcription Bottleneck: Fine-tuning ASR Models for Extremely Low-Resource Fieldwork Languages Jun 20, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Weight Factorization and Centralization for Continual Learning in Speech Recognition Jun 19, 2025 Continual Learning speech-recognition
— Unverified 0Automatic Speech Recognition Biases in Newcastle English: an Error Analysis Jun 19, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Thinking in Directivity: Speech Large Language Model for Multi-Talker Directional Speech Recognition Jun 17, 2025 Data Augmentation Language Modeling
— Unverified 0Unifying Streaming and Non-streaming Zipformer-based ASR Jun 17, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Improving Practical Aspects of End-to-End Multi-Talker Speech Recognition for Online and Offline Scenarios Jun 17, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0NTU Speechlab LLM-Based Multilingual ASR System for Interspeech MLC-SLM Challenge 2025 Jun 16, 2025 Automatic Speech Recognition Language Modeling
— Unverified 0BUT System for the MLC-SLM Challenge Jun 16, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Bi-directional Context-Enhanced Speech Large Language Models for Multilingual Conversational ASR Jun 16, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0A Silent Speech Decoding System from EEG and EMG with Heterogenous Electrode Configurations Jun 16, 2025 EEG speech-recognition
— Unverified 0Qwen vs. Gemma Integration with Whisper: A Comparative Study in Multilingual SpeechLLM Systems Jun 16, 2025 Decoder Language Modeling
— Unverified 0Seewo's Submission to MLC-SLM: Lessons learned from Speech Reasoning Language Models Jun 16, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0SC-SOT: Conditioning the Decoder on Diarized Speaker Information for End-to-End Overlapped Speech Recognition Jun 15, 2025 Decoder speaker-diarization
— Unverified 0Lightweight and Robust Multi-Channel End-to-End Speech Recognition with Spherical Harmonic Transform Jun 13, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0(SimPhon Speech Test): A Data-Driven Method for In Silico Design and Validation of a Phonetically Balanced Speech Test Jun 13, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Enabling automatic transcription of child-centered audio recordings from real-world environments Jun 13, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Advances in Small-Footprint Keyword Spotting: A Comprehensive Review of Efficient Models and Algorithms Jun 12, 2025 Automatic Speech Recognition Keyword Spotting
Code Code Available 0FairASR: Fair Audio Contrastive Learning for Automatic Speech Recognition Jun 12, 2025 Automatic Speech Recognition Contrastive Learning
— Unverified 0Joint ASR and Speaker Role Tagging with Serialized Output Training Jun 12, 2025 Automatic Speech Recognition speech-recognition
— Unverified 0Improving Named Entity Transcription with Contextual LLM-based Revision Jun 12, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0OWSM-Biasing: Contextualizing Open Whisper-Style Speech Models for Automatic Speech Recognition with Dynamic Vocabulary Jun 11, 2025 Automatic Speech Recognition speech-recognition
— Unverified 0Regularizing Learnable Feature Extraction for Automatic Speech Recognition Jun 11, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0SimClass: A Classroom Speech Dataset Generated via Game Engine Simulation For Automatic Speech Recognition Research Jun 10, 2025 Automatic Speech Recognition Data Augmentation
— Unverified 0Addressing Pitfalls in Auditing Practices of Automatic Speech Recognition Technologies: A Case Study of People with Aphasia Jun 10, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Benchmarking Foundation Speech and Language Models for Alzheimer's Disease and Related Dementia Detection from Spontaneous Speech Jun 9, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Speaker-Distinguishable CTC: Learning Speaker Distinction Using CTC for Multi-Talker Speech Recognition Jun 9, 2025 Automatic Speech Recognition Multi-Task Learning
— Unverified 0Transcript-Prompted Whisper with Dictionary-Enhanced Decoding for Japanese Speech Annotation Jun 9, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Unified Semi-Supervised Pipeline for Automatic Speech Recognition Jun 9, 2025 Automatic Speech Recognition speech-recognition
— Unverified 0Uncovering the Functional Roles of Nonlinearity in Memory Jun 9, 2025 speech-recognition Speech Recognition
— Unverified 0Speech Recognition on TV Series with Video-guided Post-Correction Jun 8, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Beyond Classification: Towards Speech Emotion Reasoning with Multitask AudioLLMs Jun 7, 2025 Emotion Recognition speech-recognition
— Unverified 0Automatic Speech Recognition of African American English: Lexical and Contextual Effects Jun 7, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0AS-ASR: A Lightweight Framework for Aphasia-Specific Automatic Speech Recognition Jun 6, 2025 Automatic Speech Recognition speech-recognition
— Unverified 0