Continual Learning in Machine Speech Chain Using Gradient Episodic Memory Nov 27, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Enhancing Code-Switching ASR Leveraging Non-Peaky CTC Loss and Deep Language Posterior Injection Nov 26, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Disentangled-Transformer: An Explainable End-to-End Automatic Speech Recognition Model with Speech Content-Context Separation Nov 26, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0k2SSL: A Faster and Better Framework for Self-Supervised Speech Representation Learning Nov 26, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Towards Maximum Likelihood Training for Transducer-based Streaming Speech Recognition Nov 26, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Scaling Speech-Text Pre-training with Synthetic Interleaved Data Nov 26, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 7High-precision medical speech recognition through synthetic data and semantic correction: UNITED-MEDASR Nov 24, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0TSkips: Efficiency Through Explicit Temporal Delay Connections in Spiking Neural Networks Nov 22, 2024 Gesture Recognition Hand Gesture Recognition
— Unverified 0Transforming NLU with Babylon: A Case Study in Development of Real-time, Edge-Efficient, Multi-Intent Translation System for Automated Drive-Thru Ordering Nov 22, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Tiny-Align: Bridging Automatic Speech Recognition and Large Language Model on the Edge Nov 21, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Towards Advanced Speech Signal Processing: A Statistical Perspective on Convolution-Based Architectures and its Applications Nov 20, 2024 Emotion Recognition Speaker Identification
— Unverified 0From Statistical Methods to Pre-Trained Models; A Survey on Automatic Speech Recognition for Resource Scarce Urdu Language Nov 20, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Hard-Synth: Synthesizing Diverse Hard Samples for ASR using Zero-Shot TTS and LLM Nov 20, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0CAFE A Novel Code switching Dataset for Algerian Dialect French and English Nov 20, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Whisper Finetuning on Nepali Language Nov 19, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0A Novel Speech Analysis and Correction Tool for Arabic-Speaking Children Nov 18, 2024 Diagnostic speech-recognition
— Unverified 0Inter-linguistic Phonetic Composition (IPC): A Theoretical and Computational Approach to Enhance Second Language Pronunciation Nov 17, 2024 Automatic Speech Recognition speech-recognition
— Unverified 0BackdoorMBTI: A Backdoor Learning Multimodal Benchmark Tool Kit for Backdoor Defense Evaluation Nov 17, 2024 Action Recognition backdoor defense
Code Code Available 1BanglaDialecto: An End-to-End AI-Powered Regional Speech Standardization Nov 16, 2024 Machine Translation speech-recognition
Code Code Available 0WavChat: A Survey of Spoken Dialogue Models Nov 15, 2024 speech-recognition Speech Recognition
Code Code Available 3XLSR-Mamba: A Dual-Column Bidirectional State Space Model for Spoofing Attack Detection Nov 15, 2024 Audio Deepfake Detection Automatic Speech Recognition
Code Code Available 1DiMoDif: Discourse Modality-information Differentiation for Audio-visual Deepfake Detection and Localization Nov 15, 2024 DeepFake Detection Face Swapping
Code Code Available 0Systolic Arrays and Structured Pruning Co-design for Efficient Transformers in Edge Systems Nov 15, 2024 Machine Translation Quantization
— Unverified 0Everyone deserves their voice to be heard: Analyzing Predictive Gender Bias in ASR Models Applied to Dutch Speech Data Nov 14, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Transferable Adversarial Attacks against ASR Nov 14, 2024 Action Detection Activity Detection
— Unverified 0DCF-DS: Deep Cascade Fusion of Diarization and Separation for Speech Recognition under Realistic Single-Channel Conditions Nov 11, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0CTC-Assisted LLM-Based Contextual ASR Nov 10, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Dialectal Coverage And Generalization in Arabic Speech Recognition Nov 7, 2024 Arabic Speech Recognition Automatic Speech Recognition
Code Code Available 2Multistage Fine-tuning Strategies for Automatic Speech Recognition in Low-resource Languages Nov 7, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Unified Speech Recognition: A Single Model for Auditory, Visual, and Audiovisual Inputs Nov 4, 2024 Lipreading speech-recognition
Code Code Available 1SPES: Spectrogram Perturbation for Explainable Speech-to-Text Generation Nov 3, 2024 speech-recognition Speech Recognition
— Unverified 0Enhancing AAC Software for Dysarthric Speakers in e-Health Settings: An Evaluation Using TORGO Nov 1, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Optimizing Contextual Speech Recognition Using Vector Quantization for Efficient Retrieval Nov 1, 2024 Quantization Retrieval
— Unverified 0Speech is More Than Words: Do Speech-to-Text Translation Systems Leverage Prosody? Oct 31, 2024 Rhythm speech-recognition
— Unverified 0Augmenting Polish Automatic Speech Recognition System With Synthetic Data Oct 30, 2024 Automatic Speech Recognition speech-recognition
— Unverified 0Run-Time Adaptation of Neural Beamforming for Robust Speech Dereverberation and Denoising Oct 30, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Joint Beamforming and Speaker-Attributed ASR for Real Distant-Microphone Meeting Transcription Oct 29, 2024 Automatic Speech Recognition speech-recognition
— Unverified 0Multilingual Standalone Trustworthy Voice-Based Social Network for Disaster Situations Oct 28, 2024 speech-recognition Speech Recognition
— Unverified 0Asynchronous Tool Usage for Real-Time Agents Oct 28, 2024 Automatic Speech Recognition speech-recognition
— Unverified 0Improving Speech-based Emotion Recognition with Contextual Utterance Analysis and LLMs Oct 27, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0emg2qwerty: A Large Dataset with Baselines for Touch Typing using Surface Electromyography Oct 26, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 2Evaluating and Improving Automatic Speech Recognition Systems for Korean Meteorological Experts Oct 24, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0kNN For Whisper And Its Effect On Bias And Speaker Adaptation Oct 24, 2024 Machine Translation speech-recognition
— Unverified 0A Survey on Speech Large Language Models Oct 24, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0STTATTS: Unified Speech-To-Text And Text-To-Speech Model Oct 24, 2024 Multi-Task Learning speech-recognition
Code Code Available 1Contextual Biasing to Improve Domain-specific Custom Vocabulary Audio Transcription without Explicit Fine-Tuning of Whisper Model Oct 24, 2024 speech-recognition Speech Recognition
— Unverified 0VoiceTextBlender: Augmenting Large Language Models with Speech Capabilities via Single-Stage Joint Speech-Text Supervised Fine-Tuning Oct 23, 2024 Question Answering Speech Recognition
Code Code Available 1ELAICHI: Enhancing Low-resource TTS by Addressing Infrequent and Low-frequency Character Bigrams Oct 23, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Improving Automatic Speech Recognition with Decoder-Centric Regularisation in Encoder-Decoder Models Oct 22, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0DENOASR: Debiasing ASRs through Selective Denoising Oct 22, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0