Exploration of Adapter for Noise Robust Automatic Speech Recognition | Feb 28, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified 0
Twists, Humps, and Pebbles: Multilingual Speech Recognition Models Exhibit Gender Performance Gaps | Feb 28, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available 0
Extreme Encoder Output Frame Rate Reduction: Improving Computational Latencies of Large End-to-End Models | Feb 27, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified 0
An Effective Mixture-Of-Experts Approach For Code-Switching Speech Recognition Leveraging Encoder Disentanglement | Feb 27, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified 0
ArEEG_Chars: Dataset for Envisioned Speech Recognition using EEG for Arabic Characters | Feb 24, 2024 | Brain Computer Interface, EEG | Unverified 0
Where Visual Speech Meets Language: VSP-LLM Framework for Efficient and Context-Aware Visual Speech Processing | Feb 23, 2024 | Lipreading, Lip Reading | Code Available 3
Efficient data selection employing Semantic Similarity-based Graph Structures for model training | Feb 22, 2024 | Semantic Similarity, Semantic Textual Similarity | Unverified 0
HINT: High-quality INPainting Transformer with Mask-Aware Encoding and Enhanced Attention | Feb 22, 2024 | Image Inpainting, speech-recognition | Code Available 2
Mel-FullSubNet: Mel-Spectrogram Enhancement for Improving Both Speech Quality and ASR | Feb 21, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified 0
Comparison of Conventional Hybrid and CTC/Attention Decoders for Continuous Visual Speech Recognition | Feb 20, 2024 | Decoder, speech-recognition | Unverified 0
Breaking Down Power Barriers in On-Device Streaming ASR: Insights and Solutions | Feb 20, 2024 | speech-recognition, Speech Recognition | Unverified 0
OWSM-CTC: An Open Encoder-Only Speech Foundation Model for Speech Recognition, Translation, and Language Identification | Feb 20, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified 0
How do Hyenas deal with Human Speech? Speech Recognition and Translation with ConfHyena | Feb 20, 2024 | Automatic Speech Recognition, image-classification | Unverified 0
Cross-Attention Fusion of Visual and Geometric Features for Large Vocabulary Arabic Lipreading | Feb 18, 2024 | Lipreading, Lip Reading | Unverified 0
Ain't Misbehavin' -- Using LLMs to Generate Expressive Robot Behavior in Conversations with the Tabletop Robot Haru | Feb 18, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified 0
UniEnc-CASSNAT: An Encoder-only Non-autoregressive ASR for Speech SSL Models | Feb 14, 2024 | Automatic Speech Recognition, Decoder | Unverified 0
Listening to Multi-talker Conversations: Modular and End-to-end Perspectives | Feb 14, 2024 | GPU, speaker-diarization | Unverified 0
Syllable based DNN-HMM Cantonese Speech to Text System | Feb 13, 2024 | speech-recognition, Speech Recognition | Unverified 0
An Embarrassingly Simple Approach for LLM with Strong ASR Capacity | Feb 13, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available 2
Careless Whisper: Speech-to-Text Hallucination Harms | Feb 12, 2024 | Hallucination, Language Modeling | Code Available 0
The Balancing Act: Unmasking and Alleviating ASR Biases in Portuguese | Feb 12, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified 0
SALAD: Smart AI Language Assistant Daily | Feb 12, 2024 | Language Acquisition, speech-recognition | Unverified 0
AIR-Bench: Benchmarking Large Audio-Language Models via Generative Comprehension | Feb 12, 2024 | 2k, Automatic Speech Recognition | Code Available 2
The Sound of Healthcare: Improving Medical Transcription ASR Accuracy with Large Language Models | Feb 12, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified 0
CochCeps-Augment: A Novel Self-Supervised Contrastive Learning Using Cochlear Cepstrum-based Masking for Speech Emotion Recognition | Feb 10, 2024 | Contrastive Learning, Emotion Recognition | Code Available 0
DeepCover: Advancing RNN Test Coverage and Online Error Prediction using State Machine Extraction | Feb 10, 2024 | Decision Making, speech-recognition | Code Available 0
Self-consistent context aware conformer transducer for speech recognition | Feb 9, 2024 | Automatic Speech Recognition, Language Modeling | Unverified 0
It's Never Too Late: Fusing Acoustic Information into Large Language Models for Automatic Speech Recognition | Feb 8, 2024 | Audio-Visual Speech Recognition, Automatic Speech Recognition | Code Available 1
Paralinguistics-Aware Speech-Empowered Large Language Models for Natural Conversation | Feb 8, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available 2
Progressive unsupervised domain adaptation for ASR using ensemble models and multi-stage training | Feb 7, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified 0
REBORN: Reinforcement-Learned Boundary Segmentation with Iterative Training for Unsupervised ASR | Feb 6, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available 1
Resolving Transcription Ambiguity in Spanish: A Hybrid Acoustic-Lexical System for Punctuation Restoration | Feb 5, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified 0
A Comprehensive Study of the Current State-of-the-Art in Nepali Automatic Speech Recognition Systems | Feb 5, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified 0
Predicting positive transfer for improved low-resource speech recognition using acoustic pseudo-tokens | Feb 3, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified 0
Digits micro-model for accurate and secure transactions | Feb 2, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified 0
Whispering in Norwegian: Navigating Orthographic and Dialectic Challenges | Feb 2, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified 0
Streaming Sequence Transduction through Dynamic Compression | Feb 2, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available 0
AccentFold: A Journey through African Accents for Zero-Shot ASR Adaptation to Target Accents | Feb 2, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified 0
Introduction to speech recognition | Feb 1, 2024 | Dynamic Time Warping, speech-recognition | Unverified 0
Prosody in Cascade and Direct Speech-to-Text Translation: a case study on Korean Wh-Phrases | Feb 1, 2024 | speech-recognition, Speech Recognition | Unverified 0
Computation and Parameter Efficient Multi-Modal Fusion Transformer for Cued Speech Recognition | Jan 31, 2024 | Lip Reading, speech-recognition | Unverified 0
Exploring the limits of decoder-only models trained on public speech recognition corpora | Jan 31, 2024 | Decoder, speech-recognition | Unverified 0
SpeechComposer: Unifying Multiple Speech Tasks with Prompt Composition | Jan 31, 2024 | Decoder, Language Modeling | Unverified 0
OWSM v3.1: Better and Faster Open Whisper-Style Speech Models based on E-Branchformer | Jan 30, 2024 | speech-recognition, Speech Recognition | Unverified 0
Byte Pair Encoding Is All You Need For Automatic Bengali Speech Recognition | Jan 28, 2024 | All, Automatic Speech Recognition | Unverified 0
On Speaker Attribution with SURT | Jan 28, 2024 | speech-recognition, Speech Recognition | Unverified 0
Towards Event Extraction from Speech with Contextual Clues | Jan 27, 2024 | Event Extraction, speech-recognition | Code Available 0
Comparison of parameters of vowel sounds of russian and english languages | Jan 26, 2024 | speech-recognition, Speech Recognition | Unverified 0
Toward Practical Automatic Speech Recognition and Post-Processing: a Call for Explainable Error Benchmark Guideline | Jan 26, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified 0
TDFNet: An Efficient Audio-Visual Speech Separation Model with Top-down Fusion | Jan 25, 2024 | speech-recognition, Speech Recognition | Code Available 1