Inappropriate Pause Detection In Dysarthric Speech Using Large-Scale Speech Recognition Feb 29, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Probing the Information Encoded in Neural-based Acoustic Models of Automatic Speech Recognition Systems Feb 29, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Exploration of Adapter for Noise Robust Automatic Speech Recognition Feb 28, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Twists, Humps, and Pebbles: Multilingual Speech Recognition Models Exhibit Gender Performance Gaps Feb 28, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0An Effective Mixture-Of-Experts Approach For Code-Switching Speech Recognition Leveraging Encoder Disentanglement Feb 27, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Extreme Encoder Output Frame Rate Reduction: Improving Computational Latencies of Large End-to-End Models Feb 27, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0ArEEG_Chars: Dataset for Envisioned Speech Recognition using EEG for Arabic Characters Feb 24, 2024 Brain Computer Interface EEG
— Unverified 0Efficient data selection employing Semantic Similarity-based Graph Structures for model training Feb 22, 2024 Semantic Similarity Semantic Textual Similarity
— Unverified 0Mel-FullSubNet: Mel-Spectrogram Enhancement for Improving Both Speech Quality and ASR Feb 21, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Breaking Down Power Barriers in On-Device Streaming ASR: Insights and Solutions Feb 20, 2024 speech-recognition Speech Recognition
— Unverified 0How do Hyenas deal with Human Speech? Speech Recognition and Translation with ConfHyena Feb 20, 2024 Automatic Speech Recognition image-classification
— Unverified 0Comparison of Conventional Hybrid and CTC/Attention Decoders for Continuous Visual Speech Recognition Feb 20, 2024 Decoder speech-recognition
— Unverified 0OWSM-CTC: An Open Encoder-Only Speech Foundation Model for Speech Recognition, Translation, and Language Identification Feb 20, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Cross-Attention Fusion of Visual and Geometric Features for Large Vocabulary Arabic Lipreading Feb 18, 2024 Lipreading Lip Reading
— Unverified 0Ain't Misbehavin' -- Using LLMs to Generate Expressive Robot Behavior in Conversations with the Tabletop Robot Haru Feb 18, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0UniEnc-CASSNAT: An Encoder-only Non-autoregressive ASR for Speech SSL Models Feb 14, 2024 Automatic Speech Recognition Decoder
— Unverified 0Listening to Multi-talker Conversations: Modular and End-to-end Perspectives Feb 14, 2024 GPU speaker-diarization
— Unverified 0Syllable based DNN-HMM Cantonese Speech to Text System Feb 13, 2024 speech-recognition Speech Recognition
— Unverified 0SALAD: Smart AI Language Assistant Daily Feb 12, 2024 Language Acquisition speech-recognition
— Unverified 0The Balancing Act: Unmasking and Alleviating ASR Biases in Portuguese Feb 12, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Careless Whisper: Speech-to-Text Hallucination Harms Feb 12, 2024 Hallucination Language Modeling
Code Code Available 0The Sound of Healthcare: Improving Medical Transcription ASR Accuracy with Large Language Models Feb 12, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0CochCeps-Augment: A Novel Self-Supervised Contrastive Learning Using Cochlear Cepstrum-based Masking for Speech Emotion Recognition Feb 10, 2024 Contrastive Learning Emotion Recognition
Code Code Available 0DeepCover: Advancing RNN Test Coverage and Online Error Prediction using State Machine Extraction Feb 10, 2024 Decision Making speech-recognition
Code Code Available 0Self-consistent context aware conformer transducer for speech recognition Feb 9, 2024 Automatic Speech Recognition Language Modeling
— Unverified 0Progressive unsupervised domain adaptation for ASR using ensemble models and multi-stage training Feb 7, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Resolving Transcription Ambiguity in Spanish: A Hybrid Acoustic-Lexical System for Punctuation Restoration Feb 5, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0A Comprehensive Study of the Current State-of-the-Art in Nepali Automatic Speech Recognition Systems Feb 5, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Predicting positive transfer for improved low-resource speech recognition using acoustic pseudo-tokens Feb 3, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Whispering in Norwegian: Navigating Orthographic and Dialectic Challenges Feb 2, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Streaming Sequence Transduction through Dynamic Compression Feb 2, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0AccentFold: A Journey through African Accents for Zero-Shot ASR Adaptation to Target Accents Feb 2, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Digits micro-model for accurate and secure transactions Feb 2, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Prosody in Cascade and Direct Speech-to-Text Translation: a case study on Korean Wh-Phrases Feb 1, 2024 speech-recognition Speech Recognition
— Unverified 0Introduction to speech recognition Feb 1, 2024 Dynamic Time Warping speech-recognition
— Unverified 0SpeechComposer: Unifying Multiple Speech Tasks with Prompt Composition Jan 31, 2024 Decoder Language Modeling
— Unverified 0Computation and Parameter Efficient Multi-Modal Fusion Transformer for Cued Speech Recognition Jan 31, 2024 Lip Reading speech-recognition
— Unverified 0Exploring the limits of decoder-only models trained on public speech recognition corpora Jan 31, 2024 Decoder speech-recognition
— Unverified 0OWSM v3.1: Better and Faster Open Whisper-Style Speech Models based on E-Branchformer Jan 30, 2024 speech-recognition Speech Recognition
— Unverified 0On Speaker Attribution with SURT Jan 28, 2024 speech-recognition Speech Recognition
— Unverified 0Byte Pair Encoding Is All You Need For Automatic Bengali Speech Recognition Jan 28, 2024 All Automatic Speech Recognition
— Unverified 0Towards Event Extraction from Speech with Contextual Clues Jan 27, 2024 Event Extraction speech-recognition
Code Code Available 0Toward Practical Automatic Speech Recognition and Post-Processing: a Call for Explainable Error Benchmark Guideline Jan 26, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Comparison of parameters of vowel sounds of russian and english languages Jan 26, 2024 speech-recognition Speech Recognition
— Unverified 0MF-AED-AEC: Speech Emotion Recognition by Leveraging Multimodal Fusion, Asr Error Detection, and Asr Error Correction Jan 24, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0CNN architecture extraction on edge GPU Jan 24, 2024 GPU image-classification
— Unverified 0SpeechDPR: End-to-End Spoken Passage Retrieval for Open-Domain Spoken Question Answering Jan 24, 2024 Passage Retrieval Question Answering
— Unverified 0Multilingual and Fully Non-Autoregressive ASR with Large Language Model Fusion: A Comprehensive Study Jan 23, 2024 Language Modeling Language Modelling
— Unverified 0Locality enhanced dynamic biasing and sampling strategies for contextual ASR Jan 23, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Consistency Based Unsupervised Self-training For ASR Personalisation Jan 22, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0