An efficient text augmentation approach for contextualized Mandarin speech recognition Jun 14, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Whisper-Flamingo: Integrating Visual Features into Whisper for Audio-Visual Speech Recognition and Translation Jun 14, 2024 Audio-Visual Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 3Inclusive ASR for Disfluent Speech: Cascaded Large-Scale Self-Supervised Learning with Targeted Fine-Tuning and Data Augmentation Jun 14, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Multi-Channel Multi-Speaker ASR Using Target Speaker's Solo Segment Jun 13, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Transcription-Free Fine-Tuning of Speech Separation Models for Noisy and Reverberant Multi-Speaker Automatic Speech Recognition Jun 13, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0The Second DISPLACE Challenge : DIarization of SPeaker and LAnguage in Conversational Environments Jun 13, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0LASER: Learning by Aligning Self-supervised Representations of Speech for Improving Content-related Tasks Jun 13, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Language Complexity and Speech Recognition Accuracy: Orthographic Complexity Hurts, Phonological Complexity Doesn't Jun 13, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Audio-conditioned phonemic and prosodic annotation for building text-to-speech models from unlabeled speech data Jun 12, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0DualVC 3: Leveraging Language Model Generated Pseudo Context for End-to-end Low Latency Streaming Voice Conversion Jun 12, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0ML-SUPERB 2.0: Benchmarking Multilingual Speech Models Across Modeling Constraints, Languages, and Datasets Jun 12, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Towards Unsupervised Speech Recognition Without Pronunciation Models Jun 12, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Guiding Frame-Level CTC Alignments Using Self-knowledge Distillation Jun 12, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0PRoDeliberation: Parallel Robust Deliberation for End-to-End Spoken Language Understanding Jun 12, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Speech Emotion Recognition with ASR Transcripts: A Comprehensive Study on Word Error Rate and Fusion Techniques Jun 12, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Transformer-based Model for ASR N-Best Rescoring and Rewriting Jun 12, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Reading Miscue Detection in Primary School through Automatic Speech Recognition Jun 11, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Fast Context-Biasing for CTC and Transducer ASR models with CTC-based Word Spotter Jun 11, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0AS-70: A Mandarin stuttered speech dataset for automatic speech recognition and stuttering event detection Jun 11, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0mHuBERT-147: A Compact Multilingual HuBERT Model Jun 10, 2024 Automatic Speech Recognition (ASR) Diversity
Code Code Available 0ASTRA: Aligning Speech and Text Representations for Asr without Sampling Jun 10, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0MS-HuBERT: Mitigating Pre-training and Inference Mismatch in Masked Language Modelling methods for learning Speech Representations Jun 9, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0LoRA-Whisper: Parameter-Efficient and Extensible Multilingual ASR Jun 7, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Pitch-Aware RNN-T for Mandarin Chinese Mispronunciation Detection and Diagnosis Jun 7, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Flexible Multichannel Speech Enhancement for Noise-Robust Frontend Jun 6, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0To Distill or Not to Distill? On the Robustness of Robust Knowledge Distillation Jun 6, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Hypernetworks for Personalizing ASR to Atypical Speech Jun 6, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition Jun 6, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Improving Zero-Shot Chinese-English Code-Switching ASR with kNN-CTC and Gated Monolingual Datastores Jun 6, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Text Injection for Neural Contextual Biasing Jun 5, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Task Arithmetic can Mitigate Synthetic-to-Real Gap in Automatic Speech Recognition Jun 5, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0StreamSpeech: Simultaneous Speech-to-Speech Translation with Multi-task Learning Jun 5, 2024 Automatic Speech Recognition (ASR) de-en
Code Code Available 5Enhancing CTC-based speech recognition with diverse modeling units Jun 5, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Keyword-Guided Adaptation of Automatic Speech Recognition Jun 4, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Efficiently Train ASR Models that Memorize Less and Perform Better with Per-core Clipping Jun 4, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Enabling ASR for Low-Resource Languages: A Comprehensive Dataset Creation Approach Jun 3, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Wav2Prompt: End-to-End Speech Prompt Generation and Tuning For LLM in Zero and Few-shot Learning Jun 1, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Zipper: A Multi-Tower Decoder Architecture for Fusing Modalities May 29, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Intelligent Clinical Documentation: Harnessing Generative AI for Patient-Centric Clinical Note Generation May 28, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0A Variance-Preserving Interpolation Approach for Diffusion Models with Applications to Single Channel Speech Enhancement and Recognition May 27, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Denoising LM: Pushing the Limits of Error Correction Models for Speech Recognition May 24, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Contrastive and Consistency Learning for Neural Noisy-Channel Model in Spoken Language Understanding May 23, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Let's Fuse Step by Step: A Generative Fusion Decoding Algorithm with LLMs for Multi-modal Text Recognition May 23, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 2Self-Taught Recognizer: Toward Unsupervised Adaptation for Speech Foundation Models May 23, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 3You don't understand me!: Comparing ASR results for L1 and L2 speakers of Swedish May 22, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Joint Optimization of Streaming and Non-Streaming Automatic Speech Recognition with Multi-Decoder and Knowledge Distillation May 22, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0FairLENS: Assessing Fairness in Law Enforcement Speech Recognition May 21, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Listen Again and Choose the Right Answer: A New Paradigm for Automatic Speech Recognition with Large Language Models May 16, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Continued Pretraining for Domain Adaptation of Wav2vec2.0 in Automatic Speech Recognition for Elementary Math Classroom Settings May 15, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Towards Evaluating the Robustness of Automatic Speech Recognition Systems via Audio Style Transfer May 15, 2024 Adversarial Attack Automatic Speech Recognition
— Unverified 0