An efficient text augmentation approach for contextualized Mandarin speech recognition Jun 14, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Optimizing Byte-level Representation for End-to-end ASR Jun 14, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Inclusive ASR for Disfluent Speech: Cascaded Large-Scale Self-Supervised Learning with Targeted Fine-Tuning and Data Augmentation Jun 14, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0ROAR: Reinforcing Original to Augmented Data Ratio Dynamics for Wav2Vec2.0 Based ASR Jun 14, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Language Complexity and Speech Recognition Accuracy: Orthographic Complexity Hurts, Phonological Complexity Doesn't Jun 13, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Multi-Channel Multi-Speaker ASR Using Target Speaker's Solo Segment Jun 13, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0The Second DISPLACE Challenge : DIarization of SPeaker and LAnguage in Conversational Environments Jun 13, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0LASER: Learning by Aligning Self-supervised Representations of Speech for Improving Content-related Tasks Jun 13, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Transcription-Free Fine-Tuning of Speech Separation Models for Noisy and Reverberant Multi-Speaker Automatic Speech Recognition Jun 13, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Speech Emotion Recognition with ASR Transcripts: A Comprehensive Study on Word Error Rate and Fusion Techniques Jun 12, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0DualVC 3: Leveraging Language Model Generated Pseudo Context for End-to-end Low Latency Streaming Voice Conversion Jun 12, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Transformer-based Model for ASR N-Best Rescoring and Rewriting Jun 12, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Audio-conditioned phonemic and prosodic annotation for building text-to-speech models from unlabeled speech data Jun 12, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Towards Unsupervised Speech Recognition Without Pronunciation Models Jun 12, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0ML-SUPERB 2.0: Benchmarking Multilingual Speech Models Across Modeling Constraints, Languages, and Datasets Jun 12, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Guiding Frame-Level CTC Alignments Using Self-knowledge Distillation Jun 12, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0PRoDeliberation: Parallel Robust Deliberation for End-to-End Spoken Language Understanding Jun 12, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Fast Context-Biasing for CTC and Transducer ASR models with CTC-based Word Spotter Jun 11, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0AS-70: A Mandarin stuttered speech dataset for automatic speech recognition and stuttering event detection Jun 11, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Reading Miscue Detection in Primary School through Automatic Speech Recognition Jun 11, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0mHuBERT-147: A Compact Multilingual HuBERT Model Jun 10, 2024 Automatic Speech Recognition (ASR) Diversity
Code Code Available 0ASTRA: Aligning Speech and Text Representations for Asr without Sampling Jun 10, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0MS-HuBERT: Mitigating Pre-training and Inference Mismatch in Masked Language Modelling methods for learning Speech Representations Jun 9, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Pitch-Aware RNN-T for Mandarin Chinese Mispronunciation Detection and Diagnosis Jun 7, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0LoRA-Whisper: Parameter-Efficient and Extensible Multilingual ASR Jun 7, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Flexible Multichannel Speech Enhancement for Noise-Robust Frontend Jun 6, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Improving Zero-Shot Chinese-English Code-Switching ASR with kNN-CTC and Gated Monolingual Datastores Jun 6, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Hypernetworks for Personalizing ASR to Atypical Speech Jun 6, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0To Distill or Not to Distill? On the Robustness of Robust Knowledge Distillation Jun 6, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Enhancing CTC-based speech recognition with diverse modeling units Jun 5, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Task Arithmetic can Mitigate Synthetic-to-Real Gap in Automatic Speech Recognition Jun 5, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Text Injection for Neural Contextual Biasing Jun 5, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Keyword-Guided Adaptation of Automatic Speech Recognition Jun 4, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Efficiently Train ASR Models that Memorize Less and Perform Better with Per-core Clipping Jun 4, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Enabling ASR for Low-Resource Languages: A Comprehensive Dataset Creation Approach Jun 3, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Wav2Prompt: End-to-End Speech Prompt Generation and Tuning For LLM in Zero and Few-shot Learning Jun 1, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Zipper: A Multi-Tower Decoder Architecture for Fusing Modalities May 29, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Intelligent Clinical Documentation: Harnessing Generative AI for Patient-Centric Clinical Note Generation May 28, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Denoising LM: Pushing the Limits of Error Correction Models for Speech Recognition May 24, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Contrastive and Consistency Learning for Neural Noisy-Channel Model in Spoken Language Understanding May 23, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Joint Optimization of Streaming and Non-Streaming Automatic Speech Recognition with Multi-Decoder and Knowledge Distillation May 22, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0You don't understand me!: Comparing ASR results for L1 and L2 speakers of Swedish May 22, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0FairLENS: Assessing Fairness in Law Enforcement Speech Recognition May 21, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Listen Again and Choose the Right Answer: A New Paradigm for Automatic Speech Recognition with Large Language Models May 16, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Continued Pretraining for Domain Adaptation of Wav2vec2.0 in Automatic Speech Recognition for Elementary Math Classroom Settings May 15, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Towards Evaluating the Robustness of Automatic Speech Recognition Systems via Audio Style Transfer May 15, 2024 Adversarial Attack Automatic Speech Recognition
— Unverified 0Lost in Transcription: Identifying and Quantifying the Accuracy Biases of Automatic Speech Recognition Systems Against Disfluent Speech May 10, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Open Implementation and Study of BEST-RQ for Speech Processing May 7, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0MMGER: Multi-modal and Multi-granularity Generative Error Correction with LLM for Joint Accent and Speech Recognition May 6, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Combining X-Vectors and Bayesian Batch Active Learning: Two-Stage Active Learning Pipeline for Speech Recognition May 3, 2024 Active Learning Automatic Speech Recognition
— Unverified 0