| Codec-ASR: Training Performant Automatic Speech Recognition Systems with Discrete Speech Representations | Jul 3, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| The USTC-NERCSLIP Systems for The ICMC-ASR Challenge | Jul 2, 2024 | Automatic Speech Recognition, Pseudo Label | Unverified | 0 |
| Error Correction by Paying Attention to Both Acoustic and Confidence References for Automatic Speech Recognition | Jun 29, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Voices Unheard: NLP Resources and Models for Yorùbá Regional Dialects | Jun 27, 2024 | Automatic Speech Recognition, Machine Translation | Code Available | 0 |
| Tradition or Innovation: A Comparison of Modern ASR Methods for Forced Alignment | Jun 27, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Enhanced ASR Robustness to Packet Loss with a Front-End Adaptation Network | Jun 27, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0 |
| Applying LLMs for Rescoring N-best ASR Hypotheses of Casual Conversations: Effects of Domain Adaptation and Context Carry-over | Jun 27, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| SC-MoE: Switch Conformer Mixture of Experts for Unified Streaming and Non-streaming Code-Switching ASR | Jun 26, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Automatic Speech Recognition for Hindi | Jun 26, 2024 | Action Detection, Activity Detection | Unverified | 0 |
| Dynamic Data Pruning for Automatic Speech Recognition | Jun 26, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| MSR-86K: An Evolving, Multilingual Corpus with 86,300 Hours of Transcribed Audio for Speech Recognition Research | Jun 26, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Sequential Editing for Lifelong Training of Speech Recognition Models | Jun 25, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| FASA: a Flexible and Automatic Speech Aligner for Extracting High-quality Aligned Children Speech Data | Jun 25, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0 |
| Blending LLMs into Cascaded Speech Translation: KIT's Offline Speech Translation System for IWSLT 2024 | Jun 24, 2024 | Action Detection, Activity Detection | Unverified | 0 |
| Decoder-only Architecture for Streaming End-to-end Speech Recognition | Jun 23, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Contextualized End-to-end Automatic Speech Recognition with Intermediate Biasing Loss | Jun 23, 2024 | Automatic Speech Recognition, speech-recognition | Unverified | 0 |
| Perception of Phonological Assimilation by Neural Speech Recognition Models | Jun 21, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| PI-Whisper: Designing an Adaptive and Incremental Automatic Speech Recognition System for Edge Devices | Jun 21, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| An Adapter-Based Unified Model for Multiple Spoken Language Processing Tasks | Jun 20, 2024 | Automatic Speech Recognition, Decoder | Unverified | 0 |
| Intelligent Interface: Enhancing Lecture Engagement with Didactic Activity Summaries | Jun 20, 2024 | Automatic Speech Recognition, speech-recognition | Unverified | 0 |
| ManWav: The First Manchu ASR Model | Jun 19, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Joint vs Sequential Speaker-Role Detection and Automatic Speech Recognition for Air-traffic Control | Jun 19, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Transcribe, Align and Segment: Creating speech datasets for low-resource languages | Jun 18, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Finding Task-specific Subnetworks in Multi-task Spoken Language Understanding Model | Jun 18, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Unsupervised Online Continual Learning for Automatic Speech Recognition | Jun 18, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0 |
| Performant ASR Models for Medical Entities in Accented Speech | Jun 18, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Growing Trees on Sounds: Assessing Strategies for End-to-End Dependency Parsing of Speech | Jun 18, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0 |
| Automatic Speech Recognition for Biomedical Data in Bengali Language | Jun 16, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| CoSTA: Code-Switched Speech Translation using Aligned Speech-Text Interleaving | Jun 16, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Large Language Models for Dysfluency Detection in Stuttered Speech | Jun 16, 2024 | Automatic Speech Recognition, Language Modeling | Unverified | 0 |
| Imperceptible Rhythm Backdoor Attacks: Exploring Rhythm Transformation for Embedding Undetectable Vulnerabilities on Speech Recognition | Jun 16, 2024 | Automatic Speech Recognition, Data Poisoning | Unverified | 0 |
| Optimized Speculative Sampling for GPU Hardware Accelerators | Jun 16, 2024 | Automatic Speech Recognition, GPU | Code Available | 0 |
| Learning Language Structures through Grounding | Jun 14, 2024 | Automatic Speech Recognition, Dependency Parsing | Unverified | 0 |
| Optimizing Byte-level Representation for End-to-end ASR | Jun 14, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Inclusive ASR for Disfluent Speech: Cascaded Large-Scale Self-Supervised Learning with Targeted Fine-Tuning and Data Augmentation | Jun 14, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| ROAR: Reinforcing Original to Augmented Data Ratio Dynamics for Wav2Vec2.0 Based ASR | Jun 14, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| An efficient text augmentation approach for contextualized Mandarin speech recognition | Jun 14, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Language Complexity and Speech Recognition Accuracy: Orthographic Complexity Hurts, Phonological Complexity Doesn't | Jun 13, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0 |
| Transcription-Free Fine-Tuning of Speech Separation Models for Noisy and Reverberant Multi-Speaker Automatic Speech Recognition | Jun 13, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Multi-Modal Retrieval For Large Language Model Based Speech Recognition | Jun 13, 2024 | Automatic Speech Recognition, Language Modeling | Unverified | 0 |
| The Second DISPLACE Challenge : DIarization of SPeaker and LAnguage in Conversational Environments | Jun 13, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Multi-Channel Multi-Speaker ASR Using Target Speaker's Solo Segment | Jun 13, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| LASER: Learning by Aligning Self-supervised Representations of Speech for Improving Content-related Tasks | Jun 13, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0 |
| DualVC 3: Leveraging Language Model Generated Pseudo Context for End-to-end Low Latency Streaming Voice Conversion | Jun 12, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Audio-conditioned phonemic and prosodic annotation for building text-to-speech models from unlabeled speech data | Jun 12, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Dual-Pipeline with Low-Rank Adaptation for New Language Integration in Multilingual ASR | Jun 12, 2024 | Automatic Speech Recognition, Decoder | Unverified | 0 |
| Speech Emotion Recognition with ASR Transcripts: A Comprehensive Study on Word Error Rate and Fusion Techniques | Jun 12, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0 |
| Towards Unsupervised Speech Recognition Without Pronunciation Models | Jun 12, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0 |
| Guiding Frame-Level CTC Alignments Using Self-knowledge Distillation | Jun 12, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0 |
| Transformer-based Model for ASR N-Best Rescoring and Rewriting | Jun 12, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |