Dynamic Data Pruning for Automatic Speech Recognition | Jun 26, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
ArzEn-LLM: Code-Switched Egyptian Arabic-English Translation and Speech Recognition Using LLMs | Jun 26, 2024 | ArzEn Code-switched Translation to ara, ArzEn Code-switched Translation to eng | Code Available | 1
FASA: a Flexible and Automatic Speech Aligner for Extracting High-quality Aligned Children Speech Data | Jun 25, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0
Automatic speech recognition for the Nepali language using CNN, bidirectional LSTM and ResNet | Jun 25, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1
Sequential Editing for Lifelong Training of Speech Recognition Models | Jun 25, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
MSRS: Training Multimodal Speech Recognition Models from Scratch with Sparse Mask Optimization | Jun 25, 2024 | Audio-Visual Speech Recognition, speech-recognition | Unverified | 0
Towards Building an End-to-End Multilingual Automatic Lyrics Transcription Model | Jun 25, 2024 | Automatic Lyrics Transcription, Automatic Speech Recognition | Code Available | 1
A Comprehensive Solution to Connect Speech Encoder and Large Language Model for ASR | Jun 25, 2024 | Language Modeling, Language Modelling | Unverified | 0
Investigating Confidence Estimation Measures for Speaker Diarization | Jun 24, 2024 | speaker-diarization, Speaker Diarization | Unverified | 0
Blending LLMs into Cascaded Speech Translation: KIT's Offline Speech Translation System for IWSLT 2024 | Jun 24, 2024 | Action Detection, Activity Detection | Unverified | 0
Decoder-only Architecture for Streaming End-to-end Speech Recognition | Jun 23, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
Contextualized End-to-end Automatic Speech Recognition with Intermediate Biasing Loss | Jun 23, 2024 | Automatic Speech Recognition, speech-recognition | Unverified | 0
Acoustic Feature Mixup for Balanced Multi-aspect Pronunciation Assessment | Jun 22, 2024 | speech-recognition, Speech Recognition | Unverified | 0
PI-Whisper: Designing an Adaptive and Incremental Automatic Speech Recognition System for Edge Devices | Jun 21, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
InterBiasing: Boost Unseen Word Recognition through Biasing Intermediate Predictions | Jun 21, 2024 | speech-recognition, Speech Recognition | Unverified | 0
Perception of Phonological Assimilation by Neural Speech Recognition Models | Jun 21, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
An Adapter-Based Unified Model for Multiple Spoken Language Processing Tasks | Jun 20, 2024 | Automatic Speech Recognition, Decoder | Unverified | 0
Intelligent Interface: Enhancing Lecture Engagement with Didactic Activity Summaries | Jun 20, 2024 | Automatic Speech Recognition, speech-recognition | Unverified | 0
DASB -- Discrete Audio and Speech Benchmark | Jun 20, 2024 | Benchmarking, Emotion Recognition | Unverified | 0
Joint vs Sequential Speaker-Role Detection and Automatic Speech Recognition for Air-traffic Control | Jun 19, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
Children's Speech Recognition through Discrete Token Enhancement | Jun 19, 2024 | speech-recognition, Speech Recognition | Unverified | 0
ManWav: The First Manchu ASR Model | Jun 19, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
Rapid Language Adaptation for Multilingual E2E Speech Recognition Using Encoder Prompting | Jun 18, 2024 | Decoder, Language Identification | Unverified | 0
Transcribe, Align and Segment: Creating speech datasets for low-resource languages | Jun 18, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
Finding Task-specific Subnetworks in Multi-task Spoken Language Understanding Model | Jun 18, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
Performant ASR Models for Medical Entities in Accented Speech | Jun 18, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
Growing Trees on Sounds: Assessing Strategies for End-to-End Dependency Parsing of Speech | Jun 18, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0
Unsupervised Online Continual Learning for Automatic Speech Recognition | Jun 18, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0
SyncVSR: Data-Efficient Visual Speech Recognition with End-to-End Crossmodal Audio Token Synchronization | Jun 18, 2024 | Landmark-based Lipreading, Lipreading | Code Available | 2
Self-Train Before You Transcribe | Jun 17, 2024 | Domain Adaptation, Language Modelling | Code Available | 0
GigaSpeech 2: An Evolving, Large-Scale and Multi-domain ASR Corpus for Low-Resource Languages with Automated Crawling, Transcription and Refinement | Jun 17, 2024 | speech-recognition, Speech Recognition | Code Available | 3
Automatic Speech Recognition for Biomedical Data in Bengali Language | Jun 16, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
Continual Test-time Adaptation for End-to-end Speech Recognition on Noisy Speech | Jun 16, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1
Large Language Models for Dysfluency Detection in Stuttered Speech | Jun 16, 2024 | Automatic Speech Recognition, Language Modeling | Unverified | 0
Optimized Speculative Sampling for GPU Hardware Accelerators | Jun 16, 2024 | Automatic Speech Recognition, GPU | Code Available | 0
Imperceptible Rhythm Backdoor Attacks: Exploring Rhythm Transformation for Embedding Undetectable Vulnerabilities on Speech Recognition | Jun 16, 2024 | Automatic Speech Recognition, Data Poisoning | Unverified | 0
CoSTA: Code-Switched Speech Translation using Aligned Speech-Text Interleaving | Jun 16, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
Trading Devil: Robust backdoor attack via Stochastic investment models and Bayesian approach | Jun 15, 2024 | Backdoor Attack, speech-recognition | Unverified | 0
Speech Emotion Recognition Using CNN and Its Use Case in Digital Healthcare | Jun 15, 2024 | Emotion Recognition, Speech Emotion Recognition | Unverified | 0
ROAR: Reinforcing Original to Augmented Data Ratio Dynamics for Wav2Vec2.0 Based ASR | Jun 14, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
CNVSRC 2023: The First Chinese Continuous Visual Speech Recognition Challenge | Jun 14, 2024 | speech-recognition, Speech Recognition | Unverified | 0
Optimizing Byte-level Representation for End-to-end ASR | Jun 14, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
On the Evaluation of Speech Foundation Models for Spoken Language Understanding | Jun 14, 2024 | Benchmarking, Prediction | Unverified | 0
An efficient text augmentation approach for contextualized Mandarin speech recognition | Jun 14, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
Simul-Whisper: Attention-Guided Streaming Whisper with Truncation Detection | Jun 14, 2024 | Decoder, speech-recognition | Code Available | 2
Whisper-Flamingo: Integrating Visual Features into Whisper for Audio-Visual Speech Recognition and Translation | Jun 14, 2024 | Audio-Visual Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 3
Perceiver-Prompt: Flexible Speaker Adaptation in Whisper for Chinese Disordered Speech Recognition | Jun 14, 2024 | speech-recognition, Speech Recognition | Unverified | 0
Learning Language Structures through Grounding | Jun 14, 2024 | Automatic Speech Recognition, Dependency Parsing | Unverified | 0
Inclusive ASR for Disfluent Speech: Cascaded Large-Scale Self-Supervised Learning with Targeted Fine-Tuning and Data Augmentation | Jun 14, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
Multi-Channel Multi-Speaker ASR Using Target Speaker's Solo Segment | Jun 13, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0