Task Oriented Dialogue as a Catalyst for Self-Supervised Automatic Speech Recognition | Jan 4, 2024 | Attribute, Automatic Speech Recognition
Code Available 0
CTC Blank Triggered Dynamic Layer-Skipping for Efficient CTC-based Speech Recognition | Jan 4, 2024 | Knowledge Distillation, speech-recognition
Unverified 0
The Art of Deception: Robust Backdoor Attack using Dynamic Stacking of Triggers | Jan 3, 2024 | Backdoor Attack, speech-recognition
Unverified 0
Hallucinations in Neural Automatic Speech Recognition: Identifying Errors and Hallucinatory Models | Jan 3, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR)
Unverified 0
ES3: Evolving Self-Supervised Learning of Robust Audio-Visual Speech Representations | Jan 1, 2024 | Audio-Visual Speech Recognition, Lipreading
Unverified 0
Stateful Conformer with Cache-based Inference for Streaming Automatic Speech Recognition | Dec 27, 2023 | Automatic Speech Recognition, Decoder
Unverified 0
Towards Probing Contact Center Large Language Models | Dec 26, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR)
Unverified 0
The NUS-HLT System for ICASSP2024 ICMC-ASR Grand Challenge | Dec 26, 2023 | Automatic Speech Recognition, Data Augmentation
Unverified 0
Exploring data augmentation in bias mitigation against non-native-accented speech | Dec 24, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR)
Unverified 0
Multimodal Attention Merging for Improved Speech Recognition and Audio Event Classification | Dec 22, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR)
Unverified 0
BLSTM-Based Confidence Estimation for End-to-End Speech Recognition | Dec 22, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR)
Unverified 0
BANSpEmo: A Bangla Emotional Speech Recognition Dataset | Dec 21, 2023 | speech-recognition, Speech Recognition
Unverified 0
Multi-Sentence Grounding for Long-term Instructional Video | Dec 21, 2023 | Denoising, Descriptive
Unverified 0
Collaborative Learning with Artificial Intelligence Speakers (CLAIS): Pre-Service Elementary Science Teachers' Responses to the Prototype | Dec 20, 2023 | speech-recognition, Speech Recognition
Unverified 0
Lattice Rescoring Based on Large Ensemble of Complementary Neural Language Models | Dec 20, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR)
Unverified 0
Stable Distillation: Regularizing Continued Pre-training for Low-Resource Automatic Speech Recognition | Dec 20, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR)
Code Available 0
Automated speech audiometry: Can it work using open-source pre-trained Kaldi-NL automatic speech recognition? | Dec 19, 2023 | Automatic Speech Recognition, speech-recognition
Unverified 0
SpokesBiz -- an Open Corpus of Conversational Polish | Dec 19, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR)
Unverified 0
Generative linguistic representation for spoken language identification | Dec 18, 2023 | Decoder, Language Identification
Unverified 0
Efficiency-oriented approaches for self-supervised speech representation learning | Dec 18, 2023 | Automatic Speech Recognition, Representation Learning
Unverified 0
Improved Long-Form Speech Recognition by Jointly Modeling the Primary and Non-primary Speakers | Dec 18, 2023 | Form, speech-recognition
Unverified 0
Speaker Mask Transformer for Multi-talker Overlapped Speech Recognition | Dec 18, 2023 | speaker-diarization, Speaker Diarization
Unverified 0
OAVA: the open audio-visual archives aggregator | Dec 16, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR)
Unverified 0
Conformer-Based Speech Recognition On Extreme Edge-Computing Devices | Dec 16, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR)
Unverified 0
Seq2seq for Automatic Paraphasia Detection in Aphasic Speech | Dec 16, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR)
Code Available 0
Adaptive Computation Modules: Granular Conditional Computation For Efficient Inference | Dec 15, 2023 | Quantization, speech-recognition
Code Available 0
Leveraging Language ID to Calculate Intermediate CTC Loss for Enhanced Code-Switching Speech Recognition | Dec 15, 2023 | Automatic Speech Recognition, Language Identification
Unverified 0
Phoneme-aware Encoding for Prefix-tree-based Contextual ASR | Dec 15, 2023 | speech-recognition, Speech Recognition
Unverified 0
Generative Context-aware Fine-tuning of Self-supervised Speech Models | Dec 15, 2023 | Automatic Speech Recognition, named-entity-recognition
Unverified 0
IR-UWB Radar-Based Contactless Silent Speech Recognition of Vowels, Consonants, Words, and Phrases | Dec 15, 2023 | Dynamic Time Warping, Silent Speech Recognition
Unverified 0
FlowMur: A Stealthy and Practical Audio Backdoor Attack with Limited Knowledge | Dec 15, 2023 | Backdoor Attack, Data Poisoning
Code Available 1
LiteVSR: Efficient Visual Speech Recognition by Learning from Speech Representations of Unlabeled Data | Dec 15, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR)
Unverified 0
FastInject: Injecting Unpaired Text Data into CTC-based ASR training | Dec 14, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR)
Unverified 0
Towards Automatic Data Augmentation for Disordered Speech Recognition | Dec 14, 2023 | Data Augmentation, Reinforcement Learning (RL)
Unverified 0
Attention-Guided Adaptation for Code-Switching Speech Recognition | Dec 14, 2023 | Language Identification, speech-recognition
Unverified 0
Audio-visual fine-tuning of audio-only ASR models | Dec 14, 2023 | Automatic Speech Recognition, Self-Supervised Learning
Unverified 0
Personalized Autonomous Driving with Large Language Models: Field Experiments | Dec 14, 2023 | Autonomous Driving, Autonomous Vehicles
Code Available 1
Revisiting the Entropy Semiring for Neural Speech Recognition | Dec 13, 2023 | speech-recognition, Speech Recognition
Unverified 0
On Robustness to Missing Video for Audiovisual Speech Recognition | Dec 13, 2023 | speech-recognition, Speech Recognition
Unverified 0
USM-Lite: Quantization and Sparsity Aware Fine-tuning for Speech Recognition with Universal Speech Models | Dec 13, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR)
Unverified 0
PhasePerturbation: Speech Data Augmentation via Phase Perturbation for Automatic Speech Recognition | Dec 13, 2023 | Automatic Speech Recognition, Data Augmentation
Unverified 0
Efficient Representation of the Activation Space in Deep Neural Networks | Dec 13, 2023 | Anomaly Detection, speech-recognition
Unverified 0
Extending Whisper with prompt tuning to target-speaker ASR | Dec 13, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR)
Code Available 1
Self-supervised Adaptive Pre-training of Multilingual Speech Models for Language and Dialect Identification | Dec 12, 2023 | Automatic Speech Recognition, Dialect Identification
Unverified 0
The GUA-Speech System Description for CNVSRC Challenge 2023 | Dec 12, 2023 | Decoder, Language Modeling
Unverified 0
Deep Photonic Reservoir Computer for Speech Recognition | Dec 11, 2023 | speech-recognition, Speech Recognition
Unverified 0
Creating Spoken Dialog Systems in Ultra-Low Resourced Settings | Dec 11, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR)
Unverified 0
ROSE: A Recognition-Oriented Speech Enhancement Framework in Air Traffic Control Using Multi-Objective Learning | Dec 11, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR)
Code Available 0
Revisiting the Role of Label Smoothing in Enhanced Text Sentiment Classification | Dec 11, 2023 | Classification, image-classification
Unverified 0
Batched Low-Rank Adaptation of Foundation Models | Dec 9, 2023 | Code Generation, speech-recognition
Unverified 0