Multi-Sentence Grounding for Long-term Instructional Video Dec 21, 2023 Denoising Descriptive
— Unverified 0Stable Distillation: Regularizing Continued Pre-training for Low-Resource Automatic Speech Recognition Dec 20, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Lattice Rescoring Based on Large Ensemble of Complementary Neural Language Models Dec 20, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Collaborative Learning with Artificial Intelligence Speakers (CLAIS): Pre-Service Elementary Science Teachers' Responses to the Prototype Dec 20, 2023 speech-recognition Speech Recognition
— Unverified 0Automated speech audiometry: Can it work using open-source pre-trained Kaldi-NL automatic speech recognition? Dec 19, 2023 Automatic Speech Recognition speech-recognition
— Unverified 0SpokesBiz -- an Open Corpus of Conversational Polish Dec 19, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Efficiency-oriented approaches for self-supervised speech representation learning Dec 18, 2023 Automatic Speech Recognition Representation Learning
— Unverified 0Speaker Mask Transformer for Multi-talker Overlapped Speech Recognition Dec 18, 2023 speaker-diarization Speaker Diarization
— Unverified 0Generative linguistic representation for spoken language identification Dec 18, 2023 Decoder Language Identification
— Unverified 0Improved Long-Form Speech Recognition by Jointly Modeling the Primary and Non-primary Speakers Dec 18, 2023 Form speech-recognition
— Unverified 0Seq2seq for Automatic Paraphasia Detection in Aphasic Speech Dec 16, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Conformer-Based Speech Recognition On Extreme Edge-Computing Devices Dec 16, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0OAVA: the open audio-visual archives aggregator Dec 16, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Generative Context-aware Fine-tuning of Self-supervised Speech Models Dec 15, 2023 Automatic Speech Recognition named-entity-recognition
— Unverified 0Adaptive Computation Modules: Granular Conditional Computation For Efficient Inference Dec 15, 2023 Quantization speech-recognition
Code Code Available 0Phoneme-aware Encoding for Prefix-tree-based Contextual ASR Dec 15, 2023 speech-recognition Speech Recognition
— Unverified 0LiteVSR: Efficient Visual Speech Recognition by Learning from Speech Representations of Unlabeled Data Dec 15, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Leveraging Language ID to Calculate Intermediate CTC Loss for Enhanced Code-Switching Speech Recognition Dec 15, 2023 Automatic Speech Recognition Language Identification
— Unverified 0IR-UWB Radar-Based Contactless Silent Speech Recognition of Vowels, Consonants, Words, and Phrases Dec 15, 2023 Dynamic Time Warping Silent Speech Recognition
— Unverified 0Towards Automatic Data Augmentation for Disordered Speech Recognition Dec 14, 2023 Data Augmentation Reinforcement Learning (RL)
— Unverified 0Audio-visual fine-tuning of audio-only ASR models Dec 14, 2023 Automatic Speech Recognition Self-Supervised Learning
— Unverified 0Attention-Guided Adaptation for Code-Switching Speech Recognition Dec 14, 2023 Language Identification speech-recognition
— Unverified 0FastInject: Injecting Unpaired Text Data into CTC-based ASR training Dec 14, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0On Robustness to Missing Video for Audiovisual Speech Recognition Dec 13, 2023 speech-recognition Speech Recognition
— Unverified 0USM-Lite: Quantization and Sparsity Aware Fine-tuning for Speech Recognition with Universal Speech Models Dec 13, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Efficient Representation of the Activation Space in Deep Neural Networks Dec 13, 2023 Anomaly Detection speech-recognition
— Unverified 0PhasePerturbation: Speech Data Augmentation via Phase Perturbation for Automatic Speech Recognition Dec 13, 2023 Automatic Speech Recognition Data Augmentation
— Unverified 0Revisiting the Entropy Semiring for Neural Speech Recognition Dec 13, 2023 speech-recognition Speech Recognition
— Unverified 0The GUA-Speech System Description for CNVSRC Challenge 2023 Dec 12, 2023 Decoder Language Modeling
— Unverified 0Self-supervised Adaptive Pre-training of Multilingual Speech Models for Language and Dialect Identification Dec 12, 2023 Automatic Speech Recognition Dialect Identification
— Unverified 0Creating Spoken Dialog Systems in Ultra-Low Resourced Settings Dec 11, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0ROSE: A Recognition-Oriented Speech Enhancement Framework in Air Traffic Control Using Multi-Objective Learning Dec 11, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Deep Photonic Reservoir Computer for Speech Recognition Dec 11, 2023 speech-recognition Speech Recognition
— Unverified 0Revisiting the Role of Label Smoothing in Enhanced Text Sentiment Classification Dec 11, 2023 Classification image-classification
— Unverified 0Batched Low-Rank Adaptation of Foundation Models Dec 9, 2023 Code Generation speech-recognition
— Unverified 0Keyword spotting -- Detecting commands in speech using deep learning Dec 9, 2023 Deep Learning Feature Engineering
— Unverified 0A Review of Hybrid and Ensemble in Deep Learning for Natural Language Processing Dec 9, 2023 Deep Learning Language Modeling
— Unverified 0FreqFed: A Frequency Analysis-Based Approach for Mitigating Poisoning Attacks in Federated Learning Dec 7, 2023 Federated Learning image-classification
— Unverified 0Optimizing Two-Pass Cross-Lingual Transfer Learning: Phoneme Recognition and Phoneme to Grapheme Translation Dec 6, 2023 Cross-Lingual Transfer Phoneme Recognition
— Unverified 0Multimodal Data and Resource Efficient Device-Directed Speech Detection with Large Foundation Models Dec 6, 2023 Automatic Speech Recognition Decoder
— Unverified 0Integrating Pre-Trained Speech and Language Models for End-to-End Speech Recognition Dec 6, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0PMMTalk: Speech-Driven 3D Facial Animation from Complementary Pseudo Multi-modal Features Dec 5, 2023 cross-modal alignment Decoder
— Unverified 0Bigger is not Always Better: The Effect of Context Size on Speech Pre-Training Dec 3, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0End-to-End Speech-to-Text Translation: A Survey Dec 2, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Self Generated Wargame AI: Double Layer Agent Task Planning Based on Large Language Model Dec 2, 2023 Decision Making Language Modeling
— Unverified 0Mavericks at NADI 2023 Shared Task: Unravelling Regional Nuances through Dialect Identification using Transformer-based Approach Nov 30, 2023 Dialect Identification Multi-class Classification
— Unverified 0Speech Understanding on Tiny Devices with A Learning Cache Nov 30, 2023 speech-recognition Speech Recognition
Code Code Available 0Adapting OpenAI's Whisper for Speech Recognition on Code-Switch Mandarin-English SEAME and ASRU2019 Datasets Nov 29, 2023 speech-recognition Speech Recognition
— Unverified 0End-to-end Joint Punctuated and Normalized ASR with a Limited Amount of Punctuated Training Data Nov 29, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Phonetic-aware speaker embedding for far-field speaker verification Nov 27, 2023 Speaker Recognition Speaker Verification
— Unverified 0