On the Relevance of Phoneme Duration Variability of Synthesized Training Data for Automatic Speech Recognition Oct 12, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Adapting the adapters for code-switching in multilingual ASR Oct 11, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Discriminative Speech Recognition Rescoring with Pre-trained Language Models Oct 10, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Acoustic Model Fusion for End-to-end Speech Recognition Oct 10, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0No Pitch Left Behind: Addressing Gender Unbalance in Automatic Speech Recognition through Pitch Manipulation Oct 10, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Whispering LLaMA: A Cross-Modal Generative Error Correction Framework for Speech Recognition Oct 10, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 2Leveraging Multilingual Self-Supervised Pretrained Models for Sequence-to-Sequence End-to-End Spoken Language Understanding Oct 9, 2023 slot-filling Slot Filling
Code Code Available 0Improving End-to-End Speech Processing by Efficient Text Data Utilization with Latent Synthesis Oct 9, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Findings of the 2023 ML-SUPERB Challenge: Pre-Training and Evaluation over More Languages and Beyond Oct 9, 2023 Language Identification speech-recognition
— Unverified 0ed-cec: improving rare word recognition using asr postprocessing based on error detection and context-aware error correction Oct 8, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Spike-Triggered Contextual Biasing for End-to-End Mandarin Speech Recognition Oct 7, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0End-to-End Lip Reading in Romanian with Cross-Lingual Domain Adaptation and Lateral Inhibition Oct 7, 2023 Domain Adaptation Lip Reading
— Unverified 0LauraGPT: Listen, Attend, Understand, and Regenerate Audio with GPT Oct 7, 2023 Audio captioning Automatic Speech Recognition
Code Code Available 2A privacy-preserving method using secret key for convolutional neural network-based speech classification Oct 6, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Dementia Assessment Using Mandarin Speech with an Attention-based Speech Recognition Encoder Oct 6, 2023 Alzheimer's Disease Detection speech-recognition
Code Code Available 0HuBERTopic: Enhancing Semantic Representation of HuBERT through Self-supervision Utilizing Topic Model Oct 6, 2023 Automatic Speech Recognition Representation Learning
— Unverified 0Challenges and Insights: Exploring 3D Spatial Features and Complex Networks on the MISP Dataset Oct 5, 2023 speech-recognition Speech Recognition
— Unverified 0Neural Language Model Pruning for Automatic Speech Recognition Oct 5, 2023 Automatic Speech Recognition Language Modeling
— Unverified 0The North System for Formosa Speech Recognition Challenge 2023 Oct 5, 2023 speech-recognition Speech Recognition
— Unverified 0DecoderLens: Layerwise Interpretation of Encoder-Decoder Transformers Oct 5, 2023 Decoder Logical Reasoning
Code Code Available 0An Integrated Algorithm for Robust and Imperceptible Audio Adversarial Examples Oct 5, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0LibriSpeech-PC: Benchmark for Evaluation of Punctuation and Capitalization Capabilities of end-to-end ASR Models Oct 4, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 2UniverSLU: Universal Spoken Language Understanding for Diverse Tasks with Natural Language Instructions Oct 4, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0ResidualTransformer: Residual Low-Rank Learning with Weight-Sharing for Transformer Layers Oct 3, 2023 speech-recognition Speech Recognition
— Unverified 0Unsupervised Speech Recognition with N-Skipgram and Positional Unigram Matching Oct 3, 2023 speech-recognition Speech Recognition
Code Code Available 1One model to rule them all ? Towards End-to-End Joint Speaker Diarization and Speech Recognition Oct 2, 2023 All Automatic Speech Recognition
— Unverified 0Evaluating Speech Synthesis by Training Recognizers on Synthetic Speech Oct 1, 2023 speech-recognition Speech Recognition
Code Code Available 1AfriSpeech-200: Pan-African Accented Speech Dataset for Clinical and General Domain ASR Sep 30, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0SLM: Bridge the thin gap between speech and text foundation models Sep 30, 2023 Instruction Following Language Modeling
— Unverified 0Active Learning Based Fine-Tuning Framework for Speech Emotion Recognition Sep 30, 2023 Active Learning Emotion Recognition
— Unverified 0The Gift of Feedback: Improving ASR Model Quality by Learning from User Corrections through Federated Learning Sep 29, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Contextual Biasing with the Knuth-Morris-Pratt Matching Algorithm Sep 29, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Enabling Differentially Private Federated Learning for Speech Recognition: Benchmarks, Adaptive Optimizers and Gradient Clipping Sep 29, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0RTFS-Net: Recurrent Time-Frequency Modelling for Efficient Audio-Visual Speech Separation Sep 29, 2023 Audio-Visual Speech Recognition speech-recognition
Code Code Available 1AV-CPL: Continuous Pseudo-Labeling for Audio-Visual Speech Recognition Sep 29, 2023 Audio-Visual Speech Recognition Automatic Speech Recognition
— Unverified 0SSHR: Leveraging Self-supervised Hierarchical Representations for Multilingual Automatic Speech Recognition Sep 29, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Enhancing Code-switching Speech Recognition with Interactive Language Biases Sep 29, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Wiki-En-ASR-Adapt: Large-scale synthetic dataset for English ASR Customization Sep 29, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Hierarchical Cross-Modality Knowledge Transfer with Sinkhorn Attention for CTC-based ASR Sep 28, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0PP-MeT: a Real-world Personalized Prompt based Meeting Transcription System Sep 28, 2023 Action Detection Activity Detection
— Unverified 0HyPoradise: An Open Baseline for Generative Speech Recognition with Large Language Models Sep 27, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Exploring Speech Recognition, Translation, and Understanding with Discrete Speech Units: A Comparative Study Sep 27, 2023 Automatic Speech Recognition Self-Supervised Learning
— Unverified 0Speech collage: code-switched audio generation by collaging monolingual corpora Sep 27, 2023 Audio Generation Automatic Speech Recognition
Code Code Available 1Does Single-channel Speech Enhancement Improve Keyword Spotting Accuracy? A Case Study Sep 27, 2023 Automatic Speech Recognition Keyword Spotting
— Unverified 0Generative Speech Recognition Error Correction with Large Language Models and Task-Activating Prompting Sep 27, 2023 In-Context Learning speech-recognition
— Unverified 0Learning from Flawed Data: Weakly Supervised Automatic Speech Recognition Sep 26, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Unsupervised Pre-Training for Vietnamese Automatic Speech Recognition in the HYKIST Project Sep 26, 2023 Automatic Speech Recognition speech-recognition
— Unverified 0Low-rank Adaptation of Large Language Model Rescoring for Parameter-Efficient Speech Recognition Sep 26, 2023 Language Modeling Language Modelling
— Unverified 0Updated Corpora and Benchmarks for Long-Form Speech Recognition Sep 26, 2023 Form speech-recognition
Code Code Available 1Segment-Level Vectorized Beam Search Based on Partially Autoregressive Inference Sep 26, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0