Audio-AdapterFusion: A Task-ID-free Approach for Efficient and Non-Destructive Multi-task Speech Recognition Oct 17, 2023 speech-recognition Speech Recognition
— Unverified 0Iterative Shallow Fusion of Backward Language Model for End-to-End Speech Recognition Oct 17, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Correction Focused Language Model Training for Speech Recognition Oct 17, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Generative error correction for code-switching speech recognition using large language models Oct 17, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Advanced accent/dialect identification and accentedness assessment with multi-embedding models and automatic speech recognition Oct 17, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Multi-stage Large Language Model Correction for Speech Recognition Oct 17, 2023 Language Modeling Language Modelling
— Unverified 0Optimized Tokenization for Transcribed Error Correction Oct 16, 2023 speech-recognition Speech Recognition
— Unverified 0Detecting Speech Abnormalities with a Perceiver-based Sequence Classifier that Leverages a Universal Speech Model Oct 16, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Personalization of CTC-based End-to-End Speech Recognition Using Pronunciation-Driven Subword Tokenization Oct 16, 2023 Automatic Speech Recognition speech-recognition
— Unverified 0End-to-end Multichannel Speaker-Attributed ASR: Speaker Guided Decoder and Input Feature Analysis Oct 16, 2023 Automatic Speech Recognition Decoder
— Unverified 0Large Vocabulary Spontaneous Speech Recognition for Tigrigna Oct 15, 2023 Automatic Speech Recognition Language Modeling
— Unverified 0Homophone Disambiguation Reveals Patterns of Context Mixing in Speech Transformers Oct 15, 2023 Decoder speech-recognition
Code Code Available 0Improved Contextual Recognition In Automatic Speech Recognition Systems By Semantic Lattice Rescoring Oct 14, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0SALM: Speech-augmented Language Model with In-context Learning for Speech Recognition and Translation Oct 13, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0On the Relevance of Phoneme Duration Variability of Synthesized Training Data for Automatic Speech Recognition Oct 12, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Fast Word Error Rate Estimation Using Self-Supervised Representations for Speech and Text Oct 12, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Adapting the adapters for code-switching in multilingual ASR Oct 11, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0No Pitch Left Behind: Addressing Gender Unbalance in Automatic Speech Recognition through Pitch Manipulation Oct 10, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Discriminative Speech Recognition Rescoring with Pre-trained Language Models Oct 10, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Acoustic Model Fusion for End-to-end Speech Recognition Oct 10, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Findings of the 2023 ML-SUPERB Challenge: Pre-Training and Evaluation over More Languages and Beyond Oct 9, 2023 Language Identification speech-recognition
— Unverified 0Improving End-to-End Speech Processing by Efficient Text Data Utilization with Latent Synthesis Oct 9, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Leveraging Multilingual Self-Supervised Pretrained Models for Sequence-to-Sequence End-to-End Spoken Language Understanding Oct 9, 2023 slot-filling Slot Filling
Code Code Available 0ed-cec: improving rare word recognition using asr postprocessing based on error detection and context-aware error correction Oct 8, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0End-to-End Lip Reading in Romanian with Cross-Lingual Domain Adaptation and Lateral Inhibition Oct 7, 2023 Domain Adaptation Lip Reading
— Unverified 0Spike-Triggered Contextual Biasing for End-to-End Mandarin Speech Recognition Oct 7, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0HuBERTopic: Enhancing Semantic Representation of HuBERT through Self-supervision Utilizing Topic Model Oct 6, 2023 Automatic Speech Recognition Representation Learning
— Unverified 0Dementia Assessment Using Mandarin Speech with an Attention-based Speech Recognition Encoder Oct 6, 2023 Alzheimer's Disease Detection speech-recognition
Code Code Available 0A privacy-preserving method using secret key for convolutional neural network-based speech classification Oct 6, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0The North System for Formosa Speech Recognition Challenge 2023 Oct 5, 2023 speech-recognition Speech Recognition
— Unverified 0DecoderLens: Layerwise Interpretation of Encoder-Decoder Transformers Oct 5, 2023 Decoder Logical Reasoning
Code Code Available 0Neural Language Model Pruning for Automatic Speech Recognition Oct 5, 2023 Automatic Speech Recognition Language Modeling
— Unverified 0Challenges and Insights: Exploring 3D Spatial Features and Complex Networks on the MISP Dataset Oct 5, 2023 speech-recognition Speech Recognition
— Unverified 0An Integrated Algorithm for Robust and Imperceptible Audio Adversarial Examples Oct 5, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0UniverSLU: Universal Spoken Language Understanding for Diverse Tasks with Natural Language Instructions Oct 4, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0ResidualTransformer: Residual Low-Rank Learning with Weight-Sharing for Transformer Layers Oct 3, 2023 speech-recognition Speech Recognition
— Unverified 0One model to rule them all ? Towards End-to-End Joint Speaker Diarization and Speech Recognition Oct 2, 2023 All Automatic Speech Recognition
— Unverified 0Active Learning Based Fine-Tuning Framework for Speech Emotion Recognition Sep 30, 2023 Active Learning Emotion Recognition
— Unverified 0SLM: Bridge the thin gap between speech and text foundation models Sep 30, 2023 Instruction Following Language Modeling
— Unverified 0AfriSpeech-200: Pan-African Accented Speech Dataset for Clinical and General Domain ASR Sep 30, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0AV-CPL: Continuous Pseudo-Labeling for Audio-Visual Speech Recognition Sep 29, 2023 Audio-Visual Speech Recognition Automatic Speech Recognition
— Unverified 0Enabling Differentially Private Federated Learning for Speech Recognition: Benchmarks, Adaptive Optimizers and Gradient Clipping Sep 29, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Contextual Biasing with the Knuth-Morris-Pratt Matching Algorithm Sep 29, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Enhancing Code-switching Speech Recognition with Interactive Language Biases Sep 29, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0The Gift of Feedback: Improving ASR Model Quality by Learning from User Corrections through Federated Learning Sep 29, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0SSHR: Leveraging Self-supervised Hierarchical Representations for Multilingual Automatic Speech Recognition Sep 29, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Wiki-En-ASR-Adapt: Large-scale synthetic dataset for English ASR Customization Sep 29, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0PP-MeT: a Real-world Personalized Prompt based Meeting Transcription System Sep 28, 2023 Action Detection Activity Detection
— Unverified 0Hierarchical Cross-Modality Knowledge Transfer with Sinkhorn Attention for CTC-based ASR Sep 28, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Generative Speech Recognition Error Correction with Large Language Models and Task-Activating Prompting Sep 27, 2023 In-Context Learning speech-recognition
— Unverified 0